CN116996488A - Interactive multimedia information playing method and device, medium and electronic equipment - Google Patents

Interactive multimedia information playing method and device, medium and electronic equipment Download PDF

Info

Publication number
CN116996488A
CN116996488A CN202211157488.XA CN202211157488A CN116996488A CN 116996488 A CN116996488 A CN 116996488A CN 202211157488 A CN202211157488 A CN 202211157488A CN 116996488 A CN116996488 A CN 116996488A
Authority
CN
China
Prior art keywords
multimedia
target
multimedia information
code stream
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211157488.XA
Other languages
Chinese (zh)
Inventor
李大龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Beijing Co Ltd
Original Assignee
Tencent Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Beijing Co Ltd filed Critical Tencent Technology Beijing Co Ltd
Priority to CN202211157488.XA priority Critical patent/CN116996488A/en
Publication of CN116996488A publication Critical patent/CN116996488A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/22Parsing or analysis of headers

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The application discloses a playing method, a device, a medium and electronic equipment of interactive multimedia information, wherein the method comprises the following steps: generating description information of a target multimedia fragment in the target multimedia information code stream according to the interactive operation when the interactive operation for the current multimedia fragment in the current multimedia information code stream is detected; the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time; searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; and acquiring the target multimedia fragment according to the target access information for playing. The application can realize smooth switching between different code streams in the interactive video playing process and effectively avoid the occurrence of the clamping phenomenon in the interactive process.

Description

Interactive multimedia information playing method and device, medium and electronic equipment
Technical Field
The application belongs to the technical field of multimedia, and particularly relates to a playing method, device, medium and electronic equipment of interactive multimedia information.
Background
With the rapid development of internet technology, multimedia information propagated by means of the internet is also new multimedia information, such as interactive video, of interactive multimedia information in the continuous development, and the video watching object is allowed to interact with the video in the video playing process, so that the immersion of the video watching object on the watched video can be enhanced. The playing of such interactive video is typically a clip-on playing, i.e. when playing one video clip, an interactive action is triggered, and the next played video clip is determined according to the specific interactive action. However, when the video clips are frequently switched, the existing interactive video playing method is easy to cause the phenomenon of video playing clamping.
It should be noted that the information disclosed in the above background section is only for enhancing understanding of the background of the application and thus may include information that does not form the prior art that is already known to those of ordinary skill in the art.
Disclosure of Invention
The application aims to provide a playing method, a playing device, a medium and electronic equipment of interactive multimedia information, so as to optimize the problem that video playing is easy to be blocked in the process of playing interactive video in the related technology.
Other features and advantages of the application will be apparent from the following detailed description, or may be learned by the practice of the application.
According to an aspect of an embodiment of the present application, there is provided a method for playing interactive multimedia information, including:
generating description information of a target multimedia fragment in a target multimedia information code stream according to the interactive operation when the interactive operation for the current multimedia fragment in the current multimedia information code stream is detected; the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time;
searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; the manifest file includes access information for each multimedia segment of the multimedia information;
and acquiring the target multimedia fragment according to the target access information for playing.
According to an aspect of an embodiment of the present application, there is provided a playing device for interactive multimedia information, including:
The interactive operation module is used for generating description information of a target multimedia fragment in the target multimedia information code stream according to the interactive operation when the interactive operation of the current multimedia fragment in the current multimedia information code stream is detected; the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time;
the target access information determining module is used for searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; the manifest file includes access information for each multimedia segment of the multimedia information;
and the target multimedia fragment playing module is used for acquiring the target multimedia fragment according to the target access information to play.
In one embodiment of the present application, the description information of the target multimedia segment includes a segment identifier of the target multimedia segment and a multimedia information code stream identifier corresponding to the target multimedia segment; the manifest file includes a first manifest file and a second manifest file; the target access information determining module includes:
A first manifest file searching unit, configured to search a second manifest file corresponding to the target multimedia segment in a first manifest file corresponding to the multimedia information according to a multimedia information code stream identifier corresponding to the target multimedia segment; the first manifest file comprises a corresponding relation between a multimedia information code stream identifier and a second manifest file;
a second manifest file searching unit, configured to search, according to the segment identifier of the target multimedia segment, target access information of the target multimedia segment in the second manifest file; the second manifest file includes a correspondence between a fragment identification of the multimedia fragment and access information of the multimedia fragment.
In one embodiment of the application, the apparatus further comprises:
the multimedia information acquisition module is used for acquiring a plurality of multimedia information code streams corresponding to the multimedia information;
the multimedia information analysis module is used for analyzing each path of multimedia information code stream to obtain a plurality of multimedia fragments included in each path of multimedia information code stream; wherein, the number of the multimedia fragments included in each path of multimedia information code stream is the same;
a second manifest file generating module, configured to generate, for each multimedia information code stream, the second manifest file according to the segment identifiers of each multimedia segment of the multimedia information code stream and the access information of each multimedia segment;
And the first list file generating module is used for generating the first list file according to the second list files corresponding to the multimedia information code streams and the multimedia information code stream identifiers corresponding to the multimedia information code streams.
In one embodiment of the application, the apparatus further comprises:
the ordering module is used for ordering a plurality of multimedia fragments of each path of multimedia information code stream according to the playing time sequence;
the segment identification generation module is used for generating segment identifications of the multimedia segments according to the multimedia information code stream identifications of the multimedia information code streams and the sequencing numbers of the multimedia segments; wherein, in different multimedia information code streams, the playing time length of the multimedia fragments corresponding to the same sequencing number is the same.
In one embodiment of the present application, the multimedia information acquisition module is specifically configured to:
acquiring multi-channel coded data corresponding to the multimedia information from a server;
and analyzing the multi-path coding data of the multimedia information to obtain multi-path multimedia information code streams, wherein one path of coding data corresponds to one path of multimedia information code stream.
In one embodiment of the present application, the multimedia information acquisition module is specifically configured to:
Acquiring encoded data of the multimedia information from a server;
and analyzing the encoded data of the multimedia information to obtain a multi-path multimedia information code stream.
In one embodiment of the present application, the target multimedia clip playing module includes:
a target coded data acquisition unit, configured to acquire target coded data of the target multimedia segment according to the target access information;
the decoding unit is used for performing decoding operation on the target coded data to obtain target decoded data corresponding to the target multimedia fragment;
and the rendering unit is used for rendering and displaying the target decoding data so as to play the target multimedia fragment.
According to an aspect of the embodiments of the present application, there is provided a computer-readable medium having stored thereon a computer program which, when executed by a processor, implements a method of playing interactive multimedia information as in the above technical solution.
According to an aspect of an embodiment of the present application, there is provided an electronic apparatus including: a processor; and a memory for storing executable instructions of the processor; wherein execution of the executable instructions by the processor causes the electronic device to perform the method of playing interactive multimedia information as in the above technical solution.
According to an aspect of embodiments of the present application, there is provided a computer program product or computer program comprising computer instructions stored in a computer readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device performs the playing method of the interactive multimedia information as in the above technical solution.
In the technical scheme provided by the embodiment of the application, when the interactive operation of the current multimedia fragment in the current multimedia information code stream is detected, the description information of the target multimedia fragment in the target multimedia information code stream is generated according to the interactive operation; searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; and finally, acquiring the target multimedia fragment according to the target access information for playing. Because the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time, the technical scheme of the application can realize the switching between different code streams in the process of playing the multimedia information. In the code stream switching process, because the manifest file comprises the access information of each multimedia segment of the multimedia information, compared with the prior art that the terminal equipment needs to initiate a request to the server again to acquire the access information of the target multimedia, the method and the device can quickly acquire the access information of the target multimedia segment according to the manifest file, so that the playing of the multimedia information is smoother and smoother in the code stream switching process, and the phenomenon of blocking in the interaction process is effectively avoided or reduced.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the application as claimed.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description, serve to explain the principles of the application. It is evident that the drawings in the following description are only some embodiments of the present application and that other drawings may be obtained from these drawings without inventive effort for a person of ordinary skill in the art.
Fig. 1 schematically shows a block diagram of an exemplary system architecture to which the technical solution of the present application is applied.
Fig. 2 schematically shows a schematic diagram of an application scenario of the technical solution of the present application.
Fig. 3 schematically illustrates a flowchart of a method for playing interactive multimedia information according to an embodiment of the present application.
Fig. 4 schematically illustrates a multi-path multimedia information code stream corresponding to multimedia information provided in an embodiment of the present application.
Fig. 5 schematically illustrates a multi-path multimedia information code stream corresponding to multimedia information provided in an embodiment of the present application.
Fig. 6 schematically illustrates a flowchart of a method for playing interactive multimedia information according to an embodiment of the present application.
Fig. 7 schematically shows a schematic diagram of a manifest file provided by an embodiment of the application.
Fig. 8 schematically illustrates a block diagram of an interactive multimedia information playing system according to an embodiment of the present application.
Fig. 9 schematically shows a block diagram of an interactive multimedia information playing device according to an embodiment of the present application.
Fig. 10 schematically shows a block diagram of a computer system of an electronic device for implementing an embodiment of the application.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. However, the exemplary embodiments may be embodied in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the example embodiments to those skilled in the art.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided to give a thorough understanding of embodiments of the application. One skilled in the relevant art will recognize, however, that the application may be practiced without one or more of the specific details, or with other methods, components, devices, steps, etc. In other instances, well-known methods, devices, implementations, or operations are not shown or described in detail to avoid obscuring aspects of the application.
The block diagrams depicted in the figures are merely functional entities and do not necessarily correspond to physically separate entities. That is, the functional entities may be implemented in software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.
The flow diagrams depicted in the figures are exemplary only, and do not necessarily include all of the elements and operations/steps, nor must they be performed in the order described. For example, some operations/steps may be decomposed, and some operations/steps may be combined or partially combined, so that the order of actual execution may be changed according to actual situations.
Fig. 1 schematically shows a block diagram of an exemplary system architecture to which the technical solution of the present application is applied.
As shown in fig. 1, system architecture 100 may include a terminal device 110, a network 120, and a server 130. Terminal device 110 may include a cell phone, a computer, an intelligent voice interaction device, an intelligent home appliance, a vehicle terminal, an aircraft, and the like. The server 130 may be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, or a cloud server providing cloud computing services. Network 120 may be a communication medium of various connection types capable of providing a communication link between terminal device 110 and server 130, and may be, for example, a wired communication link or a wireless communication link.
The system architecture in embodiments of the present application may have any number of terminal devices, networks, and servers, as desired for implementation. For example, the server 130 may be a server group composed of a plurality of server devices. In addition, the technical solution provided in the embodiment of the present application may be applied to the terminal device 110, or may be applied to the server 130, or may be implemented by the terminal device 110 and the server 130 together, which is not limited in particular.
For example, the technical scheme provided by the embodiment of the present application is implemented by the terminal device 110. The terminal device 110 generates description information of a target multimedia segment in the target multimedia information code stream according to the interactive operation when detecting the interactive operation for the current multimedia segment in the current multimedia information code stream. Wherein the multimedia information includes multiple multimedia information code streams, the terminal device 110 may obtain a desired multimedia information code stream from the server 130 through the network 120. The current multimedia information code stream and the target multimedia information code stream are two different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time. Then, the terminal device 110 searches the target access information of the target multimedia segment in the manifest file corresponding to the multimedia information according to the description information of the target multimedia segment; the manifest file includes access information for each multimedia segment of the multimedia information. Finally, the terminal device 110 obtains the target multimedia fragment according to the target access information for playing.
The implementation process of the technical scheme of the application is illustrated by taking multimedia information as video as an example. As shown in fig. 2, the video provided by the embodiment of the present application is obtained by shooting the same shooting object P (the same shooting object may also refer to the same shooting scene, which will not be described in detail later), and different shooting angles are used to shoot the shooting object P in the shooting process, thereby generating a multi-path video code stream. As shown in fig. 2, a shooting angle is a shooting position, and the video in this embodiment includes 3 paths of video code streams, which are a video code stream s0 corresponding to a shooting position 0, a video code stream s1 corresponding to a shooting position 1, and a video code stream s2 corresponding to a shooting position 2.
Suppose that the video is played through a cell phone. When the video code stream s0 corresponding to the camera position 0 is currently played, the object generates an interactive operation on the currently played video through the mobile phone at the time T1, and the camera position angle of the video is required to be adjusted to be the camera position 1, then it can be determined according to the interactive operation, the current video segment is the video segment s0-1 in the video code stream s0 corresponding to the camera position 0, the target video segment is the video segment s1-2 in the video code stream s1 corresponding to the camera position 1, and further the description information of the target video segment s1-2 is generated, and is used for describing the target video segment s1-2 to determine which video segment is specifically the target video segment, for example, the description information is the segment identifier "s1-2" of the target video segment. Obviously, the current video segment s0-1 and the target video segment s1-2 are consecutive in time.
And then, the mobile phone searches the access information of the target multimedia fragment in the manifest file according to the description information of the target multimedia fragment. The mobile phone stores a manifest file corresponding to the video, and the manifest file can be provided by an Application program (APP) for playing the video. The manifest file includes access information of video clips in each video code stream of the video, where the access information of the video clips is used to obtain corresponding video clips, and for example, the access information of the video clips may be a storage address of the video clips.
Finally, the mobile phone acquires a target video clip to play according to the determined target access information, namely, the played video is switched from the machine position angle 0 to the machine position angle 1.
For example, as shown in fig. 2, assuming that the object wants to switch the machine position angle 1 to the machine position angle 2 at the time T2, after detecting the interaction operation, the mobile phone acquires the target video segment s2-5 to play through the above operation again, that is, the playing method of the interactive multimedia information provided by the embodiment of the present application.
The technical scheme of the application can be suitable for playing any video information, such as online video program playing, can be suitable for high-frequency interaction, can continuously respond to object operation and provide application scenes with seamless quality change, can be widely used for brand advertisement video, wide-angle or panoramic video, application scenes with continuously enlarged or reduced picture resolution, continuously increased or reduced video frame rate and the like, and brings about video interaction experience with more impact force. For example, when playing a certain commodity brand display video, the object may zoom in or out the video image frame by clicking "+", "-" keys on the keyboard, or the volume "+", "-" keys on the mobile phone or the tablet, so as to achieve the effect of viewing the commodity display video in more detail or globally.
The following describes the playing of the interactive multimedia information provided by the present application in detail with reference to the specific embodiments.
Fig. 3 schematically illustrates a flowchart of a method for playing interactive multimedia information according to an embodiment of the present application, as shown in fig. 3, the method includes steps 310 to 330, specifically as follows:
step 310, when detecting the interactive operation for the current multimedia segment in the current multimedia information code stream, generating the description information of the target multimedia segment in the target multimedia information code stream according to the interactive operation; the current multimedia information code stream and the target multimedia information code stream are two different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time.
In the embodiment of the application, the multimedia information comprises a plurality of multimedia information code streams, and one multimedia information code stream displays the multimedia information from a certain aspect, so that the multimedia information corresponding to different multimedia information code streams is the same.
In one embodiment of the present application, the multi-path multimedia information code streams respectively display multimedia information from different quality levels, and if the multimedia information is video, the quality levels of the multimedia information can be represented from multiple aspects such as image resolution, image frame rate, texture fidelity, sharpness, etc. By way of example, the multimedia information may include a fluent multimedia information stream, a high definition multimedia information stream, an ultra definition multimedia information stream, and so forth.
In one embodiment of the present application, the present application employs layered coding techniques to provide multiple multimedia information code streams of different quality levels for the same multimedia information. For example, taking video as an example, a scalable coding (Scalable Video Coding, SVC) technology is adopted as an extension coding algorithm tool under a video compression framework, and the technology can provide a plurality of output code streams with variable quality for an input image sequence, wherein the variable quality refers to different quality grades proposed in the foregoing, for example, for dimensions of image resolution, image frame rate, texture fidelity, definition and the like, the different quality grades can be a change of one dimension or a change of a plurality of dimension organizations. Fig. 4 schematically illustrates a schematic diagram of a multi-path multimedia information code stream corresponding to multimedia information according to an embodiment of the present application. As shown in fig. 4, reference layer 0 represents the multimedia information stream 410 of the lowest quality level, enhancement layer 1 represents the multimedia information stream 420 of increased frame rate after reference layer 0 is superimposed, and enhancement layer 2 represents the multimedia information stream 430 of increased frame rate and image resolution after reference layer 0 and enhancement layer 1 are superimposed.
In one embodiment of the present application, the multi-path multimedia information code streams respectively display the multimedia information from different machine angles, and the machine angles refer to angles of the image capturing device capturing the multimedia information. The multimedia information may include a first-machine-angle multimedia information code stream, a second-machine-angle multimedia information code stream, a third-machine-angle multimedia information code stream, and so on. Fig. 5 schematically illustrates a schematic diagram of a multi-path multimedia information code stream corresponding to multimedia information according to an embodiment of the present application. As shown in fig. 5, the same shooting object P is shot by 3 shooting machine positions, so that 3 paths of multimedia information code streams can be obtained: the multimedia information code stream s0 corresponding to the camera position 0, the multimedia information code stream s1 corresponding to the camera position 1 and the multimedia information code stream s2 corresponding to the camera position 2.
In the embodiment of the application, the current multimedia information code stream refers to the multimedia information code stream played at the current moment, and is one of multiple paths of multimedia information code streams included in the multimedia information, which may be the default multimedia information code stream corresponding to the multimedia information or the multimedia information code stream determined after the last interactive operation. The one-path multimedia information code stream comprises a plurality of multimedia fragments, and then the current multimedia fragment of the multimedia information is one multimedia fragment in the multimedia information code stream played at the current moment.
The interactive operation for the current multimedia segment refers to the interactive operation that the object performs through the terminal device playing the multimedia information, and switches the current multimedia information code stream to the target multimedia information code stream, for example, the object performs the interactive operation through the screen gesture, the UI (user interface) interface virtual key, the device gyroscope rotation, and the like of the terminal device. For example, as shown in fig. 2, the object may wish to switch the video viewing angle from camera position 0 to camera position 1 at time T1, and the object may perform interactive operation through a virtual key set on the video playing interface in the smart phone, or may perform interactive operation by rotating the smart phone.
When the interactive operation for the current multimedia segment is detected, the object is indicated to need to switch the multimedia information code stream, and at the moment, the description information of the target multimedia segment is generated according to the interactive operation, wherein the description information is used for determining which target multimedia segment needs to be switched. In the embodiment of the application, the interactive operation is an operation generated by switching the multimedia information code streams, so that the current multimedia fragment and the target multimedia fragment are fragments in different multimedia information code streams, and meanwhile, the current multimedia fragment and the target multimedia fragment are continuous in time, thereby ensuring continuous playing of the multimedia information.
In one embodiment of the present application, when the description information of the target multimedia segment is generated according to the interactive operation, the target multimedia code stream where the target multimedia segment is located, that is, the target multimedia code stream where the interactive operation indicates to switch, may be determined first according to the interactive operation. Then, a multimedia segment in the target multimedia information code stream, which is continuous with the current multimedia segment in playing time, is used as a target multimedia segment, for example, a reference multimedia segment which is the same as the current multimedia segment in playing time is found in the target multimedia information code stream, and then a multimedia segment adjacent to and located behind the reference multimedia segment in the target multimedia information code stream is used as a target multimedia segment.
Step 320, searching target access information of the target multimedia segment in a manifest file corresponding to the multimedia information according to the description information of the target multimedia segment; the manifest file includes access information for each multimedia segment of the multimedia information.
Specifically, the manifest file of the multimedia information records access information of the multimedia segments included in each multimedia information code stream of the multimedia information, where the access information of the multimedia segments is information required for obtaining the multimedia segments, and for example, the access information may be a storage address of the multimedia segments, ID information of the multimedia segments, and the like. For example, the corresponding relation between the segment identifier of the multimedia segment and the storage address of the multimedia segment is recorded in the manifest file, and then the storage address of the target multimedia segment can be obtained by searching for a match in the manifest file according to the segment identifier in the target multimedia description information, where the storage address is the target access information of the target multimedia segment.
And 330, acquiring a target multimedia fragment according to the target access information for playing.
Specifically, after the target access information is determined, the corresponding target multimedia fragment can be obtained according to the target access information, and then the target multimedia fragment is played, so that the process of switching from the current multimedia fragment of the current multimedia information code stream to the target multimedia fragment of the target multimedia information code stream is realized.
In the technical scheme provided by the embodiment of the application, when the interactive operation of the current multimedia fragment in the current multimedia information code stream is detected, the description information of the target multimedia fragment in the target multimedia information code stream is generated according to the interactive operation; searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; and finally, acquiring the target multimedia fragment according to the target access information for playing. Because the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time, the technical scheme of the application can realize the switching between different code streams in the process of playing the multimedia information. In the code stream switching process, because the manifest file comprises the access information of each multimedia segment of the multimedia information, compared with the prior art that the terminal equipment needs to initiate a request to the server again to acquire the access information of the target multimedia, the method and the device can quickly acquire the access information of the target multimedia segment according to the manifest file, so that the playing of the multimedia information is smoother and smoother in the code stream switching process, and the phenomenon of blocking in the interaction process is effectively avoided or reduced.
Fig. 6 schematically shows a flowchart of a method for playing interactive multimedia information according to an embodiment of the present application, which is a further refinement of the above embodiment. As shown in fig. 6, the method includes steps 610 to 660, which are specifically as follows:
step 610, when detecting an interactive operation for a current multimedia segment in a current multimedia information code stream, generating description information of a target multimedia segment in a target multimedia information code stream according to the interactive operation; the description information of the target multimedia fragment comprises a fragment identifier of the target multimedia fragment and a multimedia information code stream identifier corresponding to the target multimedia fragment.
Specifically, the multimedia information code stream identifier corresponding to the target multimedia segment is an identifier of the target multimedia information code stream in which the target multimedia segment is located, and the multimedia information code stream identifier is generated according to the identifier of the switched target multimedia information code stream indicated by the interaction operation. For example, as shown in fig. 2, when the interworking is to switch the camera position 0 to the camera position 1, it may be determined that the multimedia information code stream corresponding to the target multimedia clip is identified as "s1".
The segment identification of the target multimedia segment is used to identify the location of the target multimedia segment in the target multimedia information code stream. For example, the multimedia segments included in the multimedia information code stream may be numbered according to the play time sequence, and then the numbers are used as segment identifiers of the multimedia segments. When the segment identification of the target multimedia segment is generated according to the interactive operation, firstly, determining the segment identification of the current multimedia segment, namely determining the number of the current multimedia segment, and then adding 1 to the number to obtain the segment identification of the target multimedia segment. Illustratively, the segment identifier of the current multimedia segment is 1, and the segment identifier of the target multimedia segment is 2.
In the first embodiment of the present application, the number of the multimedia segment in the multimedia information code stream and the identification of the multimedia information code stream may be used together as the segment identification of the multimedia segment, for example, the segment identification of the multimedia segment is in the form of "multimedia information code stream identification-number", and the segment identification of the multimedia segment is s0-0, which indicates the multimedia segment with the number 0 in the multimedia information code stream s 0. When the segment identification of the target multimedia is generated, firstly determining the code stream identification of the target multimedia information, then adding 1 to the number of the number represented in the segment identification of the current multimedia segment to obtain the target number of the target multimedia segment, and further generating the segment identification of the target multimedia segment according to the code stream identification of the target multimedia information and the target number. Illustratively, the segment identifier of the current multimedia segment is s0-1, the target multimedia information code stream identifier is s1, and the segment identifier of the target multimedia segment is s1-2.
Step 620, searching a second manifest file corresponding to the target multimedia segment in the first manifest file corresponding to the multimedia information according to the multimedia information code stream identifier corresponding to the target multimedia segment; the first manifest file includes a correspondence between multimedia information stream identifications and the second manifest file.
Step 630, searching target access information of the target multimedia fragment in the second manifest file according to the fragment identification of the target multimedia fragment; the second manifest file includes a correspondence between a fragment identification of the multimedia fragment and access information of the multimedia fragment.
Specifically, the first manifest file defines a correspondence between the multimedia information code stream identifier and the second manifest file, and the second manifest file defines a correspondence between the fragment identifier of the multimedia fragment and the access information of the multimedia fragment. When searching the list file according to the description information of the target multimedia fragment, firstly searching the first list file according to the multimedia information code stream identification in the description information, and determining a second list file corresponding to the target multimedia fragment, namely determining a second list file for recording the relation between the fragment identification and the access information of each multimedia fragment in the target multimedia information code stream. And then searching in the second manifest file according to the segment identification of the target multimedia segment to obtain the target access information of the target multimedia segment.
By way of example, fig. 7 schematically illustrates a schematic diagram of a manifest file provided by an embodiment of the present application. As shown in fig. 7, the manifest files provided in this embodiment include a parent manifest file 710 and a child manifest file 720, where the parent manifest file 710 is a first manifest file, and the child manifest file 720 is a second manifest file. The master manifest file 710 is mainly used for describing how many sub-manifest files 720 of the next level are included, and the quality level (i.e., the quality level in fig. 7) corresponding to each sub-manifest file 720, where the master manifest file 710 describes each included sub-manifest file 720 by recording the address of each sub-manifest file 720. As shown in fig. 7, "quality level 0", "quality level 1", and "quality level 2" are used to indicate multimedia information stream identifications, "address of sub-manifest file 0", "address of sub-manifest file 1", and "address of sub-manifest file 2" are used to indicate storage addresses of sub-manifest files 720 corresponding to the corresponding multimedia information stream identifications.
The sub-manifest file 720 shown in fig. 7 is the detailed content of the sub-manifest file 1 corresponding to the quality file 1. The submanifest file 720 is used for specifically describing specific file data piece information contained in the multimedia information code stream under the corresponding quality level, and the file data piece corresponds to the multimedia fragment. The file data piece information at least contains description information directly used for a player (a component or a functional component for playing multimedia information), for example, the sub-manifest file 720 includes: the CDN (Content Delivery Network ) server URL (Uniform Resource Locator, uniform resource locator) of the file data piece, that is, the storage address of the multimedia clip, is used by the player to obtain the data of the multimedia clip to the corresponding address. By way of example, "file data slice 0", "file data slice 1", "file data slice 2", etc. in fig. 7 represent the slice identifications of the multimedia slices. The "duration" in fig. 7 indicates a play duration of a corresponding multimedia clip, and play durations of different multimedia clips may be different. The "address" in fig. 7 represents a storage address of the corresponding multimedia clip.
For example, taking the manifest file shown in fig. 7 as an example, assume that the segment identifier of the target multimedia segment is s1-2, and the multimedia information code stream corresponding to the target multimedia segment is s1. First, searching in the master list file 710 according to the multimedia information code stream identifier s1, and finding the sub list file 1 address corresponding to the quality file 1. And then searching in the sub-manifest file 720 according to the segment identifier s1-2 to obtain a storage address corresponding to the file data segment 2, namely the target access information of the target multimedia segment s 1-2.
In one embodiment of the present application, the manifest files may describe different levels of manifest files in a nested manner according to actual application needs, i.e., the manifest files may include not only the first manifest file and the second manifest file, but also a third manifest file, a fourth manifest file, etc. of a further level. The embodiment of the application does not limit the hierarchy and the number of the list files.
In one embodiment of the present application, before searching the manifest file according to the description information, the method further includes a manifest file generating step, which specifically includes: acquiring a plurality of paths of multimedia information code streams corresponding to the multimedia information; analyzing each path of multimedia information code stream to obtain a plurality of multimedia fragments contained in each path of multimedia information code stream; for each path of multimedia information code stream, generating a second manifest file according to the segment identification of each multimedia segment of the multimedia information code stream and the access information of each multimedia segment; and generating a first list file according to the second list file corresponding to each path of multimedia information code stream and the multimedia information code stream identifier corresponding to each path of multimedia information code stream.
In the embodiment of the present application, the step of generating the manifest file may be performed by the terminal device, or the manifest file may be sent to the terminal device after the manifest file is generated by the server. The generation process of the manifest file in the present application will be described below with the server as the execution subject.
Firstly, multi-path multimedia information code streams corresponding to multimedia information are obtained, and a multimedia information code stream identifier is given to each path of multimedia information code stream. And then analyzing each path of multimedia information code stream, wherein for each path of multimedia information code stream, a plurality of multimedia fragments can be obtained through analysis, and the number of the multimedia fragments corresponding to each path of multimedia information code stream is the same. And then, a segment identifier is given to each multimedia segment in each path of multimedia information code stream, and a second list file is generated according to the segment identifier and the storage address of each multimedia segment in each path of multimedia information code stream. And finally, generating a first list file according to the second list file of each multimedia information code stream and the multimedia information code stream identification.
In one embodiment of the present application, when the execution body of the step of generating the manifest file is a terminal device, a process of parsing each multimedia information code stream to generate the second manifest file and the first manifest file is the same as when the server is the execution body, which is not repeated herein. The difference is that when the execution subject is a terminal device, the terminal device needs to acquire a multi-channel multimedia information code stream of multimedia series information from a server.
In one embodiment of the application, the multi-path multimedia information code stream of the multimedia information is uniformly encoded into one path of encoded data, and the terminal equipment obtains the encoded data from the server and then analyzes the encoded data to obtain the multi-path multimedia information code stream.
In one embodiment of the application, multiple multimedia information code streams of the multimedia information are respectively and independently encoded and transmitted, one path of multimedia information code stream is encoded to obtain one path of encoded data, and the terminal equipment obtains the multiple paths of encoded data from the server and then respectively analyzes the multiple paths of encoded data to obtain multiple paths of multimedia information code streams. When the multi-path multimedia information code streams are independently encoded and transmitted, each path of multimedia information code stream is encoded by using a standard encoding tool, and an SVC algorithm tool is not required.
In one embodiment of the present application, the process of assigning segment identifiers to multimedia segments includes: sequencing a plurality of multimedia fragments of each path of multimedia information code stream according to the playing time sequence; generating segment identifiers of the multimedia segments according to the sequence numbers of the multimedia information code streams and the multimedia segments; wherein, in different multimedia information code streams, the playing time length of the multimedia fragments corresponding to the same sequencing number is the same.
Specifically, for a plurality of multimedia fragments corresponding to each path of multimedia information code stream, firstly, the plurality of multimedia fragments are ordered according to the playing time sequence, the multimedia fragments played first are arranged in front, and the multimedia fragments played later are arranged in back. Because the number of the multimedia fragments corresponding to each path of multimedia information code stream is equal, for the same sequence number, each path of multimedia information code stream has the corresponding multimedia fragment. In order to ensure smooth transition of the multimedia information played during the code stream switching, the playing time lengths of the multimedia fragments corresponding to the same sequence number are the same in each path of multimedia information code stream, for example, the playing time lengths of the multimedia fragments with sequence number 0 are all 2s. However, in the multiple multimedia segments corresponding to the same multimedia information code stream, the playing time lengths of the multimedia segments corresponding to different ordering codes may be the same or different, for example, in the multimedia information code stream s0, the playing time length of the multimedia segment with the ordering number 0 is 2s, and the playing time length of the multimedia segment with the ordering number 1 is 2.5s.
For example, as shown in fig. 2, the video code stream s0 corresponding to the camera position 0 includes video clips s0-0 to s0-7, the video code stream s1 corresponding to the camera position 1 includes video clips s1-0 to s1-7, and the video code stream s2 corresponding to the camera position 2 includes video clips s2-0 to s2-7. It should be noted that, the segment identifier of the video segment shown in fig. 2 is a "video code stream identifier-number. Coding format" style, that is, "fmp4" is a coding format of the video segment, and in this embodiment, for convenience of description, the segment identifier of the video segment is abbreviated as a "video code stream identifier-number" style, so that the ". Fmp4" field in the segment identifier is omitted, and the description is omitted.
The same numbered multimedia segments in different multimedia information code streams have the same playing time length, for example, the video segments s0-0 in the video code stream s0, the video segments s1-0 in the video code stream s1, and the video segments s2-0 in the video code stream s2 all have the same playing time length, for example, all are 2s. The multimedia clips with different numbers in the same multimedia information code stream may have different playing durations, for example, the video clip s0-0, the video clip s0-1, the video clip s0-2, etc. in the video code stream s0 may correspond to different playing durations.
After the multimedia fragments are sequenced, generating fragment identifiers of the multimedia fragments according to the multimedia information code stream identifiers corresponding to the multimedia fragments and sequencing numbers of the multimedia fragments. For example, the sequence number is 0, the multimedia information code stream is identified as s0, and the segment of the multimedia segment is identified as s0-0.
Step 640, obtaining target coding data of the target multimedia segment according to the target access information.
Specifically, the target access information is a storage address of the target multimedia segment, and based on the target access information, target encoding data of the target multimedia segment can be obtained.
In one embodiment of the application, the terminal device can download the encoded data of the multi-path multimedia information code stream from the server in advance, so that when the target encoded data is acquired, the target access information points to the local storage address of the terminal device, thereby accelerating the speed of acquiring the target encoded data by the terminal device and further improving the smoothness of the code stream switching.
In one embodiment of the present application, the present application does not limit the encoding format of the multimedia information, and may be MP4, fMP4, MKV, etc.
In one embodiment of the present application, the terminal device may first request to the server to obtain the target multimedia code stream according to the multimedia information code stream identifier corresponding to the target multimedia segment, and then extract the target encoded data of the target multimedia segment corresponding to the target access information from the target multimedia code stream. That is, the terminal device does not receive the multi-channel multimedia information code stream transmitted by the server at the same time, but requests the corresponding multimedia information code stream from the server according to the current need, so as to avoid the waste of storage resources caused by receiving the multimedia information code stream which is not needed at the current time.
And 650, performing decoding operation on the target coded data to obtain target decoded data corresponding to the target multimedia segment.
Specifically, decoding is the inverse process of encoding, and is used to restore the target encoded data to the state before encoding, and decode the audio/video compressed code stream into the original image and sound data, so as to obtain the target decoded data. The specific decoding method is determined according to the corresponding encoding rule, and the application is not limited thereto, for example, the image in the target decoding Data may be original Data (Raw Data) in the form of YUV, RGB, etc., and the sound may be original Data in the form of PCM.
Step 660, rendering and displaying the target decoding data to play the target multimedia segment.
Specifically, the rendering is to convert the target decoding data into an image actually drawn on a display screen for display, so as to realize the playing of the target multimedia segment.
The embodiment of the application realizes the switching play of different multimedia information code streams, provides the interactive multimedia information with variable quality, queries the target access information of the target multimedia fragments through the first list file and the second list file, effectively improves the speed of acquiring the target multimedia fragments, ensures smooth and seamless play among the multimedia fragments in the interaction process, and improves the play experience of the multimedia information.
The implementation process of the technical scheme of the present application is illustrated below by taking multimedia information as video as an example. As shown in fig. 5, the video in the embodiment of the present application is a video obtained by photographing the same photographic object (or a photographic scene) by 3 camera positions, for example, the wide-angle view corresponding to camera position 0 is-45 ° to 45 °, the wide-angle view corresponding to camera position 1 is-90 ° to 0 °, and the wide-angle view corresponding to camera position 2 is 0 ° to 90 °. Thus, the video includes 3 video streams: video stream s0 corresponding to camera bit 0, video stream s1 corresponding to camera bit 1, and video stream s2 corresponding to camera bit 2. The present embodiment records the information of the 3-channel video code stream by using the manifest file shown in fig. 7, where the quality file in fig. 7 corresponds to the camera position.
Based on the video information, as shown in fig. 2, the video playing flow of the present embodiment assumes that the video starts playing from time T0, and defaults to taking camera position 0 as the initial viewing angle. And in the time period of T0-T1, playing the video content corresponding to the camera position 0.
At time T1, the object performs interactive operation on the currently played video clip s0-1 through the terminal equipment, adjusts the viewing position angle, and performs interactive operation through screen gestures, UI virtual keys, equipment gyroscope rotation and the like. Since the video code stream of each camera bit is recorded and described in the main manifest file 710 in the corresponding coverage view angle range, at this time, the angle range desired to be adjusted is found to be the camera bit 1 according to the main manifest file 710, and the storage address of the sub-manifest file 1 is determined. Because the video code streams corresponding to all the shooting machine positions are strictly aligned on the time axis, after the time T1, the player can ensure that the played audio and video content is completely seamless and continuous only by sequentially playing the video slice files s1-2 with the next serial numbers (one slice file corresponds to one video clip). In the video code stream s1 corresponding to the camera bit 1, the video slice file playing address of the serial number 2 is recorded in the sub-manifest file 1 to which the video slice file playing address belongs, and the player can directly acquire and switch.
And playing the video content corresponding to the camera position 1 in the time period of T1-T2. At the time T2, the object carries out interactive operation on the currently played video clip s1-4 through the terminal equipment, and adjusts the viewing machine position angle. Assuming that the angle desired to be adjusted at this time is camera position 2, then, according to the same method as in the previous step, the next video slice file s2-5 is found in the sub-manifest file 2 corresponding to camera position 2, and seamless smooth playing is performed.
And playing the video content corresponding to the camera position 2 in the time period of T2-T3. At the time T3, the object performs interactive operation on the currently played video clip s2-5 again through the terminal device, adjusts the viewing angle, and the adjustment and playing process is similar to the foregoing steps, so that no further description is provided.
In one embodiment of the present application, the 3-way video code stream in the video playing flow shown in fig. 2 may also be implemented by using layered coding, for example, by using SVC technology to perform coding, so as to obtain 3-way video code streams with different quality levels. Taking a played video as an example of a commodity brand display video, in the process of watching the video, an object can zoom in or zoom out a video image picture by clicking "+", "-" keys on a keyboard or volume "+", "-" keys on a mobile phone and a tablet, so as to achieve the effect of watching the commodity display video in more detail or in the whole world. For example, in the playing flow shown in fig. 2, the commodity video corresponding to the video code stream S0 is played by default from the time T0; at the time T1, the object switches the video to the commodity video corresponding to the video code stream S1 through the terminal equipment volume "+" key, and the video code stream S1 is the video for amplifying and observing the commodity; at the time T2, the object switches the video to the commodity video corresponding to the video code stream S2 through the terminal equipment volume "-" key, and the video code stream S2 is the video of the commodity viewed globally. The specific details of the switching process are similar to those of the shooting machine, so that the details are not repeated.
In the interactive video playing process of the related art, video information and interactive description information are often stored separately, and in particular, video information files prepared for responding to different object interactive selections generally exist independently. Therefore, the player in the video playing device needs to acquire different video information files and perform splicing playing to realize a complete one-time playing process. The player composes the complete playing content corresponding to the object selection by splicing the video information, and in the process, the steps of acquiring the video information data, starting playing, stopping playing and acquiring new video information again need to be repeated, and the process is circulated until all the video information is played.
The player needs time to start, stop and acquire the video information, and the interaction of the corresponding video information can cause the clamping of the player. The reason for this is that, in general, the video image frame rate is about 24 to 25fps, that is, the longest waiting interval between two adjacent frames of images is about 40 milliseconds, and if the video information frame rate is higher, the interval is shorter. The time consumed by the player to switch the video information once, namely, the time consumed by the player to stop, acquire new materials and start playing again is often significantly higher than 40 milliseconds, so that the phenomenon of jamming occurs in the video playing process. Particularly, when the video information switching scheme is directed to an application scene with high-frequency interactive operation and in which the player needs to continuously respond to the interaction and continuously adjust the playing content, seamless playing experience cannot be provided due to the problem of the player jamming, and continuous jamming of the picture or the sound of the player can be shown in the playing process, so that the object experience in the continuous response interaction is reduced.
According to the playing method of the interactive multimedia information provided by the embodiment of the application, the access information of the video clips is described through the manifest file, and in the process of switching the video clips based on object interaction, the player directly and sequentially plays the next video clip indicated by the access address according to the manifest file, so that the splicing of the video information is not needed, and the player is not needed to be repeatedly started and stopped, thereby improving the switching efficiency of the video clips and realizing smooth and seamless video switching. In addition, the video information in the embodiment of the application adopts a mode of layered coding and multi-channel code stream transmission, so that the playing of the interactive video with variable quality is realized, for example, the interactive video can realize continuous change in the aspects of image resolution, video frame rate, field of View (FOV) and the like, and particularly, under a high-frequency interaction scene, the video with different quality can be rapidly switched, and high-quality playing experience is provided.
It should be noted that although the steps of the methods of the present application are depicted in the accompanying drawings in a particular order, this does not require or imply that the steps must be performed in that particular order, or that all illustrated steps be performed, to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step to perform, and/or one step decomposed into multiple steps to perform, etc.
The following describes embodiments of an apparatus or system according to the present application, which may be used to perform the method for playing interactive multimedia information in the foregoing embodiments of the present application.
Fig. 8 schematically illustrates a block diagram of an interactive multimedia information playing system according to an embodiment of the present application. As shown in fig. 8, the system 810 includes: a video material acquisition module 811, a video format analysis module 812, a video interaction point acquisition module 813, a user interaction instruction processing module 814, a play data piece selection module 815, a play data decoding module 816, and an image sound rendering module 817.
The video material acquisition module 811 is connected to the video server 820, and is configured to acquire video material file data, i.e., acquire a video code stream, from the video server 820. The video material is generally acquired and transmitted by using an application layer streaming media protocol based on HTTP (HyperText Transfer Protocol ), such as: apple HLS (HTTP Live Streaming, HTTP-based streaming network transport protocol) or MPEG DASH (Dynamic Adaptive Streaming over HTTP, HTTP-based adaptive streaming). In the process of acquiring the video material file data, the downloading and transmission of the video material file is not required to be completed, but a part of data is downloaded and transmitted, the subsequent video format analysis module 812, the play data sheet selection module 815, the play data decoding module 816 and the image sound rendering module 817 can start to work, and then the downloading and transmission of the video file and the video file playing of the downstream module are synchronously carried out in parallel.
The video format parsing module 812 parses and records the video material acquired by the video material acquisition module 811 according to the format of the manifest file provided by the present application, and generates a corresponding manifest file. Please refer to the description of the foregoing method embodiment, and the description is omitted herein.
The video interaction point obtaining module 813 is configured to obtain information describing the interaction mode and content of the object and the video, i.e. obtain interaction description information. The video interaction point obtaining module 813 may be connected to the video server 820, that is, the video interaction point obtaining module 813 may obtain interaction description information from the video server 820, and it should be noted that the video server storing the interaction description information and the video server storing the video material may be the same server or different servers. The interaction description information generally has at least a time point when the interaction occurs, and is used for binding the Ji Jiaohu interface and the video picture content thereof; and elements available for object interaction, such as: all video content branches selectable at this point in time need to be described in the interactive play. In particular, according to the playing method of the interactive multimedia information provided by the embodiment of the application, the time for which the object is allowed to interact can be equivalent to the video duration at the longest. That is, the object can continuously interoperate to change image quality, viewing angle, etc. throughout viewing of the video.
The user interaction instruction processing module 814 is configured to provide an actual interaction operation interface according to the interaction description information acquired by the video interaction point acquiring module 813, where the presentation form may be a list selection box on the UI interface of the program, buttons arranged together, a picture that can be skipped after clicking, and so on. The user interaction instruction processing module 814 will ultimately convert the interaction operation of the object into a specific description of the operation of the video material, for example, the object rotates the terminal device in some manner, the user interaction instruction processing module 814 detects the interaction operation and outputs an instruction for performing a specific operation on the video material corresponding to the interaction operation, if the rotation of the terminal device corresponds to adjusting the viewing angle of the video, the URL (or access ID) of the video material under the adjusted target viewing angle is output. For another example, the output may be based on other interactions: the timeline progress of the video file currently being played, the operation of the player (pause, switch definition, stop, etc.), etc.
The play data slice selection module 815 is configured to find a suitable quality video material (i.e. find a corresponding video code stream) according to an instruction corresponding to the interactive operation and the manifest file, and address a specific video data slice according to the corresponding sub-manifest file. Specific processes may refer to the related descriptions in the foregoing method embodiments, and are not repeated herein.
The play Data decoding module 816 is configured to decode the audio/video compressed code stream into original image and sound Data, where the image may be YUV, RGB, etc. form of original Data (Raw Data), and the sound may be PCM form of original Data.
The image and sound rendering module 817 is used for displaying the original image and sound data output in S6 through the peripheral device.
Fig. 9 schematically shows a block diagram of an interactive multimedia information playing device according to an embodiment of the present application. As shown in fig. 9, the apparatus includes:
an interactive operation module 910, configured to, when detecting an interactive operation for a current multimedia segment in a current multimedia information code stream, generate description information of a target multimedia segment in a target multimedia information code stream according to the interactive operation; the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time;
the target access information determining module 920 is configured to search, according to the description information of the target multimedia segment, target access information of the target multimedia segment in a manifest file corresponding to the multimedia information; the manifest file includes access information for each multimedia segment of the multimedia information;
And the target multimedia segment playing module 930 is configured to obtain the target multimedia segment according to the target access information for playing.
In one embodiment of the present application, the description information of the target multimedia segment includes a segment identifier of the target multimedia segment and a multimedia information code stream identifier corresponding to the target multimedia segment; the manifest file includes a first manifest file and a second manifest file; the target access information determination module 920 includes:
a first manifest file searching unit, configured to search a second manifest file corresponding to the target multimedia segment in a first manifest file corresponding to the multimedia information according to a multimedia information code stream identifier corresponding to the target multimedia segment; the first manifest file comprises a corresponding relation between a multimedia information code stream identifier and a second manifest file;
a second manifest file searching unit, configured to search, according to the segment identifier of the target multimedia segment, target access information of the target multimedia segment in the second manifest file; the second manifest file includes a correspondence between a fragment identification of the multimedia fragment and access information of the multimedia fragment.
In one embodiment of the application, the apparatus further comprises:
the multimedia information acquisition module is used for acquiring a plurality of multimedia information code streams corresponding to the multimedia information;
the multimedia information analysis module is used for analyzing each path of multimedia information code stream to obtain a plurality of multimedia fragments included in each path of multimedia information code stream; wherein, the number of the multimedia fragments included in each path of multimedia information code stream is the same;
a second manifest file generating module, configured to generate, for each multimedia information code stream, the second manifest file according to the segment identifiers of each multimedia segment of the multimedia information code stream and the access information of each multimedia segment;
and the first list file generating module is used for generating the first list file according to the second list files corresponding to the multimedia information code streams and the multimedia information code stream identifiers corresponding to the multimedia information code streams.
In one embodiment of the application, the apparatus further comprises:
the ordering module is used for ordering a plurality of multimedia fragments of each path of multimedia information code stream according to the playing time sequence;
the segment identification generation module is used for generating segment identifications of the multimedia segments according to the multimedia information code stream identifications of the multimedia information code streams and the sequencing numbers of the multimedia segments; wherein, in different multimedia information code streams, the playing time length of the multimedia fragments corresponding to the same sequencing number is the same.
In one embodiment of the present application, the multimedia information acquisition module is specifically configured to:
acquiring multi-channel coded data corresponding to the multimedia information from a server;
and analyzing the multi-path coding data of the multimedia information to obtain multi-path multimedia information code streams, wherein one path of coding data corresponds to one path of multimedia information code stream.
In one embodiment of the present application, the multimedia information acquisition module is specifically configured to:
acquiring encoded data of the multimedia information from a server;
and analyzing the encoded data of the multimedia information to obtain a multi-path multimedia information code stream.
In one embodiment of the present application, the target multimedia clip playing module 930 includes:
a target coded data acquisition unit, configured to acquire target coded data of the target multimedia segment according to the target access information;
the decoding unit is used for performing decoding operation on the target coded data to obtain target decoded data corresponding to the target multimedia fragment;
and the rendering unit is used for rendering and displaying the target decoding data so as to play the target multimedia fragment.
Specific details of the interactive multimedia information playing device provided in each embodiment of the present application have been described in the corresponding method embodiments, and are not described herein.
Fig. 10 schematically shows a block diagram of a computer system of an electronic device for implementing an embodiment of the application.
It should be noted that, the computer system 1000 of the electronic device shown in fig. 10 is only an example, and should not impose any limitation on the functions and the application scope of the embodiments of the present application.
As shown in fig. 10, the computer system 1000 includes a central processing unit 1001 (Central Processing Unit, CPU) which can execute various appropriate actions and processes according to a program stored in a Read-Only Memory 1002 (ROM) or a program loaded from a storage section 1008 into a random access Memory 1003 (Random Access Memory, RAM). In the random access memory 1003, various programs and data necessary for the system operation are also stored. The cpu 1001, the rom 1002, and the ram 1003 are connected to each other via a bus 1004. An Input/Output interface 1005 (i.e., an I/O interface) is also connected to bus 1004.
The following components are connected to the input/output interface 1005: an input section 1006 including a keyboard, a mouse, and the like; an output portion 1007 including a Cathode Ray Tube (CRT), a liquid crystal display (Liquid Crystal Display, LCD), and a speaker; a storage portion 1008 including a hard disk or the like; and a communication section 1009 including a network interface card such as a local area network card, a modem, or the like. The communication section 1009 performs communication processing via a network such as the internet. The drive 1010 is also connected to the input/output interface 1005 as needed. A removable medium 1011, such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like, is installed as needed in the drive 1010, so that a computer program read out therefrom is installed as needed in the storage section 1008.
In particular, the processes described in the various method flowcharts may be implemented as computer software programs according to embodiments of the application. For example, embodiments of the present application include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method shown in the flowcharts. In such an embodiment, the computer program may be downloaded and installed from a network via the communication portion 1009, and/or installed from the removable medium 1011. The computer programs, when executed by the central processor 1001, perform the various functions defined in the system of the present application.
It should be noted that, the computer readable medium shown in the embodiments of the present application may be a computer readable signal medium or a computer readable storage medium, or any combination of the two. The computer readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples of the computer-readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-Only Memory (ROM), an erasable programmable read-Only Memory (Erasable Programmable Read Only Memory, EPROM), flash Memory, an optical fiber, a portable compact disc read-Only Memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present application, however, the computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, with the computer-readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wired, etc., or any suitable combination of the foregoing.
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
It should be noted that although in the above detailed description several modules or units of a device for action execution are mentioned, such a division is not mandatory. Indeed, the features and functions of two or more modules or units described above may be embodied in one module or unit in accordance with embodiments of the application. Conversely, the features and functions of one module or unit described above may be further divided into a plurality of modules or units to be embodied.
From the above description of embodiments, those skilled in the art will readily appreciate that the example embodiments described herein may be implemented in software, or may be implemented in software in combination with the necessary hardware. Thus, the technical solution according to the embodiments of the present application may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (may be a CD-ROM, a U-disk, a mobile hard disk, etc.) or on a network, and includes several instructions to cause a computing device (may be a personal computer, a server, a touch terminal, or a network device, etc.) to perform the method according to the embodiments of the present application.
Other embodiments of the application will be apparent to those skilled in the art from consideration of the specification and practice of the application disclosed herein. This application is intended to cover any variations, uses, or adaptations of the application following, in general, the principles of the application and including such departures from the present disclosure as come within known or customary practice within the art to which the application pertains.
It is to be understood that the application is not limited to the precise arrangements and instrumentalities shown in the drawings, which have been described above, and that various modifications and changes may be effected without departing from the scope thereof. The scope of the application is limited only by the appended claims.

Claims (10)

1. A method for playing interactive multimedia information, comprising:
generating description information of a target multimedia fragment in a target multimedia information code stream according to the interactive operation when the interactive operation for the current multimedia fragment in the current multimedia information code stream is detected; the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time;
searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; the manifest file includes access information for each multimedia segment of the multimedia information;
and acquiring the target multimedia fragment according to the target access information for playing.
2. The method for playing back interactive multimedia information according to claim 1, wherein the description information of the target multimedia segment includes a segment identifier of the target multimedia segment and a multimedia information code stream identifier corresponding to the target multimedia segment; the manifest file includes a first manifest file and a second manifest file; searching the target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment, wherein the method comprises the following steps:
Searching a second list file corresponding to the target multimedia fragment in a first list file corresponding to the multimedia information according to the multimedia information code stream identifier corresponding to the target multimedia fragment; the first manifest file comprises a corresponding relation between a multimedia information code stream identifier and a second manifest file;
searching target access information of the target multimedia fragment in the second manifest file according to the fragment identification of the target multimedia fragment; the second manifest file includes a correspondence between a fragment identification of the multimedia fragment and access information of the multimedia fragment.
3. The method for playing back interactive multimedia information according to claim 2, wherein before searching for the target access information of the target multimedia segment in the manifest file corresponding to the multimedia information according to the description information of the target multimedia segment, the method further comprises:
acquiring a plurality of paths of multimedia information code streams corresponding to the multimedia information;
analyzing each path of multimedia information code stream to obtain a plurality of multimedia fragments contained in each path of multimedia information code stream; wherein, the number of the multimedia fragments included in each path of multimedia information code stream is the same;
For each path of multimedia information code stream, generating the second list file according to the segment identification of each multimedia segment of the multimedia information code stream and the access information of each multimedia segment;
and generating the first list file according to the second list file corresponding to each path of multimedia information code stream and the multimedia information code stream identifier corresponding to each path of multimedia information code stream.
4. A method of playing back interactive multimedia information according to claim 3, characterized in that before generating the second manifest file from the segment identifications of the respective multimedia segments of the multimedia information code stream and the access information of the respective multimedia segments, the method further comprises:
sequencing a plurality of multimedia fragments of each path of multimedia information code stream according to the playing time sequence;
generating a segment identifier of the multimedia segment according to the multimedia information code stream identifier of the multimedia information code stream and the sequencing number of the multimedia segment; wherein, in different multimedia information code streams, the playing time length of the multimedia fragments corresponding to the same sequencing number is the same.
5. The method for playing back interactive multimedia information according to claim 3, wherein obtaining a multi-channel multimedia information code stream corresponding to the multimedia information comprises:
Acquiring multi-channel coded data corresponding to the multimedia information from a server;
and analyzing the multi-path coding data of the multimedia information to obtain multi-path multimedia information code streams, wherein one path of coding data corresponds to one path of multimedia information code stream.
6. The method for playing back interactive multimedia information according to claim 3, wherein obtaining a multi-channel multimedia information code stream corresponding to the multimedia information comprises:
acquiring encoded data of the multimedia information from a server;
and analyzing the encoded data of the multimedia information to obtain a multi-path multimedia information code stream.
7. The method for playing back interactive multimedia information according to any one of claims 1 to 6, wherein obtaining the target multimedia clip according to the target access information for playing back includes:
acquiring target coding data of the target multimedia fragment according to the target access information;
performing decoding operation on the target coded data to obtain target decoded data corresponding to the target multimedia fragment;
rendering and displaying the target decoding data so as to play the target multimedia fragment.
8. An interactive multimedia information playing device, comprising:
The interactive operation module is used for generating description information of a target multimedia fragment in the target multimedia information code stream according to the interactive operation when the interactive operation of the current multimedia fragment in the current multimedia information code stream is detected; the current multimedia information code stream and the target multimedia information code stream are two paths of different multimedia information code streams corresponding to the same multimedia information, and the target multimedia fragment and the current multimedia fragment are continuous in time;
the target access information determining module is used for searching target access information of the target multimedia fragment in a list file corresponding to the multimedia information according to the description information of the target multimedia fragment; the manifest file includes access information for each multimedia segment of the multimedia information;
and the target multimedia fragment playing module is used for acquiring the target multimedia fragment according to the target access information to play.
9. A computer readable medium on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the method of playing back interactive multimedia information according to any one of claims 1 to 7.
10. An electronic device, comprising:
a processor; and
a memory for storing executable instructions of the processor;
wherein execution of the executable instructions by the processor causes the electronic device to perform the method of playing interactive multimedia information according to any one of claims 1 to 7.
CN202211157488.XA 2022-09-22 2022-09-22 Interactive multimedia information playing method and device, medium and electronic equipment Pending CN116996488A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211157488.XA CN116996488A (en) 2022-09-22 2022-09-22 Interactive multimedia information playing method and device, medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211157488.XA CN116996488A (en) 2022-09-22 2022-09-22 Interactive multimedia information playing method and device, medium and electronic equipment

Publications (1)

Publication Number Publication Date
CN116996488A true CN116996488A (en) 2023-11-03

Family

ID=88525367

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211157488.XA Pending CN116996488A (en) 2022-09-22 2022-09-22 Interactive multimedia information playing method and device, medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN116996488A (en)

Similar Documents

Publication Publication Date Title
US12028564B2 (en) Methods, systems, processors and computer code for providing video clips
US20210099690A1 (en) Video production sharing apparatus and method
KR102027410B1 (en) Transmission of reconstruction data in a tiered signal quality hierarchy
US10225613B2 (en) Method and apparatus for video playing processing and television
EP2830322A1 (en) Method, terminal and server for implementing trickplay
IL220482A (en) Audio splitting with codec-enforced frame sizes
CN109587514B (en) Video playing method, medium and related device
EP4131983A1 (en) Method and apparatus for processing three-dimensional video, readable storage medium, and electronic device
CN106789976A (en) The player method of media file, service end, client and system
CN111836076A (en) Video resolution switching method and device and electronic equipment
US20150268808A1 (en) Method, Device and System for Multi-Speed Playing
CN106331763B (en) Method for seamlessly playing fragmented media file and device for implementing method
CN116088732A (en) Multimedia playing method, device and equipment
US20200099987A1 (en) Systems and methods for displaying a live video stream in a graphical user interface
CN104994406B (en) A kind of video editing method and device based on Silverlight plug-in units
CN114257572B (en) Data processing method, device, computer readable medium and electronic equipment
US20230217047A1 (en) Method, system, and computer-readable recording medium for implementing fast-switching mode between channels in multi-live transmission environment
EP3522525B1 (en) Method and apparatus for processing video playing
CN116996488A (en) Interactive multimedia information playing method and device, medium and electronic equipment
JP6063952B2 (en) Method for displaying multimedia assets, associated system, media client, and associated media server
CN113766255B (en) Video stream merging method, device, electronic equipment and computer medium
CN116527992A (en) Video playing resolution switching method and device
CN111131845A (en) Method, electronic device and storage medium for multimedia resource carousel
WO2007055445A1 (en) A pre-viewing method of multiple movies or movie-clips in multimedia apparatus
CN112887786A (en) Video playing method and device and computer readable medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination