WO2020140478A1 - Method for playing audio, video, and picture data - Google Patents

Method for playing audio, video, and picture data Download PDF

Info

Publication number
WO2020140478A1
WO2020140478A1 PCT/CN2019/106073 CN2019106073W WO2020140478A1 WO 2020140478 A1 WO2020140478 A1 WO 2020140478A1 CN 2019106073 W CN2019106073 W CN 2019106073W WO 2020140478 A1 WO2020140478 A1 WO 2020140478A1
Authority
WO
WIPO (PCT)
Prior art keywords
layer
data
audio
picture
same
Prior art date
Application number
PCT/CN2019/106073
Other languages
French (fr)
Chinese (zh)
Inventor
李庆成
鹿毅忠
Original Assignee
李庆成
鹿毅忠
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 李庆成, 鹿毅忠 filed Critical 李庆成
Publication of WO2020140478A1 publication Critical patent/WO2020140478A1/en

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/055Time compression or expansion for synchronising with other signals, e.g. video signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/24Systems for the transmission of television signals using pulse code modulation
    • H04N7/52Systems for transmission of a pulse code modulated video signal with one or more other pulse code modulated signals, e.g. an audio signal or a synchronizing signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A method for playing audio, video, and picture data, comprising: downloading and parsing audio, video, and picture data to obtain an upper-layer picture audio, an upper-layer picture, and/or an upper-layer picture alignment parameter and/or an upper-layer audio alignment parameter in upper-layer audio and picture data; and playing the upper-layer picture audio or the upper picture automatically or upon receipt of a command of playing the upper-layer picture audio or the upper picture, and playing the corresponding upper picture or the corresponding upper picture audio when a playback time indicated by the upper-layer picture alignment parameter or the upper-layer audio alignment parameter elapses or upon receipt of the command of playing the upper-layer picture audio or the upper picture. A picture is played together with a specific audio segment in a corresponding manner, so that audio content and picture content are perfectly matched or associated; furthermore, chained or layered playback and embedded playback of audio and picture data and film and television data can be conveniently achieved.

Description

音视图数据的播放方法Play method of audio view data 技术领域Technical field
本发明涉及一种媒体播放技术,具体是一种多种媒体交叉播放的方法;属于互联网媒体技术。The invention relates to a media playback technology, in particular to a method for cross-playing multiple media; belongs to the Internet media technology.
背景技术Background technique
媒体播放方式有音频、视频、动画、图片等单独播放的方式,也有音频和图片组合播放的方式。现有的音频和图片组合播放时,通常是在播放一段音频时,陆续地按照一定的顺序依次播放一个或者多个图片。这种播放方式虽然可以做到音图并茂,但也有一个重大的缺点:音频的播放与图片呈现的方式基本上无关。即:各个图片在播放时,虽然可以由观看者顺序或者倒序控制播放的顺序,但是音频只能顺序播放,不能像切换图片那样倒回去。这样就使音频和图片组合播放时,音频的内容和图片的内容无法得到对应。这种缺陷使得现有音频和图片组合播放的媒体播放方式在很多网络培训、授课、交流中无法得到应用或者用户体验很差。Media playback methods include audio, video, animation, pictures, and other individual playback methods, as well as audio and picture combination playback methods. When a combination of existing audio and pictures is played, usually when one piece of audio is played, one or more pictures are successively played in a certain order. Although this type of playback can achieve both audio and video, it also has a major disadvantage: audio playback is basically irrelevant to the way the picture is presented. That is to say: while each picture is being played, although the order of playing can be controlled by the viewer order or in reverse order, the audio can only be played in order and cannot be reversed like switching pictures. In this way, when the audio and the picture are played together, the content of the audio and the content of the picture cannot be matched. This defect makes the existing media playback method of combined audio and picture playback unavailable in many online trainings, lectures, and exchanges, or the user experience is poor.
此外,在一些场景下,将音频和图片组合播放这种形式与短视频或者动画结合起来是非常有意义的。例如:在进行机械原理网络授课的场景中,一方面需要教师结合静态的机械图纸来讲解有关的专业内容,另一方面,演示相应机械结构在工作状态下的运动过程更有助于学生对教师讲授的理论内容的理解。但是,采用现有的媒体播放方式,要么只能使用前述音频和图片组合播放的方式,要么只能单独使用视频或者动画演示的方式,而不能将两者有机地结合起来,相互嵌套。如果只是使用音频和图片组合播放的方式,则往往会使得教师讲述的内容过于枯燥,造成学生学习的效果不理想;但如果只是用视频或者动画播放的方式来授课,则一方面视频或者动画的制作成本较高,另一方面,播放时也需要较好的网络传输质量和较高的带宽资源,成本高,并且对于网络环境相对不是很好或者很稳定的地区,这样的网络授课方式也会受到限制。In addition, in some scenes, it is very meaningful to combine the combination of audio and picture playback with short video or animation. For example: in the scenario of network teaching of mechanical principles, on the one hand, teachers need to combine static mechanical drawings to explain relevant professional content; on the other hand, the demonstration of the corresponding mechanical structure in the working state of the movement process is more helpful for students to teachers The understanding of the theoretical content taught. However, the existing media playback method can only use the aforementioned combination of audio and picture playback, or can only use video or animation presentation alone, and cannot organically combine the two and nest each other. If you only use the combination of audio and pictures to play, it will often make the teacher's content too boring, resulting in unsatisfactory student learning; but if you only use video or animation playback to teach, then on the one hand, video or animation The production cost is high. On the other hand, it also needs better network transmission quality and higher bandwidth resources during playback. The cost is high, and for areas where the network environment is not very good or very stable, this network teaching method will also be restricted.
发明目的Purpose of the invention
本发明的主要目的是提供一种音视图数据的播放方法,借助于该方法,一方面,可以在播放音图数据时,可以使任何图片都能够和前述的音频中某一特定段落相对应地播放,实现音频内容和图片内容的完美匹配或关联; 另一方面,可以在播放影视数据时,为随时切换或者嵌套其他音图数据、影视数据做好必要的准备。The main purpose of the present invention is to provide a method for playing audiovisual data. With this method, on the one hand, when playing audiographic data, any picture can correspond to a specific paragraph in the aforementioned audio Play to achieve perfect matching or association of audio content and picture content; on the other hand, when playing video data, you can make necessary preparations for switching or nesting other audiovisual data and video data at any time.
本发明的目的是采用如下的技术方案实现的:The purpose of the present invention is achieved using the following technical solutions:
下载音视图数据并对其进行解析,以获得所述上层音图数据中的上层图片音频、上层图片和/或上层图片对准参数和/或上层音频对准参数;Download the audio view data and parse it to obtain upper layer picture audio, upper layer picture and/or upper layer picture alignment parameters and/or upper layer audio alignment parameters in the upper layer audiogram data;
播放所述上层图片音频或者上层图片,并在所述上层图片对准参数或者上层音频对准参数指示的播放时间到达,或者在接收到播放所述上层图片音频或者上层图片的命令时,播放对应的所述上层图片或者上层图片音频;Play the upper layer picture audio or upper layer picture, and when the playback time indicated by the upper layer picture alignment parameter or upper layer audio alignment parameter arrives, or when receiving a command to play the upper layer picture audio or upper layer picture, play the corresponding The upper layer picture or the upper layer picture audio;
或者,or,
下载音视图数据并对其进行解析,以获得所述上层影视数据中的上层视频数据或者上层动画数据、下层音图标识和/或下层影视标识;Download the audiovisual data and parse it to obtain the upper layer video data or the upper layer animation data, the lower layer sound image logo and/or the lower layer film logo in the upper layer video data;
自动,或者在接收到播放所述上层视频数据或者上层动画数据时,播放所述上层视频数据或者上层动画数据。Automatically, or when receiving the upper layer video data or the upper layer animation data, playing the upper layer video data or the upper layer animation data.
利用本发明上述的方法,人们可以预先对要播放的图片以及相应的音频段落进行关联处理,获得它们之间在播放时的对准(关联)参数;在播放时,则根据预先形成的、并且与这些音图数据或影视数据一同下载的参数来对有关的音图数据或影视数据的播放进行控制,可以使任何图片都能够和前述的某一特定音频段落相对应地播放,实现了音频内容和图片内容的完美匹配或关联;另一方面,还能够方便地实现音图数据、影视数据链式或者层式播放和嵌入播放。Using the above-mentioned method of the present invention, people can pre-associate the pictures to be played and corresponding audio passages to obtain the alignment (association) parameters between them during playback; during playback, according to the pre-formed and The parameters downloaded together with these audiographic data or video data to control the playback of the relevant audiographic data or video data can enable any picture to be played corresponding to a specific audio paragraph described above, and realize audio content Perfect match or association with the picture content; on the other hand, it can also easily realize the audio or video data, film and television data chain or layered playback and embedded playback.
以下,将结合各个具体的实施方式,对本发明的技术方案做更为详细的披露。In the following, the technical solutions of the present invention will be disclosed in more detail in conjunction with various specific embodiments.
具体的实施方式Specific implementation
在详细介绍本发明的各个具体实施方式之前,有必要先对本发明涉及的一些数据对象和术语做一个具体的说明。本发明人在对本发明的各类技术方案进行研究和开发的时候,对本发明涉及的各个数据对象做了系统性的梳理,由此建立和定义了如下的若干数据对象:Before introducing the specific embodiments of the present invention in detail, it is necessary to make a specific description of some data objects and terms involved in the present invention. When researching and developing various technical solutions of the present invention, the present inventor systematically sorted out the various data objects involved in the present invention, thus establishing and defining the following data objects:
1.音视图数据:音视图数据主要有音图数据、影视数据、同层音图标识和同层影视标识。1. Audio-view data: Audio-view data mainly includes audio-visual data, video data, audio-visual identification of the same layer and video identification of the same layer.
2.音图数据:音图数据主要有两种类型。2. Audiographic data: There are two main types of audiographic data.
第一种类型的音图数据是由一幅静态图片和一段将和该图片一道播放的音频所构成;对于该静态图片,在本发明中统称为图片;而对于该音频,在本发明中统称为图片音频。此外,在音图数据中还设计有被本发明人称之为对准参数的数据;对准参数根据其作用的不同,被分为图片对准参数和音频对准参数。The first type of audiogram data is composed of a static picture and a piece of audio to be played together with the picture; the static picture is collectively referred to as a picture in the present invention; and the audio is collectively referred to as the present invention Audio for pictures. In addition, the audiogram data is also designed with data that the inventors call alignment parameters; the alignment parameters are divided into picture alignment parameters and audio alignment parameters according to their different functions.
第二种类型的音图数据是由多幅静态图片和多段与多幅静态图片对应播放的音频所构成;对于这些静态图片,在本发明中也统称为图片;而对于这些音频,在本发明中也统称为图片音频。此外,由于图片和音频都是多个,因此,在音图数据中被设计的对准参数与之相对应,也是多个;其数量和图片的数量或者音频的数量是对应的;第二种类型音图数据中的对准参数与第一种类型的音图数据一样,也被分为图片对准参数和音频对准参数。The second type of audiogram data is composed of multiple static pictures and multiple pieces of audio corresponding to multiple static pictures; these static pictures are also collectively referred to as pictures in the present invention; and these audio pictures are provided in the present invention Also referred to collectively as picture audio. In addition, since there are many pictures and audios, the alignment parameters designed in the audiogram data correspond to them, and there are also many; the number corresponds to the number of pictures or the number of audios; the second The alignment parameters in the typed audiogram data are the same as the first type of audiogram data, and are also divided into picture alignment parameters and audio alignment parameters.
在本发明中音图数据是一个完整的数据对象,它可以采用现有的任何图片、音频以及信息的数据格式拼合构成,也可以在具体的方案中,由相关的技术人员根据具体的需要将它们重新构成一个全新格式的一体化数据对象。而无论如何,在本发明中,只要具有上述数据成分的数据对象,都被称之为音图数据。In the present invention, the audiogram data is a complete data object, which can be composed of any existing data formats of pictures, audio and information, or in a specific scheme, by relevant technical personnel according to specific needs. They reconstruct an integrated data object in a completely new format. In any case, in the present invention, as long as the data object has the above-mentioned data components, it is called phonetic data.
3.影视数据:影视数据则主要是由视频数据或者动画数据所构成,除此之外,影视数据中还设计有下层音图标识和下层影视标识这两种数据。一般而言,影视数据中可以设计为只有一个视频数据或者动画数据;当然,也可以设置有多个视频数据或者动画数据,那样的话,需要在设计播放软件更加细心,一面出现数据逻辑上的错误,而对于本发明的技术方案而言,不过是增加了更多的组合方式而已。3. Film and television data: Film and television data is mainly composed of video data or animation data. In addition, the film and television data is also designed with two types of data: the lower layer audiogram logo and the lower layer film and television logo. Generally speaking, the video data can be designed to have only one video data or animation data; of course, multiple video data or animation data can also be set. In that case, you need to be more careful in designing the playback software, and there will be data logic errors on the one hand. However, for the technical solution of the present invention, only more combinations are added.
4.同层音图标识和同层影视标识。4. The same layer audiograph logo and the same layer video logo.
同层音图标识是用来告诉播放设备:在已经下载的这个音图数据或者影视数据之后,是不是还存在需要后续下载和播放的音图数据;为此,在同层音图标识中至少要包含同层音图下载参数和同层音图播放参数,同层音图下载参数用来指示后续音图数据如何下载,同层音图播放参数用来告诉播放设备在什么时候播放下载好的后续音图数据。The same layer audiogram identifier is used to tell the playback device: after the downloaded audiogram data or video data, is there any audiogram data that needs to be downloaded and played later; for this reason, at least in the same layer audiogram identifier To include the same layer audio image download parameters and the same layer audio image playback parameters, the same layer audio image download parameters are used to indicate how the subsequent audio image data is downloaded, and the same layer audio image playback parameters are used to tell the playback device when to play the downloaded Subsequent audiogram data.
同层影视标识是用来告诉播放设备:在已经下载的这个音图数据或者影视数据之后,是不是还存在需要后续下载和播放的影视数据;为此,在同层影视标识中至少要包含同层影视下载参数和同层影视播放参数,同层影视下载参数用来指示后续影视数据如何下载,同层影视播放参数用来告诉播放设备在什么时候播放下载好的后续影视数据。The same layer of film and television logo is used to tell the playback device: after the downloaded audiovisual data or film and television data, is there any film and television data that needs to be downloaded and played later; for this reason, at least the same layer of film and television logo must contain the same The film and television download parameters of the same layer and the film and television playback parameters of the same layer. The film and television download parameters of the same layer are used to indicate how to download the subsequent film and television data.
5.上层、下层和同层的含义。5. The meaning of upper layer, lower layer and same layer.
基于此后的各类具体的实施方式就可以看到,本发明技术方案的一类非常有价值的方案是可以实现如下的技术效果:可以将多个本发明的音视图数据组成音视图数据“链”,这个“链”上的多个音视图数据可以顺序播放;另一方面,前述音视图数据“链”上的任何一个音视图数据在播放时,均可以插入播放另一个音视图数据,而这个被插入播放的音视图数据可能是下一层音视图数据“链”中的一个音视图数据。显然,由此就出现了音视图数据同层、上下层等结构关系,为了在本发明后续各类具体的实施方式中描述方便,本发明人引入“上层”、“下层”和“同层”等概念,用于作为音图数据、影视数据以及相应数据内容的定语,其描述的含义正如前面所述内容的解释。Based on various specific implementations thereafter, it can be seen that a very valuable solution of the technical solution of the present invention is that it can achieve the following technical effects: multiple audio view data of the present invention can be combined into an audio view data “chain ", multiple audio-view data on this "chain" can be played sequentially; on the other hand, any audio-view data on the aforementioned audio-view data "chain" can be inserted to play another audio-view data, and The audio view data inserted and played may be one audio view data in the "chain" of audio view data in the next layer. Obviously, there is a structural relationship between the same layer and the upper and lower layers of the audio view data. For the convenience of description in various subsequent specific embodiments of the present invention, the inventor introduced the "upper layer", "lower layer" and "same layer" And other concepts, which are used as attributions of audiovisual data, video data, and corresponding data content, and the meaning of their description is just as explained in the foregoing content.
在本发明的第1类具体的实施方式中,主要是针对音图数据播放的具体方案,在这个方案中:首先需要下载以音图数据为主体的音视图数据;当这个音图数据被下载到播放设备中以后,需要对其进行解析;具体的解析方案,则需要根据前述的音图数据的具体格式来进行。通过解析,就可以从音图数据中获得可以用来播放的图片、图片音频。接下来,播放就可以开始播放音图数据中的图片音频,同时将图片一并显示。In the first specific embodiment of the present invention, it is mainly directed to the specific scheme of playing audiogram data. In this scheme: first, it is necessary to download audiogram data with audiogram data as the main body; when this audiogram data is downloaded After reaching the playback device, it needs to be parsed; the specific analysis scheme needs to be carried out according to the specific format of the aforementioned audiogram data. Through parsing, you can obtain pictures and picture audio that can be used for playback from the audiogram data. Next, you can start playing the picture audio in the audiogram data, and display the picture together.
在本发明的第2类具体的实施方式中,如前所述,在一些情形下,与前述第1类具体实施方式中的音图数据不同,音图数据中的图片音频或者图片的数量还会出现后面的三种情形:a.一个图片音频和多幅图片;b.多个图片音频和一幅图片;c.多个图片音频和多幅图片。这时,就需要使用对准参数来指示播放设备如何来播放这样的音图数据。当然,基于前述的三种情况,对准参数不仅分为图片对准参数和音频对准参数,还分为一对多,多对一以及多对多等三种情形。In the second specific embodiment of the present invention, as mentioned above, in some cases, unlike the audiographic data in the foregoing first specific embodiment, the number of picture audios or pictures in the audiographic data is also The following three situations will appear: a. one picture audio and multiple pictures; b. multiple picture audio and one picture; c. multiple picture audio and multiple pictures. At this time, it is necessary to use the alignment parameter to instruct the playback device how to play such audiogram data. Of course, based on the foregoing three cases, the alignment parameters are not only divided into picture alignment parameters and audio alignment parameters, but also divided into three cases of one-to-many, many-to-one, and many-to-many.
当音图数据中包含有一个图片音频和多幅图片时,在播放时,首先开 始播放图片音频,在播放图片音频的同时或者之后,陆续显示多个图片。对应于这种情况,在音图数据中配置的对准参数是多个图片对准参数,它们分别与多个图片一一对应,分别用来指示所对应的图片在图片音频播放到什么时候开始显示。When the audio image data contains a picture audio and multiple pictures, the picture audio will be played first when playing, and at the same time or after the picture audio is played, multiple pictures will be displayed one after another. Corresponding to this situation, the alignment parameters configured in the audiogram data are multiple picture alignment parameters, which correspond to multiple pictures one by one, and are used to indicate when the corresponding picture starts when the picture audio is played. display.
当音图数据中包含有一幅图片和多个图片音频时(尽管这种可能的情况很少),在播放时,则是首先开始显示图片,在显示图片的同时或者之后,陆续播放多个图片音频。对应于这种情况,在音图数据中配置的对准参数是多个音频对准参数,它们分别与多个图片音频一一对应,分别用来指示所对应的图片音频在图片显示到什么时候开始播放。这种音图数据中包含有一幅图片和多个图片音频的情形,往往可以用于那种有下层音视图数据要插入播放,或者可以用于被其他音视图引用的情形。When the audio image data contains a picture and multiple picture audios (although this may be rare), when playing, the picture is first displayed, and at the same time or after the picture is displayed, multiple pictures are played one after another Audio. Corresponding to this situation, the alignment parameters configured in the audiogram data are multiple audio alignment parameters, which are in one-to-one correspondence with multiple picture audios, and are used to indicate when the corresponding picture audio is displayed in the picture. Start playing. This kind of audio image data contains a picture and multiple image audios, which can often be used in the case where the underlying audio view data needs to be inserted and played, or it can be used when it is referenced by other audio views.
当音图数据中包含有多个图片音频和多幅图片时,在播放时,首先开始播放的是图片音频还是图片,则需要根据图片对准参数或者音频对准参数来确定。当然,这些图片对准参数或者音频对准参数的数量分别与图片或者图片音频的数量一一对应,分别用来指示所对应的图片或者图片音频播放的时机。When the audio image data includes multiple picture audios and multiple pictures, whether to start playing picture audio or pictures first during playback needs to be determined according to picture alignment parameters or audio alignment parameters. Of course, the number of these picture alignment parameters or audio alignment parameters corresponds to the number of pictures or picture audios, respectively, and is used to indicate the timing of the corresponding picture or picture audio playback, respectively.
此外,在一些情形下,图片对准参数可以被设置为对应的图片音频的索引或者标志;这样,在播放某一图片时,播放设备可以根据该图片对应的图片音频的索引或者标志,找到下载到播放设备的音频段落并执行播放该音频段落的步骤。同样,音频对准参数可以被设置为对应图片的索引或者标志;这样,在播放某一音频的时候,播放设备可以根据该音频对应图片的索引或者标志,找到下载到播放设备的图片并执行显示该图片的步骤。这样做的好处是:对于音图数据的播放,无论自动的还是用户人为控制的,都可以根据前述的方式,使得被播放的图片和图片音频总能够准确地对应,而再也不会出现图片的显示与音频的播放毫无关联的情形。真正做到音、图并茂。In addition, in some cases, the picture alignment parameter can be set to the index or flag of the corresponding picture audio; in this way, when playing a certain picture, the playback device can find the download according to the index or flag of the picture audio corresponding to the picture Go to the audio section of the playback device and perform the steps to play the audio section. Similarly, the audio alignment parameter can be set to the index or logo of the corresponding picture; in this way, when playing an audio, the playback device can find the image downloaded to the playback device according to the index or logo of the corresponding picture of the audio and perform display The steps of the picture. The advantage of this is that for the playback of audiovisual data, whether it is automatic or user-controlled, according to the aforementioned method, the played picture and the picture audio can always accurately correspond, and the picture will never appear again. The display has nothing to do with audio playback. Really achieve both sound and picture.
需要额外地说明的是:前述这种将对准参数设置为索引或者标识的方式,不仅仅能够使用在音图数据的播放控制,同样可以使用于影视数据的播放控制。所不同的只是这些对准参数设置于何处;这一点,在本发明后续的具体实施方式中还会进行披露。此外,前述的音图数据也好,影视数 据也罢,其都涵盖了上层、同层和下层等各种情形。在此不再赘述。It should be additionally noted that the aforementioned way of setting the alignment parameter as an index or a mark can be used not only for playback control of audiographic data, but also for playback control of video data. The only difference is where these alignment parameters are set; this point will also be disclosed in subsequent specific embodiments of the present invention. In addition, the aforementioned audiovisual data, as well as film and television data, cover various situations such as the upper layer, the same layer, and the lower layer. I will not repeat them here.
如前所述,本发明第1、2类具体实施方案实现了这样的技术效果:在播放一个由一段以上的音频和一幅以上的图片所组成的音图数据时,由于引入了图片音频对准参数或者图片对准参数,就可以使得任何一段图片音频都可以和与之相应的图片关联播放,从而做到播放的图片和音频具有用户所需要的相关性;使得现有技术中的音频与图片无法关联的缺陷得到克服。更为有意义的是:这种图片和音频的对应,使得音图数据作为一种新的数字媒体形式,可以被方便地制作和灵活地播放。As mentioned before, the specific embodiments of the first and second categories of the present invention achieve such a technical effect: when playing a sound image data composed of more than one audio and more than one picture, due to the introduction of the picture audio pair The quasi-parameters or picture alignment parameters can make any piece of picture audio can be associated with the corresponding picture to play, so that the played picture and audio have the correlation required by the user; making the audio in the prior art The defect that the picture cannot be connected is overcome. What is more meaningful is that this correspondence between pictures and audio makes the audiogram data as a new form of digital media, which can be easily produced and played flexibly.
本发明的第3类具体实施方式是关与音视图数据是影视数据的情形。在这种情况下,当然,首先还是下载并解析该音视图数据,在这个音视图数据的影视数据中,通常包含一个视频数据或者一个动画数据,实际上,无论是视频数据还是动画数据,在观看者那里的视觉感受是没有太大的区别的,无非都是一些列动态影像和伴音音频的组合。只是视频数据和动画数据在存储格式上有些区别而已。在从影视数据中解析出视频数据或者动画数据后,就可以开始播放它们。The third embodiment of the present invention is the case where the audio and video data is video data. In this case, of course, the audio view data is first downloaded and parsed. The video data of this audio view data usually contains a video data or an animation data. In fact, whether it is video data or animation data, There is not much difference in the visual experience of the viewers. They are just a combination of a series of dynamic images and audio and audio. It's just that there is a difference in storage format between video data and animation data. After parsing video data or animation data from movie data, you can start playing them.
在执行上述操作的同时或者之后,本发明的一个必要的步骤是需要从下载的音视图数据中解析出其中所携带的下层音图标识、下层影视标识。如前所述,本发明的众多具体的实施方式中,包括在上层的一个音视图数据播放的时候,可以插入播放下层的音视图数据的方案。而这类方案的实现,需要在音视图数据中事先设置好需要插入播放的音视图数据的标识。这种音视图数据的标识可以是关于音图数据的标识,也可以是关于影视数据的标识,或者两种兼而有之,具体视需要插入的数据类型去置入。这些标识分别被称作下层音图标识、下层影视标识,分别用来告知播放设备需要插入的有哪些音视图数据。此外,需要说明的是:这些下层音图标识、下层影视标识分别可以是一个或者多个。At the same time or after performing the above operations, a necessary step of the present invention is to parse out the lower-layer audiograph logo and the lower-layer video logo carried in the downloaded audio-view data. As mentioned above, in many specific embodiments of the present invention, when a sound view data of the upper layer is played, a scheme of playing the sound view data of the lower layer may be inserted. For the realization of this kind of solution, the identifier of the audio view data to be inserted and played needs to be set in advance in the audio view data. The identification of the audiovisual data may be the identification of the audiographic data, the identification of the video data, or both, depending on the type of data to be inserted. These logos are called the lower audio image logo and the lower video logo, respectively, and are used to inform the playback device which audio view data needs to be inserted. In addition, it should be noted that: these lower-layer audiographic logos and lower-layer video logos may be one or more, respectively.
本发明第3类具体实施方案实现了这样的技术效果:由于引入了下层音图标识、下层影视标识,就可以在播放当前的影视数据的时候,随时可以插入播放一个下层的音图数据或者影视数据。这为某些知识性音视图数据的制作和播放提供了引用播放背景信息和知识的手段;更实现了音视图数据的多层次结构。The third specific embodiment of the present invention achieves such a technical effect: due to the introduction of the lower layer audiograph logo and the lower layer movie logo, you can insert and play a lower layer audiogram data or video at any time while playing the current film and television data data. This provides a means of referencing and playing background information and knowledge for the production and playback of certain knowledge-based audiovisual data; it also realizes a multi-level structure of audiovisual data.
在本发明上述的所有具体实施方式是本发明最为基础的若干类具体的技术方案,其中,所有的音视图数据被下载后,其播放的条件都可能有三种,第一种是下载后自动播放,第二种是按照相应的对准参数的指示进行播放,第三种则是在接收到播放图片音频或者图片的命令,或者接收到播放视频数据或者动画数据的命令时开始播放。All the above specific embodiments of the present invention are the most basic types of specific technical solutions of the present invention. After all the audio view data is downloaded, there may be three playing conditions. The first is automatic playback after downloading. The second is to play according to the instructions of the corresponding alignment parameters, and the third is to start playing when a command to play picture audio or pictures, or a command to play video data or animation data is received.
在本发明上述的所有具体实施方式中,无论是音图数据还是影视数据以及各自包含在其中的各个组成,都可以被看做是上层的数据或者标识。因此,在本发明的文本中,均在它们的前面增加了定语“上层”,以表示它们在本发明的音视图数据播放技术方案中的具体位置。In all the specific embodiments of the present invention described above, whether it is audiovisual data, video data, and the respective components contained therein, it can be regarded as upper layer data or identification. Therefore, in the text of the present invention, the attributive "upper layer" is added in front of them to indicate their specific positions in the audiovisual data playback technical solution of the present invention.
如前所述,本发明的音图数据中,可以有a.一个图片音频和多个图片,b.多幅图片音频和一个图片,c.多个图片音频和多幅图片以及d.一个图片音频和一幅图片等四种情况,这就使得音图数据的产生非常灵活。人们可以根据音图数据的生成、播放、相互引用以及嵌套播放来生成各种类型的音图数据。基于这样的背景,就有了将多个相互独立的音图数据串行播放的情形。本发明把这些相互独立且串行播放的音图数据称为:同层音图数据,在音视图数据这个整体下,它们之间的结构关系是同层关系。As mentioned above, in the audiogram data of the present invention, there can be a. one picture audio and multiple pictures, b. multiple picture audio and one picture, c. multiple picture audio and multiple pictures and d. one picture There are four situations such as audio and a picture, which makes the generation of audiogram data very flexible. People can generate various types of audiographic data according to the generation, playback, mutual reference, and nested playback of audiographic data. Based on this background, there are cases where multiple mutually independent audiographic data are played in series. The present invention refers to these mutually independent and serially played audiogram data as: audiogram data of the same layer. Under the overall audiogram data, the structural relationship between them is the same layer relationship.
为此,基于前述本发明的第1、2、3类具体实施方式中的任何一种,在本发明第4类具体的实施方式中,就需要在音视图数据中设置同层音图标识。当设置有同层音图标识的同层音视图数据被下载后,播放设备可以将它们从中解析并提取出来。在同层音图标识之中,一般设置有同层音图下载参数和同层音图播放参数,同层音图下载参数用以指示播放设备如何下载对应的同层音图数据,例如:同层音图下载参数可以直接就是一个链接地址,用以指向被下载的同层音图数据的互联网地址;再例如:它也可以是一个代码串,播放设备获得这个代码串以后,可以向固定的服务器发送带有这个代码串的下载请求,服务器端则根据带有这个代码串的请求来生成或者查询对应的同层音图数据,并与相应的播放设备做进一步的下载操作。同层音图播放参数则是用来指示播放设备在什么时候,以什么样的方式来播放下载的同层音图数据。显然,在一个音视图数据中,除了第一个被下载的音图数据之外,与该第一个被下载音图数据处于同一层的其他音图数据才是同层音图数据。同层音图数据可以有多个,与此相对应,同 层音图标识也相应地被设置为多个,它们分别于同层音图数据相对应。For this reason, based on any one of the aforementioned specific embodiments of the first, second, and third categories of the present invention, in the fourth specific embodiment of the present invention, it is necessary to set the same layer audiogram identifier in the audioview data. When the same layer audio view data set with the same layer audio image identifier is downloaded, the playback device can parse and extract them from it. In the same layer audiograph identification, the same layer audiograph download parameter and the same layer audiograph playback parameter are generally set. The same layer audiograph download parameter is used to instruct the playback device how to download the corresponding same layer audiograph data, for example: The layer audiogram download parameter can be directly a link address to point to the Internet address of the same layer audiogram data to be downloaded; for another example: it can also be a code string, after the playback device obtains this code string, it can be sent to a fixed The server sends a download request with this code string, and the server generates or queries corresponding audiogram data of the same layer according to the request with this code string, and performs further download operations with the corresponding playback device. The audio layer playback parameters of the same layer are used to indicate when and in what manner the playback device plays the downloaded audio layer data of the same layer. Obviously, in one sound view data, except for the first downloaded sound image data, other sound image data on the same layer as the first downloaded sound image data is the same layer sound image data. There can be multiple audiogram data of the same layer. Correspondingly, the audiogram identifiers of the same layer are correspondingly set to be multiple, and they correspond to the audiogram data of the same layer.
在前述本发明第4类具体实施方式中,仅涉及了同层音图数据这一种情形。实际上,与第一个被下载、播放的音图数据以及前述同层音图数据具有同层、串行播放关系的还有同层影视数据。也就是说:在一个音视图数据“链”中,也可以存在同层音图数据和同层影视数据依次混合串行的情形。In the foregoing fourth embodiment of the present invention, only the case of audiogram data of the same layer is involved. In fact, the same layer and serial playback relationship with the first downloaded and played audiogram data and the aforementioned audiogram data of the same layer also have the same layer of video data. That is to say: in a "chain" of audiovisual data, there can also be a situation in which audiovisual data of the same layer and video data of the same layer are serially mixed in sequence.
为此,基于前述本发明的第1、2、3类具体实施方式中的任何一种,在本发明第5类具体的实施方式中,还可以在音视图数据中设置同层影视标识。当设置有同层影视标识的同层音视图数据被下载后,播放设备可以将它们从中解析并提取出来。在同层影视标识之中,一般设置有同层影视下载参数和同层影视播放参数,同层影视下载参数用以指示播放设备如何下载对应的同层影视数据,例如:同层影视下载参数可以直接就是一个链接地址,用以指向被下载的同层影视数据的互联网地址;再例如:它也可以是一个代码串,播放设备获得这个代码串以后,可以向固定的服务器发送带有这个代码串的下载请求,服务器端则根据带有这个代码串的请求来生成或者查询对应的同层影视数据,并与相应的播放设备做进一步的下载操作。同层影视播放参数则是用来指示播放设备在什么时候,以什么样的方式来播放下载的同层影视数据。显然,在一个音视图数据中,除了第一个被下载的音图数据之外,与该第一个被下载影视数据处于同一层的其他影视数据才是同层影视数据。同层影视数据可以有多个,与此相对应,同层影视标识也相应地被设置为多个,它们分别于同层影视数据相对应。For this reason, based on any one of the foregoing specific embodiments of the first, second, and third categories of the present invention, in the specific embodiment of the fifth category of the present invention, it is also possible to set the same layer of video identification in the audio view data. After the audio data of the same layer set with the film and television identification of the same layer is downloaded, the playback device can parse and extract them. In the same layer of film and television logo, the same layer of film and television download parameters and the same layer of film and television playback parameters are set. The same layer of film and television download parameters are used to instruct the playback device how to download the corresponding film and television data of the same layer, for example: the same layer of film and television download parameters can Directly is a link address to point to the Internet address of the downloaded film and television data of the same layer; another example: it can also be a code string, after the playback device obtains this code string, it can send this code string to a fixed server For the download request of the server, the server generates or queries the corresponding film and television data of the same layer according to the request with this code string, and performs further download operations with the corresponding playback device. The same layer video playback parameters are used to indicate when and in what manner the playback device plays the downloaded same layer video data. Obviously, in the audio view data, in addition to the first downloaded audio image data, other video data on the same layer as the first downloaded video data is the same layer of video data. There can be multiple film and television data on the same layer. Correspondingly, the film and television identifications on the same layer are correspondingly set to be multiple, and they correspond to the film and television data on the same layer.
前述的本发明第4类和第5类具体实施方式,在很多情况下是结合在一起使用的,这些情况实际上是在一个音视图数据中,除了首先下载播放的那个音图数据之外,此后还存在一个或者多个同层音图数据、一个或者多个同层影视数据;这些同层音图数据和同层影视数据的排放顺序可以是任意的。The foregoing specific embodiments of the fourth and fifth categories of the present invention are used together in many cases. These situations are actually in a sound view data, except for the sound picture data that is downloaded and played first, Thereafter, there are one or more audiogram data of the same layer, and one or more video data of the same layer; the order of the discharge of the audiogram data of the same layer and the video data of the same layer can be arbitrary.
本发明第6类具体实施方式:前述本发明第4类和第5类具体实施方式中,涉及的仅仅是第一个被下载和播放的是音图数据的情形。事实上,对于第一个被下载和播放的是影视数据的情形,也同样存在同层音图数据和/或同层影视数据顺序下载和播放的情形。有关同层音图数据和/或同层 影视数据顺序下载和播放的具体实施方式,与前述本发明第4和第5具体实施方式中相应的同层音图数据和/或同层影视数据的下载和播放方案是相同的。Embodiment 6 of the present invention: In the foregoing embodiments 4 and 5 of the present invention, it is only concerned that the first one to be downloaded and played is audiogram data. In fact, for the first case where the video data is downloaded and played, the same layer of audiovisual data and/or the same layer of video data are also downloaded and played in sequence. The specific embodiment of the sequential downloading and playing of the audiovisual data of the same layer and/or the video data of the same layer corresponds to the audiovisual data of the same layer and/or video data of the same layer in the fourth and fifth specific embodiments of the present invention. The download and playback scheme is the same.
上述有关本发明第4类、第5类和第6类具体实施方式,一方面,一个或者多个同层音图数据、一个或者多个同层影视数据串行播放,为音视图数据组织、下载和播放提供了非常灵活的优势;另一方面,这种技术方案也可以被用来合理地配置和使用网络带宽。由于同层音图数据、同层影视数之间的播放关系是串行,在网络带宽资源有限的情况下,可以按照播放的次序,分批次下载不同的同层音图数据和同层影视数据;这样能更加有效地利用带宽资源,给用户带来更为良好的播放体验。The above-mentioned specific embodiments of the fourth, fifth, and sixth categories of the present invention. On the one hand, one or more audio data of the same layer, one or more video data of the same layer are serially played, organized for audio view data, Downloading and playing provide a very flexible advantage; on the other hand, this technical solution can also be used to reasonably configure and use network bandwidth. Since the playback relationship between the audiovisual data of the same layer and the number of movies and videos of the same layer is serial, in the case of limited network bandwidth resources, different audiographic data and video of the same layer can be downloaded in batches in accordance with the order of playback. Data; this can make more efficient use of bandwidth resources and bring users a better playback experience.
如此前本发明具体实施方式4、5、6所述的那样,尽管同层音图数据和/或同层影视数据是根据同层音图标识和/或同层影视标识下载的,但对于下载到播放设备的这些同层音图数据和/或同层影视数据的播放,还需要一个合适的操作处理,而不是简单地下载了就开始播放。因此,在本发明前述各个具体实施方式中任何一种的基础上,本发明第7类具体实施方式中提供了如下的具体方案:As described in the previous specific embodiments 4, 5, and 6 of the present invention, although the audiovisual data and/or video data of the same layer are downloaded according to the audiographic identification and/or video identification of the same layer, for the download The playback of the same layer of audiovisual data and/or the same layer of video data to the playback device also requires a proper operation process, rather than simply downloading and starting to play. Therefore, based on any one of the foregoing specific embodiments of the present invention, the following specific solutions are provided in the seventh specific embodiment of the present invention:
在同层音图播放参数所指示的播放时间到达的时候,需要终止播放上层图片音频或者上层图片,或者,终止播放上层视频数据或者上层动画数据;然后再播放同层图片音频或者同层图片,并在同层图片对准参数或者同层音频对准参数所指示的播放时间到达时,播放对应的同层图片或者同层图片音频。这样做的目的在于:如果在同层音图播放参数所指示的播放时间到达的时候,上层音图数据或者上层影视数据还处于正在播放的状态,则需要终止上层音图数据或者上层影视数据的播放,然后才能开始播放同层音图数据;从而避免了上层音图数据或者上层影视数据与同层音图数据同时播放。When the playback time indicated by the same layer audio image playback parameter arrives, you need to stop playing the upper layer image audio or upper layer image, or stop playing the upper layer video data or upper layer animation data; then play the same layer image audio or same layer image, And when the playback time indicated by the same layer picture alignment parameter or the same layer audio alignment parameter arrives, the corresponding same layer picture or the same layer picture audio is played. The purpose of this is: if the upper layer audiogram data or the upper layer video data is still playing when the playback time indicated by the same layer audiogram playback parameters arrives, you need to terminate the upper layer audiogram data or the upper layer video data Play, then you can start playing the same layer of audiographic data; thus avoiding the upper layer audiographic data or upper layer video data and the same layer of audiographic data playing at the same time.
此外,有的时候,当同层音图数据已经被下载到播放设备之中,而上层音图数据或者上层影视数据还没有播放完,用户去希望立即播放同层音图数据或者同层影视数据,这种情形应该也是常见的。在本发明第7类具体实施方式中提供了可以由用户来直接干预同层音图数据或者同层影视数据播放的方式,即:在接收到启动播放同层音图数据或者影视数据的用 户命令时,终止播放所述上层图片音频或者上层图片;或者,终止播放所述上层视频数据或者上层动画数据;In addition, sometimes, when the audiogram data of the same layer has been downloaded to the playback device, and the upper layer audiogram data or the upper layer video data has not been played, the user wants to immediately play the same layer audiogram data or the same layer video data , This situation should also be common. The seventh embodiment of the present invention provides a way for the user to directly intervene in the playback of audiovisual data of the same layer or video data of the same layer, that is: upon receiving a user command to start the playback of audiographic data or video data of the same layer When, stop playing the audio of the upper layer picture or the upper layer picture; or, stop playing the upper layer video data or the upper layer animation data;
与前述播放同层音图数据的操作基本相同,在播放同层影视数据的时候,也分为同样的两种情形,针对这两种情形,在本发明前述第1-6类具体实施方式中任何一种的基础上,本发明第8类具体实施方式如下:It is basically the same as the operation of playing the audiovisual data of the same layer. When playing the video data of the same layer, it is divided into the same two situations. For these two situations, in the specific embodiments of the foregoing categories 1-6 of the present invention On the basis of any one, the eighth specific implementation of the present invention is as follows:
在同层影视播放参数所指示的播放时间到达时,终止播放上层图片音频或者上层图片;或者,终止播放上层视频数据或者上层动画数据;然后开始播放已经下载的同层视频数据或者同层动画数据。When the playback time indicated by the same layer of video playback parameters arrives, stop playing the upper layer image audio or upper layer picture; or, stop playing the upper layer video data or upper layer animation data; and then start playing the downloaded same layer video data or same layer animation data .
此外,在接收到启动播放同层影视数据的用户命令时,终止播放上层图片音频或者上层图片;或者,终止播放上层视频数据或者上层动画数据;然后开始播放已经下载的同层视频数据或者同层动画数据。In addition, when receiving a user command to start playing the same layer of video data, stop playing the upper layer image audio or upper layer picture; or, stop playing the upper layer video data or upper layer animation data; and then start playing the downloaded layer video data or the same layer Animation data.
前述本发明第7类和第8类具体实施方式的技术方案主要提供了播放同层音图数据或者同层影视数据的方案;其中,均仅仅涉及了启动播放同层音图数据或者同层影视数据的操作;在一些情况下,该同层音图数据或者同层影视数据播放完成后,则播放操作停止。但是,如前所述,在一些情况下,除了当前播放的这个同层音图数据或者同层影视数据之外,还存在其他同层音图数据或者同层影视数据,在音视图数据中,也存在着多个同层音图标识和/或同层影视标识以指示播放设备:存在多个同层音图数据或者同层影视数据。这个时候,就需要对如何从当前音图数据或者影视数据的播放切换到其他同层音图数据或者同层影视数据的播放进行处理。The foregoing technical solutions of the specific embodiments of the 7th and 8th embodiments of the present invention mainly provide a solution for playing the audiovisual data of the same layer or the video data of the same layer; all of them only involve starting the playback of the audiographic data of the same layer or the video of the same layer. Data operation; in some cases, after the same layer of audiovisual data or the same layer of video data is played, the playback operation stops. However, as mentioned above, in some cases, in addition to the currently playing audiovisual data or audiovisual data of the same layer, there are other audiographic data or audiovisual data of the same layer. In the audio view data, There are also multiple audiogram identifications of the same layer and/or video identifications of the same layer to indicate the playback device: there are multiple audiographic data of the same layer or video data of the same layer. At this time, it is necessary to handle how to switch from the current audiovisual data or video data playback to other audiovisual data of the same layer or video data of the same layer.
在本发明第9类具体实施方式的技术方案中,基于前述本发明第7类和第8类具体实施方式当前同层音图数据的播放的情形,还进一步提供了如下的方案:当接收到终止播放当前正在播放的同层音图数据的用户命令,或者当前同层音图数据播放结束时;根据前述同层音图标识和/或同层影视标识,顺序播放其他同层音图数据或者同层影视数据。In the technical solution of the ninth specific embodiment of the present invention, based on the foregoing situation of playing the same layer audiogram data of the specific embodiments of the present invention of the seventh and eighth embodiments, the following solution is further provided: User command to stop playing the same layer audiogram data currently being played, or the end of the current layer audiogram data playback; according to the aforementioned same layer audiogram logo and/or the same layer video logo, sequentially play other layer audiogram data or Video data on the same layer.
类似地,在本发明第10类具体实施方式的技术方案中,基于前述本发明第7类和第8类具体实施方式当前同层影视数据的播放的情形,还进一步提供了如下的方案:当接收到终止播放当前正在播放的同层影视数据的用户命令,或者当前同层影视数据播放结束时;根据前述同层音图标识和/或同层影视标识,顺序播放其他同层音图数据或者同层影视数据。Similarly, in the technical solution of the tenth embodiment of the present invention, based on the current situation of the same layer of the seventh and eighth embodiments of the present invention playing the same layer of video data, the following solutions are further provided: Receive a user command to stop playing the same layer of film and television data currently being played, or when the current layer of film and television data is finished playing; according to the aforementioned same layer audiograph logo and/or the same layer film and television logo, sequentially play other same layer audiovisual data or Video data on the same layer.
本发明前述第9类、第10类具体实施方式,提供了在音视图数据中存在多个同层音图数据和/或同层影视数据的情形下,如何实现多个同层音图数据和/或同层影视数据播放的方案,使得如前所述的音视图数据“链”获得丰富的播放方式。另外,正是由于本发明第9类、第10类具体实施方式的提供,也为以音视图数据为基础的音视图节目的制作和表达,有了相比于传统影视节目的剪辑、非线性编辑更为丰富和灵活的表达方式。The foregoing specific embodiments of the 9th and 10th categories of the present invention provide how to implement multiple audiogram data of the same layer and multiple audiogram data of the same layer in the audio view data when there are multiple audiogram data and/or video data of the same layer. /Or the same layer of film and television data playback program, so that the aforementioned audio view data "chain" has a rich playback method. In addition, it is precisely because of the provision of the specific embodiments of the 9th and 10th categories of the present invention, as well as the production and expression of audio view programs based on audio view data, there are clips and non-linearities compared to traditional film and television programs. The editor has richer and more flexible expressions.
需要额外再指出的是:对于任何音图数据、影视数据,都存在前述“上层”、“下层”和“同层”这三个概念。It should be additionally pointed out that for any audiovisual data and video data, there are the aforementioned three concepts of "upper layer", "lower layer" and "same layer".
具体而言,对于一个“上层音图数据”,根据音视图数据中的同层音图标识、同层影视标识的指示,都有可能存在“同层音图数据”和“同层影视数据”,而前述的“上层音图数据”相对于它的“同层音图数据”和“同层影视数据”而言,也是“同层音图数据”。同样,对于一个“上层影视数据”,根据音视图数据中的同层音图标识、同层影视标识的指示,都有可能存在“同层音图数据”和“同层影视数据”,而前述的“上层影视数据”相对于它的“同层音图数据”和“同层影视数据”而言,也是“同层影视数据”。Specifically, for an "upper layer audiogram data", according to the instructions of the same layer audiogram identifier and the same layer movie and television label in the audio view data, there may be "same layer audiogram data" and "same layer audiovisual data" And the aforementioned "upper layer audiogram data" is also "same layer audiogram data" relative to its "same layer audiogram data" and "same layer audiovisual data". Similarly, for an "upper film and television data", according to the instructions of the same layer audiograph and audiovisual data in the audio view data, there may be "same layer audiograph data" and "same layer audiovisual data". Compared with its "same layer audiovisual data" and "same layer video data", its "upper layer video data" is also "same layer video data".
此外,在此后的若干类具体实施方式中,还会出现“下层音图数据”和“下层影视数据”。所谓“下层音图数据”和“下层影视数据”是相对于“上层音图数据”和“上层影视数据”而言的。即:“下层音图数据”和“下层影视数据”是“上层音图数据”和“上层影视数据”的下层。但是,就像任何一个“上层音图数据”或者“上层影视数据”都会具有“同层音图数据”和/或“同层影视数据”一样,任何一个“下层音图数据”或者“下层影视数据”也都会具有“同层音图数据”和/或“同层影视数据”;并且,任何一个“下层音图数据”或者“下层影视数据”相对于它的“同层音图数据”和“同层影视数据”而言,也是“同层音图数据”或者“同层影视数据”。由此可知:“上层音图数据”或者“上层影视数据”的“同层音图数据”和“同层影视数据”与“下层音图数据”或者“下层影视数据”的“同层音图数据”和“同层影视数据”之间,也同样是“上层音图数据”或者“上层影视数据”与“下层音图数据”或者“下层影视数据”之间的关系。In addition, in several specific implementations thereafter, "lower layer audiographic data" and "lower layer video data" will also appear. The so-called "lower layer audiographic data" and "lower layer audiovisual data" are relative to "upper layer audiographic data" and "upper layer audiovisual data". That is: "lower layer audiographic data" and "lower layer video data" are the lower layers of "upper layer audiogram data" and "upper layer video data". However, just like any “upper layer audiographic data” or “upper layer audiovisual data” will have “same layer audiographic data” and/or “same layer audiovisual data”, any “lower layer audiographic data” or “lower layer audiovisual data” The data will also have "same layer audiographic data" and/or "same layer audiovisual data"; and, any "lower layer audiographic data" or "underlayer audiovisual data" relative to its "same layer audiographic data" and "Same layer video data" is also "same layer audiovisual data" or "same layer video data". It can be seen from this: "Same layer audiographic data" and "Same layer audiovisual data" of "Upper layer audiographic data" or "Upper layer audiovisual data" and "Same layer audiographic data" of "Lower layer audiographic data" or "Lower layer audiovisual data" The relationship between "data" and "same layer video data" is also the relationship between "upper layer audiographic data" or "upper layer audiovisual data" and "lower layer audiographic data" or "lower layer audiovisual data".
在此后的各类具体实施方式,均涉及下层“下层音图数据”和/或“下层影视数据”。The various specific implementations thereafter refer to the lower layer "lower layer audiographic data" and/or "lower layer video data".
为了能够在播放一个上层音图数据的时候,还能够插入播放下层音图数据或者下层影视数据,在本发明的音视图数据中,还需要设置下层音图标识和/或下层影视标识;在下层音图标识中,至少包含下层音图下载参数和下层音图播放参数;其中,下层音图下载参数用来指示播放设备如何下载相应的音图数据,而下层音图播放参数用来指示播放设备在什么时间开始播放相应的音图数据;在下层影视标识中,至少包含下层影视下载参数和下层影视播放参数;其中,下层影视下载参数用来指示播放设备如何下载相应的影视数据,而下层影视播放参数用来指示播放设备在什么时间开始播放相应的影视数据。In order to be able to insert and play the lower layer audiogram data or the lower layer video data when playing an upper layer audiogram data, in the audio view data of the present invention, it is also necessary to set the lower layer audiogram data and/or the lower layer video symbol; in the lower layer The audiogram identification includes at least the lower layer audiogram download parameters and the lower layer audiogram playback parameters; where the lower layer audiogram download parameters are used to instruct the playback device how to download the corresponding audiogram data, and the lower layer audiogram playback parameters are used to instruct the playback device When to start playing the corresponding audiovisual data; the lower layer movie logo contains at least the lower layer movie download parameters and the lower layer movie playback parameters; where the lower layer movie download parameters are used to instruct the playback device how to download the corresponding movie data, while the lower layer movie The playback parameter is used to indicate when the playback device starts playing the corresponding video data.
为此,在本发明第12类具体实施方式中,在本发明第1、2、3类具体实施方式的基础上,还包括对音视图数据进行解析,并从中解析获得下层影视标识的操作。如果音视图数据中存在前述的下层影视标识,说明在上层影视数据播放的时候,需要插入播放一个或者多个下层影视数据;由此,播放设备会根据下层影视标识下载与该下层影视下载参数对应的下层影视数据;其中:该下层影视数据的数据内容和结构与上层影视数据的数据内容和结构是相同的,至少由下层视频数据或者下层动画数据。For this reason, in the twelfth embodiment of the present invention, on the basis of the first, second, and third embodiments of the present invention, it also includes the operation of parsing the audiovisual data and parsing it to obtain the lower-layer video logo. If there is the aforementioned lower film and television logo in the audio view data, it means that when the upper film and television data is played, one or more lower film and television data needs to be inserted and played; thus, the playback device will download the corresponding to the lower film and television download parameters according to the lower film and television logo The lower layer film and television data; wherein: the data content and structure of the lower layer film and television data and the upper layer film and television data are the same, at least by the lower layer video data or lower layer animation data.
本发明第12类具体实施方式在本发明第1、2、3类具体实施方式的基础上提供了基于下层影视标识下载下层影视数据的具体技术方案,为在播放上层音图数据或者影视数据时,进一步插入播放下层影视数据提供了数据准备的操作。The twelfth specific embodiment of the present invention provides a specific technical solution for downloading lower-layer video data based on the lower-level film and television logo on the basis of the specific embodiments of the first, second, and third types of the present invention. , Further inserting and playing the lower layer video data provides data preparation operation.
在本发明第11、12类具体实施方式中,下层音图下载参数和下层影视下载参数与前述同层音图下载参数或者同层影视下载参数是类似的,可以直接就是一个链接地址,用以指向被下载的下层音图数据或者下层影视数据的互联网地址;它也可以是一个代码串,播放设备获得这个代码串以后,可以向固定的服务器发送带有这个代码串的下载请求,服务器端则根据带有这个代码串的请求来生成或者查询对应的下层音图数据或者下层影视数据,并与相应的播放设备做进一步的下载操作。In the specific embodiments of the 11th and 12th categories of the present invention, the lower layer audiogram download parameters and the lower layer video download parameters are similar to the aforementioned same layer audiogram download parameters or the same layer video download parameters, which can be directly a link address to It points to the Internet address of the downloaded lower layer audiovisual data or lower layer film and television data; it can also be a code string. After the playback device obtains this code string, it can send a download request with this code string to a fixed server, and the server side According to the request with this code string, generate or query the corresponding lower layer audiographic data or lower layer film and television data, and do further download operations with the corresponding playback device.
在本发明第13类具体实施方式中,基于前述第11类具体实施方式, 由于在本发明第11类具体的实施方式中,为下层音图数据的播放下载了相应的下层音图数据;因此可以进一步执行如下的操作:在下层音图播放参数所指示的播放时间到达时,或者,在接收到启动播放下层音图数据的用户命令时,中止播放当前正在播放的上层图片音频或者上层图片,并生成对应的上层音图数据中止播放标记;或者,中止播放所述上层视频数据或者上层动画数据,并生成对应的上层影视数据中止播放标记;In the 13th embodiment of the present invention, based on the aforementioned 11th embodiment, since in the 11th embodiment of the present invention, the corresponding lower layer audiogram data is downloaded for the playback of the lower layer audiogram data; The following operations may be further performed: when the playback time indicated by the playback parameters of the lower-layer audiogram reaches, or, when a user command to start playing the lower-layer audiogram data is received, the playback of the audio or upper-layer image of the currently-playing upper-layer image is suspended, And generate a corresponding upper layer audiovisual data suspension playing mark; or, stop playing the upper layer video data or upper layer animation data, and generate a corresponding upper layer film and television data suspension playing mark;
由于下层音图数据的播放是在上层音图数据或者上层影视数据播放的时候插入播放的,因此,在下层音图数据播放结束后,还需要返回到原来的插入点,并从该插入点起继续播放后续还没有播放的上层音图数据或者上层影视数据,所以,需要在播放下层音图数据之前对前述的插入点进行记录,以确保能返回。因此,本发明的第13类具体实施方式中提供了生成对应的上层音图数据终止播放标记或者上层影视数据中止播放标记的方案。需要另外说明的是:如前所述,相对于本发明的下层音图数据而言,它可以具有多个上层音图数据和/或上层影视数据;因此,在一些情况下,插入播放下层音图数据的时间点,可能会恰好在两个上层数据的播放之间,即:在一个上层音图数据播放结束后,并且在一个上层音图数据或者上层影视数据播放之前;或者,在一个上层影视数据播放结束后,并且在一个上层音图数据或者上层影视数据播放之前。在这样的情形下,同样也属于前述插入播放。因此也同样需要生成对应的上层音图数据终止播放标记或者上层影视数据中止播放标记。Since the playback of the lower layer audiogram data is inserted when the upper layer audiogram data or the upper layer video data is played, after the playback of the lower layer audiogram data ends, you need to return to the original insertion point and start from the insertion point Continue to play the upper layer audiogram data or upper layer video data that have not been played later, so you need to record the aforementioned insertion point before playing the lower layer audiogram data to ensure that you can return. Therefore, the thirteenth embodiment of the present invention provides a solution to generate a corresponding playback tag for the upper layer audiogram data or a playback tag for the upper layer video data. It should be additionally noted that, as mentioned above, relative to the lower layer audiographic data of the present invention, it may have multiple upper layer audiographic data and/or upper layer video data; therefore, in some cases, the lower layer audio is inserted and played The time point of the image data may be exactly between the playback of the two upper layer data, that is: after the playback of an upper layer audiogram data and before the playback of an upper layer audiogram data or upper layer video data; or, in an upper layer After playing the video data, and before playing the upper layer audiovisual data or the upper layer video data. In this case, it also belongs to the aforementioned insert play. Therefore, it is also necessary to generate a corresponding upper layer audiograph data termination playback mark or upper layer video data termination playback mark.
在前述中止播放标记被生成的同时或之后,执行播放下层图片音频或者下层图片,并在下层图片对准参数或者下层音频对准参数所指示的播放时间到达时,播放对应的下层图片或者下层图片音频。Simultaneously with or after the aforementioned stop play flag is generated, the playback of the lower layer picture audio or lower layer picture is performed, and when the playback time indicated by the lower layer picture alignment parameter or lower layer audio alignment parameter arrives, the corresponding lower layer picture or lower layer picture is played Audio.
在本发明第14类具体实施方式中,基于前述第11类具体实施方式,由于在本发明第11类具体的实施方式中,为下层影视数据的播放下载了相应的下层影视数据;因此可以进一步执行如下的操作:在下层影视播放参数所指示的播放时间到达时,或者,在接收到启动播放下层影视数据的用户命令时,中止播放当前正在播放的上层图片音频或者上层图片,并生成对应的上层音图数据中止播放标记;或者,中止播放所述上层视频数据或者上层动画数据,并生成对应的上层影视数据中止播放标记;In the 14th specific embodiment of the present invention, based on the aforementioned 11th specific embodiment, since in the 11th specific embodiment of the present invention, the corresponding lower layer video data is downloaded for the playback of the lower layer video data; therefore, it can be further Perform the following operations: when the playback time indicated by the lower-layer video playback parameters arrives, or when a user command to start playing the lower-layer video data is received, stop playing the audio or the upper-layer image currently playing, and generate the corresponding The upper layer audiovisual data suspends the playback mark; or, suspends the playback of the upper layer video data or the upper layer animation data, and generates the corresponding upper layer film and television data suspends the playback mark;
由于下层影视数据的播放是在上层音图数据或者上层影视数据播放的时候插入播放的,因此,在下层影视数据播放结束后,还需要返回到原来的插入点,并从该插入点起继续播放后续还没有播放的上层音图数据或者上层影视数据,所以,需要在播放下层音图数据之前对前述的插入点进行记录,以确保能返回。因此,本发明的第14类具体实施方式中提供了生成对应的上层音图数据终止播放标记或者上层影视数据中止播放标记的方案。需要另外说明的是:如前所述,相对于本发明的下层影视数据而言,它可以具有多个上层音图数据和/或上层影视数据;因此,在一些情况下,插入播放下层影视数据的时间点,可能会恰好在两个上层数据的播放之间,即:在一个上层音图数据播放结束后,并且在一个上层音图数据或者上层影视数据播放之前;或者,在一个上层影视数据播放结束后,并且在一个上层音图数据或者上层影视数据播放之前。在这样的情形下,同样也属于前述插入播放。因此也同样需要生成对应的上层音图数据终止播放标记或者上层影视数据中止播放标记。Because the playback of the lower layer video data is inserted when the upper layer audiovisual data or the upper layer video data is played, after the playback of the lower layer video data ends, you need to return to the original insertion point and continue to play from the insertion point There is no upper layer audiographic data or upper layer video data that has not been played later, so you need to record the aforementioned insertion point before playing the lower layer audiographic data to ensure that you can return. Therefore, the 14th embodiment of the present invention provides a solution for generating a corresponding playback tag for the upper layer audiogram data or a playback tag for the upper layer video data. It should be additionally noted that, as mentioned above, relative to the lower-layer video data of the present invention, it may have multiple upper-layer audiographic data and/or upper-layer video data; therefore, in some cases, the lower-layer video data is inserted and played The time may be exactly between the playback of the two upper layer data, that is: after the playback of an upper layer audiogram data, and before the playback of an upper layer audiogram data or upper layer video data; or, in an upper layer video data After the playback ends, and before the playback of an upper layer audiovisual data or upper layer video data. In this case, it also belongs to the aforementioned insert play. Therefore, it is also necessary to generate a corresponding upper layer audiograph data termination playback mark or upper layer video data termination playback mark.
在前述中止播放标记被生成的同时或之后,执行播放下层影视数据中的下层视频数据或者下层动画数据的操作。Simultaneously with or after the aforementioned stop play flag is generated, an operation of playing the lower layer video data or the lower layer animation data in the lower layer movie data is performed.
本发明第13、14类具体实施方式,是在本发明前述第1、2、3类具体实施方式的基础上,增加了插入播放下层音图数据或者下层影视数据的技术方案。这两类技术方案还分别可以引入结合本发明前述第4至12类具体实施方式的技术方案。因此,可以带来如下更为有意义的技术效果:The specific embodiments of categories 13 and 14 of the present invention are based on the foregoing specific embodiments of categories 1, 2, and 3 of the present invention, and a technical solution for inserting and playing lower layer audiographic data or lower layer video data is added. These two types of technical solutions can also introduce technical solutions combining the specific embodiments of the foregoing categories 4 to 12 of the present invention. Therefore, it can bring the following more meaningful technical effects:
如前所述,本发明第1-12类具体的实施方式,提供了多种同层音图数据和/或同层影视数据链式播放的方案,既可以一次只播放一个音图数据或者影视数据,也可以播放多个前后衔接的多个音图数据和影视数据所组成的影视图数据链。As mentioned above, the specific embodiments of categories 1-12 of the present invention provide a variety of solutions for the same layer of audiographic data and/or the same layer of video data playback, which can play only one audiographic data or video at a time. The data can also play a video data link composed of multiple audiovisual data and video data connected before and after.
而本发明第13、14类具体实施方式则是在前述本发明第1-12类具体实施方式的基础上,增加了下层音图数据或者下层影视数据插入播放的技术方案。如前所述:无论下层音图数据或者下层影视数据,也可以分别拥有它们各自的同层音图数据或者同层影视数据;这样一来,本发明第13、14类具体实施方式为本发明提供了上下分层,每层均可以有多个插入点,每个插入点又可以插入下层“链式”的音视图数据。这为本发明的技术方 案用于多种应用领域和情形,提供了极为丰富的方案支持。The specific embodiments of categories 13 and 14 of the present invention are based on the foregoing specific embodiments of categories 1-12 of the present invention, and a technical solution for inserting and playing lower layer audiographic data or lower layer video data is added. As mentioned before: no matter the lower layer audiographic data or the lower layer video data, they can also have their own same layer audiographic data or the same layer video data; in this way, the specific embodiments of the 13th and 14th categories of the present invention are the present invention Provides upper and lower layers, each layer can have multiple insertion points, and each insertion point can be inserted into the lower layer "chained" audiovisual data. This provides the technical solutions of the present invention with various application fields and situations, and provides extremely rich solution support.
本发明第15类具体实施方式提供了下层音图数据或者下层影视终止播放及此后所要执行的技术方案。如前所述,下层音图数据或者下层影视数据的播放是在上层音图数据播放时插入的,因此,无论播放的是单独的一个下层音图数据或者影视数据,还是由多个下层音图数据或者下层影视数据所组成的“同层音视图数据链”,只要这个“链”被依序播放,当这个“链”播放结束时;需要执行如下的操作:基于上层音图数据中止播放标记继续播放上层音图数据;或者,基于上层影视数据中止播放标记继续播放上层影视数据。The fifteenth embodiment of the present invention provides the lower layer audiographic data or the lower layer film and television to terminate playback and the technical solution to be executed thereafter. As mentioned above, the playback of the lower layer audiographic data or lower layer video data is inserted when the upper layer audiographic data is played. Therefore, whether a single lower layer audiographic data or video data is played, or multiple lower layer audiographic images are played The "same layer audio view data link" composed of data or lower layer video data, as long as the "chain" is played in sequence, when the "chain" playback ends; the following operations need to be performed: based on the upper layer audiograph data, the playback mark is aborted Continue to play the upper layer audiovisual data; or, based on the upper layer film and television data, stop playing the mark to continue playing the upper layer film and television data.
本发明第16类具体实施方式提供了下层音图数据或者下层影视终止播放及此后所要执行的技术方案。The 16th embodiment of the present invention provides the lower layer audiographic data or the lower layer film and television to terminate playback and the technical solution to be executed thereafter.
区别于本发明第15类具体实施方式的另外一种情况是:无论播放的是单独的一个下层音图数据或者下层影视数据,或者播放的是由多个下层音图数据或者下层影视数据所组成的“同层音视图数据链”,在这个“链”中任何一个下层音图数据播放时,播放设备接收到终止播放该下层音图数据或者下层影视数据的用户命令时,基于上层音图数据中止播放标记继续播放上层音图数据;或者,基于上层影视数据中止播放标记继续播放上层影视数据。Another case that is different from the specific embodiment of the 15th category of the present invention is: whether it is playing a single lower layer audiographic data or lower layer video data, or playing is composed of multiple lower layer audiographic data or lower layer video data "Same layer audio view data link", when any lower layer audio image data is played in this "chain", when the playback device receives a user command to terminate playing the lower layer audio image data or the lower layer video data, it is based on the upper layer audio image data The playback stop mark continues to play the upper layer audiovisual data; or, the playback stop mark based on the upper layer video data continues to play the upper layer video data.
本发明第15、16类具体实施方式所提供的技术方案,保证了本发明下层音图数据或者影视数据播放结束后,能够返回播放上层音图数据或者影视数据。使得本发明前述各个具体实施方式更为完善。The technical solutions provided by the 15th and 16th specific embodiments of the present invention ensure that the lower layer audiographic data or video data of the present invention can be returned to play the upper layer audiographic data or video data after the playback is completed. This makes the aforementioned specific embodiments of the present invention more complete.
在前述所有的具体实施方式中,分别涉及到:上层图片对准参数、上层音频对准参数、下层音图标识、下层影视标识、下层音图下载参数、下层音图播放参数、同层音图标识、同层影视标识、同层音图下载参数、同层音图播放参数、同层图片对准参数、同层音频对准参数等设置在音视图数据中的信息,这些信息在整个音视图数据中的存在方式,可以采用与音频、图片、视频、动画等数据相互分离的方式,例如:将这些信息单独构建一个信息包(流),然后再将这个信息包(流)和音频、图片、视频、动画等数据组合在一起。也可以将这些信息嵌入到音频、图片、视频、动画等数据之中,使这些信息分别与这些音频、图片、视频、动画等数据融 为一体。这样就可以在传输这些音频、图片、视频、动画的时候,将这些信息也随带着一并传输。编号为PCT/CN2016/087445的国际专利申请公开了一种将数据嵌入到音频数据之中的一种技术方案。在视频、动画数据中也保留有相应音频数据的空间,因此,将数据嵌入到视频、动画数据之中,实际上也就是将数据嵌入到视频或者动画数据内的音频数据之中的技术方案。除此之外,一些图片格式、视频格式以及动画格式中也保留了一些可选的字段,以允许用户来存放自己的数据;因此,前述的那些信息也可以存放于这样的字段里,使得它们作为图片数据的一部分,与图片一道被传输。此外,在一些特定的情况下,可以利用一些与前述的国际专利申请类似的技术方案,像在音频中嵌入数据那样,将这些信息嵌入到图片、视频以及动画的内容字段中,而不是写入到保留字段中。In all the specific embodiments mentioned above, they are respectively related to: upper layer image alignment parameters, upper layer audio alignment parameters, lower layer audio image identification, lower layer video identification, lower layer audio image download parameters, lower layer audio image playback parameters, same layer audio image The information set in the audio view data such as the logo, the same layer of film and television logo, the same layer of audio image download parameters, the same layer of audio image playback parameters, the same layer of picture alignment parameters, the same layer of audio alignment parameters, etc. The way of existence in the data can be separated from the audio, pictures, videos, animations and other data, for example: the information is separately constructed into a packet (stream), and then the packet (stream) and audio, pictures , Video, animation and other data are combined together. You can also embed this information into data such as audio, pictures, videos, animations, etc., so that the information is integrated with these data such as audio, pictures, videos, animations, etc. In this way, when these audios, pictures, videos, and animations are transmitted, the information can be transmitted along with them. The international patent application numbered PCT/CN2016/087445 discloses a technical solution for embedding data into audio data. There is also space for corresponding audio data in video and animation data. Therefore, embedding data in video and animation data is actually a technical solution for embedding data in audio data in video or animation data. In addition, some image formats, video formats, and animation formats also retain some optional fields to allow users to store their own data; therefore, the aforementioned information can also be stored in such fields so that they As part of the picture data, it is transmitted along with the picture. In addition, in some specific cases, you can use some technical solutions similar to the aforementioned international patent applications. Like embedding data in audio, you can embed this information in the content fields of pictures, videos, and animations instead of writing To the reserved field.
有鉴于此,在前述所有具体的实施方式的基础上,本发明第17类具体的实施方式还包括这样的技术内容:在解析上层图片和/或上层图片音频的时候,从上层图片和/或上层图片音频中解析或者提取嵌入在其中的上层图片对准参数和/或上层音频对准参数。另外,对于嵌入在上层图片、上层图片音频和/或上层视频数据或者上层动画数据的音频数据和/或私有数据中的下层音图标识、下层影视标识,则可以从相应的上层图片、上层图片音频和/或上层视频数据或者上层动画数据的音频数据和/或私有数据中提取或者解析出来。其中的私有数据与前文所提到的图片格式、视频格式以及动画格式中保留的那些可选的、保留给用户使用的字段。In view of this, on the basis of all the foregoing specific embodiments, the 17th specific embodiment of the present invention also includes such technical content: when parsing the upper layer picture and/or upper layer picture audio, from the upper layer picture and/or Parse or extract the upper layer picture alignment parameters and/or upper layer audio alignment parameters embedded in the upper layer picture audio. In addition, for the lower layer audio image logo and the lower layer video logo embedded in the upper layer image, upper layer image audio and/or upper layer video data or upper layer animation data, audio data and/or private data, you can select the corresponding upper layer image, upper layer image Audio and/or upper layer video data or upper layer animation data are extracted or parsed from audio data and/or private data. The private data and the optional fields reserved in the picture format, video format and animation format mentioned above are reserved for users.
在前述所有具体的实施方式的基础上,本发明第18类具体的实施方式还包括这样的技术内容:在解析上层图片和/或上层图片音频的时候,从上层图片和/或上层图片音频中解析或者提取嵌入在其中的同层音图标识和/或同层影视标识。另外,对于嵌入在同层图片、同层图片音频之中的同层图片对准参数和/或同层音频对准参数,则可以从相应的同层图片、同层图片音频中提取或者解析出来。On the basis of all the foregoing specific embodiments, the 18th specific embodiment of the present invention also includes such technical content: when parsing the upper layer picture and/or upper layer picture audio, from the upper layer picture and/or upper layer picture audio Parse or extract the same layer audiograph logo and/or the same layer video logo embedded in it. In addition, the alignment parameters and/or audio alignment parameters of the same layer image embedded in the same layer image and the same layer image audio can be extracted or parsed from the corresponding same layer image and the same layer image audio .
在前述所有具体的实施方式的基础上,本发明第19类具体的实施方式还包括这样的技术内容:在解析上层图片和/或上层图片音频的时候,从上层图片和/或上层图片音频中解析嵌入在其中的下层音图下载参数和下层音图播放参数。另外,对于嵌入在下层图片、下层图片音频之中的下层 图片对准参数和/或下层音频对准参数,则可以从相应的下层图片、下层图片音频中提取或者解析出来。On the basis of all the foregoing specific embodiments, the 19th specific embodiment of the present invention also includes such technical content: when parsing the upper layer picture and/or upper layer picture audio, from the upper layer picture and/or upper layer picture audio Analyze the download parameters and playback parameters of the lower layer audiograph embedded in it. In addition, the lower layer picture alignment parameters and/or lower layer audio alignment parameters embedded in the lower layer picture and the lower layer picture audio can be extracted or parsed from the corresponding lower layer picture and lower layer picture audio.
本发明前述第17、18、19等三类具体的实施方式主要是用来支持用来指示播放设备工作的参数、信息的解析和提取;而这些参数和信息可以采用多种方式嵌入到上层、同层和下层的图片、图片音频、视频或者动画数据之中。这使得这些参数和信息可以通过适当的方式由相应的图片、图片音频、视频或者动画数据在传输时被一体携带,而无需另行传送。既保证了传输的便利性又实现了播放控制的及时有效。The aforementioned three specific implementations of the 17th, 18th, 19th, etc. of the present invention are mainly used to support the analysis and extraction of parameters and information used to indicate the operation of the playback device; and these parameters and information can be embedded into the upper layer in various ways. In the picture, picture audio, video or animation data of the same layer and the lower layer. This allows these parameters and information to be carried by the corresponding pictures, pictures, audio, video, or animation data in an appropriate way without any additional transmission. It not only ensures the convenience of transmission but also realizes the timely and effective playback control.

Claims (10)

  1. 一种音视图数据播放的方法,包括:A method for playing audio view data, including:
    下载音视图数据并对其进行解析,以获得所述上层音图数据中的上层图片音频、上层图片和/或上层图片对准参数和/或上层音频对准参数;Download the audio view data and parse it to obtain upper layer picture audio, upper layer picture and/or upper layer picture alignment parameters and/or upper layer audio alignment parameters in the upper layer audiogram data;
    自动地,或者在接收到播放所述上层图片音频或者上层图片的命令时,播放所述上层图片音频或者上层图片,并在所述上层图片对准参数或者上层音频对准参数指示的播放时间到达,或者在接收到播放所述上层图片音频或者上层图片的命令时,播放对应的所述上层图片或者上层图片音频;Automatically, or upon receiving a command to play the audio or picture of the upper layer picture, play the audio or picture of the upper layer picture, and the play time indicated by the alignment parameter of the upper picture or the audio alignment parameter of the upper layer arrives Or, when receiving the command to play the audio of the upper layer picture or the upper layer picture, play the corresponding audio of the upper layer picture or the upper layer picture;
    或者,or,
    下载音视图数据并对其进行解析,以获得所述上层影视数据中的上层视频数据或者上层动画数据、下层音图标识和/或下层影视标识;Download the audiovisual data and parse it to obtain the upper layer video data or the upper layer animation data, the lower layer sound image logo and/or the lower layer film logo in the upper layer video data;
    自动地,或者在接收到播放所述上层视频数据或者上层动画数据的命令时,播放所述上层视频数据或者上层动画数据。Automatically, or upon receiving a command to play the upper layer video data or upper layer animation data, play the upper layer video data or upper layer animation data.
  2. 根据权利要求1所述的方法,其特征在于还包括:The method of claim 1, further comprising:
    对所述音视图数据解析,以获得同层音图标识和/或同层影视标识,所述同层音图标识至少包含同层音图下载参数和同层音图播放参数;所述同层影视数据标识至少包含同层影视下载参数和同层影视播放参数;Parsing the audiovisual data to obtain audiogram identification and/or film and television identification of the same layer, the audiogram identification of the same layer at least includes download parameters of audiograms of the same layer and playback parameters of audiograms of the same layer; the same layer The movie and TV data identification contains at least the same layer of movie download parameters and the same layer of movie and TV playback parameters;
    基于所述同层音图标识下载与所述同层音图下载参数对应的同层音图数据;其中:所述同层音图数据至少由同层图片音频、同层图片和/或同层图片对准参数和/或同层音频对准参数;Download the same-layer audiogram data corresponding to the same-layer audiogram download parameters based on the same-layer audiogram identifier; wherein: the same-layer audiogram data consists of at least the same-layer audio picture, the same-layer picture, and/or the same-layer audiogram data Picture alignment parameters and/or audio alignment parameters of the same layer;
    和/或,and / or,
    基于所述同层影视标识下载与所述同层影视下载参数对应的同层影视数据;所述同层影视数据至少由同层视频数据或者同层动画数据构成。Download the same layer of film and television data corresponding to the same layer of film and television download parameters based on the same layer of film and television identification; the same layer of film and television data consists of at least the same layer of video data or the same layer of animation data.
  3. 根据权利要求2所述的方法,其特征在于还包括:The method of claim 2, further comprising:
    在所述同层音图播放参数所指示的播放时间到达时,或者,当接收到启动播放所述同层音图数据的用户命令时,终止播放所述上层图片音频或者上层图片;或者,终止播放所述上层视频数据或者上层动画数据;When the playback time indicated by the playback parameters of the audiograms of the same layer arrives, or when a user command to start playing the audiogram data of the same layer is received, the playback of the audio of the upper layer picture or the upper layer picture is terminated; or, the termination Play the upper layer video data or upper layer animation data;
    播放所述同层图片音频或者同层图片,并在所述同层图片对准参数或者同层音频对准参数所指示的播放时间到达时,播放对应的所述同层图片或者同层图片音频;Play the same layer picture audio or the same layer picture, and when the play time indicated by the same layer picture alignment parameter or the same layer audio alignment parameter arrives, play the corresponding same layer picture or same layer picture audio ;
    或者,or,
    在所述同层影视播放参数所指示的播放时间到达时,或者,当接收到启动播放所述同层影视数据的用户命令时,终止播放所述上层图片音频或者上层图片;或者,终止播放所述上层视频数据或者上层动画数据;When the playback time indicated by the video playback parameters of the same layer arrives, or when a user command to start playing the video data of the same layer is received, the playback of the audio or picture of the upper layer picture is terminated; or, the playback place is terminated Describe the upper layer video data or upper layer animation data;
    播放所述同层视频数据或者同层动画数据。Play the same layer of video data or same layer of animation data.
  4. 根据权利要求3所述的方法,其特征在于还包括:The method according to claim 3, further comprising:
    当接收到终止播放当前所述同层音图数据的用户命令,或者当前所述同层音图数据播放结束时;根据所述同层音图标识和/或同层影视标识,顺序播放其他同层音图数据或者同层影视数据;When a user command to stop playing the current layer audiogram data is received, or the current layer audiogram data is finished playing; according to the same layer audiogram identifier and/or the same layer video identifier, other sequencers are played in sequence Layer audiogram data or film and television data of the same layer;
    或者,or,
    当接收到终止播放当前所述同层影视数据的用户命令,或者当前所述同层影视数据播放结束时;根据所述同层音图标识和/或同层影视标识,顺序播放其他同层音图数据或者同层影视数据。When a user command to stop playing the current film and television data of the same layer is received, or the current playback of the film and television data of the same layer ends; according to the audiogram identification of the same layer and/or the film and television identification of the same layer, other audios of the same layer are sequentially played Picture data or video data on the same layer.
  5. 根据权利要求1所述的方法,其特征在于还包括:The method of claim 1, further comprising:
    对所述音视图数据解析,以获得下层音图标识和/或下层影视标识,所述下层音图标识至少包含下层音图下载参数和下层音图播放参数;所述下层影视标识至少包含下层影视下载参数和下层影视播放参数;Analyze the audiovisual data to obtain a lower-level audiographic logo and/or a lower-level audiovisual logo. The lower-level audiographic logo includes at least a lower-level audiographic image download parameter and a lower-level audiographic image playback parameter; the lower-level audiovisual image includes at least a lower-level videographic image Download parameters and lower-layer video playback parameters;
    基于所述下层音图标识下载与所述下层音图下载参数对应的下层音图数据;其中:所述下层音图数据至少由下层图片音频、下层图片和/或下层音图对准参数组合而成;其中:所述下层音图对准参数至少包括:下层图片对准参数和/或下层音频对准参数;Download the lower layer audiogram data corresponding to the lower layer audiogram download parameters based on the lower layer audiogram identifier; wherein: the lower layer audiogram data is at least composed of a combination of lower layer image audio, lower layer image, and/or lower layer audiogram alignment parameters Wherein: the lower layer audiogram alignment parameters at least include: lower layer image alignment parameters and/or lower layer audio alignment parameters;
    和/或,and / or,
    基于所述下层影视标识下载与所述下层影视下载参数对应的下层影视数据;所述下层影视数据至少由下层视频数据或者下层动画数据构成。Download the lower-layer movie data corresponding to the lower-layer movie download parameters based on the lower-layer movie logo; the lower-layer movie data is composed of at least lower-layer video data or lower-layer animation data.
  6. 根据权利要求5所述的方法,其特征在于还包括:The method of claim 5, further comprising:
    在所述下层音图播放参数所指示的播放时间到达时,或者,当接收到启动播放所述下层音图数据的用户命令时,中止播放所述上层图片音频或者上层图片,并生成对应的上层音图数据中止播放标记;或者,中止播放所述上层视频数据或者上层动画数据,并生成对应的上层影视数据中止播放标记;When the playback time indicated by the playback parameters of the lower-layer audiogram reaches, or when a user command to start playing the lower-layer audiogram data is received, the playback of the audio or the upper-layer picture audio is suspended, and the corresponding upper-layer is generated The audiovisual data suspends the playback mark; or, suspends the playback of the upper layer video data or the upper layer animation data, and generates a corresponding upper layer video data suspension playback mark;
    播放所述下层图片音频或者下层图片,并在所述下层图片对准参数或者下层音频对准参数所指示的播放时间到达时,播放对应的所述下层图片或者下层图片音频;Playing the lower layer picture audio or the lower layer picture, and when the play time indicated by the lower layer picture alignment parameter or the lower layer audio alignment parameter arrives, playing the corresponding lower layer picture or the lower layer picture audio;
    或者,or,
    在所述下层影视播放参数所指示的播放时间到达时,或者,当接收到启动播放所述下层影视数据的用户命令时,中止播放所述上层图片音频或者上层图片,并生成对应的上层音图数据中止播放标记;或者,中止播放所述上层视频数据或者上层动画数据,并生成对应的上层影视数据中止播放标记;When the playback time indicated by the lower-layer video playback parameters arrives, or when a user command to start playing the lower-layer video data is received, the playback of the upper-layer picture audio or the upper-layer picture is suspended, and a corresponding upper-layer audio picture is generated The data stop playing mark; or, stop playing the upper layer video data or the upper layer animation data, and generate a corresponding upper layer video data stop playing mark;
    播放所述下层视频数据或者下层动画数据。Play the lower layer video data or the lower layer animation data.
  7. 根据权利要求6所述的方法,其特征在于还包括:The method of claim 6, further comprising:
    当接收到终止播放所述下层音图数据的用户命令,或者所述下层音图数据播放结束时;基于所述上层音图数据中止播放标记继续播放所述上层音图数据;或者,基于所述上层影视数据中止播放标记继续播放所述上层影视数据;When a user command to stop playing the lower layer audiogram data is received, or the playback of the lower layer audiogram data ends; the playback pause flag is continued based on the upper layer audiogram data to continue playing the upper layer audiogram data; or, based on the The upper layer movie data suspends the playback mark to continue playing the upper layer movie data;
    或者,or,
    当接收到终止播放所述下层影视数据的用户命令,或者所述下层影视数据播放结束时;基于所述上层音图数据中止播放标记继续播放所述上层音图数据;或者,基于所述上层影视数据中止播放标记继续播放所述上层影视数据。When a user command to stop playing the lower layer video data is received, or the playback of the lower layer video data ends; based on the upper layer audio image data, the playback stop mark continues to play the upper layer audio image data; or, based on the upper layer video The data suspension play flag continues to play the upper layer video data.
  8. 根据权利要求1-7所述的任一方法,其特征在于还包括:The method according to any one of claims 1-7, further comprising:
    从所述上层图片和/或上层图片音频中解析嵌入在其中的所述上层图片对准参数和/或所述上层音频对准参数;和/或,Parsing the upper layer picture alignment parameters and/or the upper layer audio alignment parameters embedded therein from the upper layer picture and/or upper layer picture audio; and/or,
    从所述上层图片、所述上层图片音频和/或所述上层视频数据或者所述上层动画数据的音频数据和/或私有数据中解析嵌入在其中的所述下层音图标识和/或所述下层影视标识。Parsing the lower layer audiograph logo embedded in the upper layer picture, the upper layer picture audio and/or the upper layer video data or the upper layer animation data audio data and/or private data, and/or the The lower film and television logo.
  9. 根据权利要求2-4所述的任一方法,其特征在于还包括:The method according to any one of claims 2-4, further comprising:
    从所述上层图片和/或所述上层图片音频中解析嵌入在其中的所述同层音图标识和/或所述同层影视标识;和/或,Parsing the same-layer audiographic logo and/or the same-layer video logo embedded in the upper-layer picture and/or the upper-layer picture audio; and/or,
    从所述同层图片和/或所述同层图片音频中解析嵌入在其中的所述同 层图片对准参数和/或所述同层音频对准参数。Parsing the same-layer picture alignment parameters and/or the same-layer audio alignment parameters embedded therein from the same-layer picture and/or the same-layer picture audio.
  10. 根据权利要求5-7所述的任一方法,其特征在于还包括:The method according to any one of claims 5-7, further comprising:
    从所述上层图片和/或上层图片音频中解析嵌入在其中的所述下层音图下载参数和所述下层音图播放参数;和/或,Parsing the download parameters and playback parameters of the lower layer audiogram embedded in the upper layer image and/or the upper layer image audio; and/or,
    从所述下层图片和/或所述下层图片音频中解析嵌入在其中的所述下层图片对准参数和/或所述下层音频对准参数。Parsing the lower layer picture alignment parameter and/or the lower layer audio alignment parameter embedded therein from the lower layer picture and/or the lower layer picture audio.
PCT/CN2019/106073 2019-01-03 2019-09-17 Method for playing audio, video, and picture data WO2020140478A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910004506.2A CN111402935B (en) 2019-01-03 2019-01-03 Method for playing audio and video data
CN201910004506.2 2019-01-03

Publications (1)

Publication Number Publication Date
WO2020140478A1 true WO2020140478A1 (en) 2020-07-09

Family

ID=71407123

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/106073 WO2020140478A1 (en) 2019-01-03 2019-09-17 Method for playing audio, video, and picture data

Country Status (2)

Country Link
CN (1) CN111402935B (en)
WO (1) WO2020140478A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060109273A1 (en) * 2004-11-19 2006-05-25 Rams Joaquin S Real-time multi-media information and communications system
CN106971635A (en) * 2017-03-20 2017-07-21 厦门云开云科技有限公司 A kind of teaching, training method and system
CN107295284A (en) * 2017-08-03 2017-10-24 浙江大学 A kind of generation of video file being made up of audio and picture and index playing method, device
CN107888558A (en) * 2017-10-09 2018-04-06 广东教教圈圈动漫科技有限公司 One kind paints this dubbing method, device and system
CN108282677A (en) * 2018-01-24 2018-07-13 上海哇嗨网络科技有限公司 Realize that content throws method, throwing screen device and the system of screen by client
CN108881992A (en) * 2018-07-09 2018-11-23 深圳市潮流网络技术有限公司 A kind of multimedia audio-video data synchronization calculation method

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002101380A (en) * 2000-09-22 2002-04-05 Fujitsu Ltd Device for reproducing data stream
CA2709623A1 (en) * 2007-12-17 2009-06-25 Samuel Palahnuk Communications network system
CN101673267B (en) * 2008-09-12 2012-11-07 未序网络科技(上海)有限公司 Method for searching audio and video content
TW201127051A (en) * 2010-01-26 2011-08-01 Hon Hai Prec Ind Co Ltd Television receiver and method for playing television program thereof
CN102222227B (en) * 2011-04-25 2013-07-31 中国华录集团有限公司 Video identification based system for extracting film images
CN102316361B (en) * 2011-07-04 2014-05-21 深圳市车音网科技有限公司 Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof
CN104125491A (en) * 2014-07-07 2014-10-29 乐视网信息技术(北京)股份有限公司 Audio comment information generating method and device and audio comment playing method and device
CN105992042B (en) * 2015-03-05 2019-07-16 北京图音数码科技有限公司 Media player and media playing method
CN105005578A (en) * 2015-05-21 2015-10-28 中国电子科技集团公司第十研究所 Multimedia target information visual analysis system
US10489453B2 (en) * 2016-02-26 2019-11-26 Amazon Technologies, Inc. Searching shared video footage from audio/video recording and communication devices
US10074012B2 (en) * 2016-06-17 2018-09-11 Dolby Laboratories Licensing Corporation Sound and video object tracking
CN106126617B (en) * 2016-06-22 2018-11-23 腾讯科技(深圳)有限公司 A kind of video detecting method and server
CN108847258B (en) * 2018-06-10 2021-06-04 北京酷我科技有限公司 Method for realizing interception of audio control

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060109273A1 (en) * 2004-11-19 2006-05-25 Rams Joaquin S Real-time multi-media information and communications system
CN106971635A (en) * 2017-03-20 2017-07-21 厦门云开云科技有限公司 A kind of teaching, training method and system
CN107295284A (en) * 2017-08-03 2017-10-24 浙江大学 A kind of generation of video file being made up of audio and picture and index playing method, device
CN107888558A (en) * 2017-10-09 2018-04-06 广东教教圈圈动漫科技有限公司 One kind paints this dubbing method, device and system
CN108282677A (en) * 2018-01-24 2018-07-13 上海哇嗨网络科技有限公司 Realize that content throws method, throwing screen device and the system of screen by client
CN108881992A (en) * 2018-07-09 2018-11-23 深圳市潮流网络技术有限公司 A kind of multimedia audio-video data synchronization calculation method

Also Published As

Publication number Publication date
CN111402935B (en) 2022-09-13
CN111402935A (en) 2020-07-10

Similar Documents

Publication Publication Date Title
CN105844987B (en) Multimedia teaching interactive operation method and device
CN112468822B (en) Multimedia recording and broadcasting course interaction method based on video SEI message
CN110570698A (en) Online teaching control method and device, storage medium and terminal
CN104796455A (en) Cross-platform multi-screen interacting method, device and system
CN104539436A (en) Lesson content real-time live broadcasting method and system
TW200425710A (en) Method for distributing contents
CN107155080A (en) A kind of curriculum video preparation method for imitating scene of giving lessons on the spot
US20130189664A1 (en) Method and apparatus for providing media stream switching based interactive lecture service, and receiving method and apparatus
KR101198091B1 (en) Method and system for learning contents
CN114363648A (en) Method, equipment and storage medium for audio and video alignment in mixed flow process of live broadcast system
KR20010056342A (en) Effective user interfaces and data structure of a multi-media lecture, and a system structure for transferring and management of the multi-media lecture for distance education in computer networks
WO2020140478A1 (en) Method for playing audio, video, and picture data
CN109523844B (en) Virtual live broadcast simulation teaching system and method
KR20010067612A (en) Method and system for virtual reality based on internet tele-lecturing
Crowther et al. Delivering video-streamed library orientation on the web: technology for the educational setting
CN108364518A (en) A kind of classroom interactions' process record method based on panorama teaching pattern
CN111726692B (en) Interactive playing method of audio-video data
CN104506565A (en) Remote education system and equipment based on fourth generation Internet
Bhosale et al. A Review on Video Streaming in Education
JP2001298431A (en) Information-providing system, information-providing method and terminal
Lugmayr et al. E= MC2+ 1: a fully digital, collaborative, high-definition (HD) production from scene to screen
JP2013150096A (en) Information processor, information processing method, and program
US20140176667A1 (en) Code stream processing method and system, multipoint control unit
CN1333595C (en) Apparatus and method for real-time communication between people with the same television program hobby
Fenting Research on the Production Skills of Dependent Micro Video

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19907159

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19907159

Country of ref document: EP

Kind code of ref document: A1