CN110929056A - Multimedia file generation method and playback method, generation device and playback device - Google Patents

Publication number
CN110929056A
Authority
CN
China
Prior art keywords
multimedia file
data track
positions
file
image object
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811091556.0A
Other languages
Chinese (zh)
Other versions
CN110929056B (en)
Inventor
袁嘉尚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Acer Inc
Original Assignee
Acer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Acer Inc filed Critical Acer Inc
Priority to CN201811091556.0A priority Critical patent/CN110929056B/en
Publication of CN110929056A publication Critical patent/CN110929056A/en
Application granted granted Critical
Publication of CN110929056B publication Critical patent/CN110929056B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Television Signal Processing For Recording (AREA)

Abstract

The present invention provides a multimedia file generating method, a multimedia file playing method, a multimedia file generating device, and a multimedia file playing device. The multimedia file playing method includes the following steps: receiving a multimedia file including a panoramic film associated with a time axis; extracting a first data track of the multimedia file to obtain a plurality of first object positions of a first image object in the panoramic film relative to the time axis; when the panoramic film is played, displaying an icon corresponding to the first image object in a frame on a screen; and, in response to detecting a selection operation applied to the icon, determining a playing view angle for playing the panoramic film according to the first object positions recorded in the first data track, and playing a frame including the first image object based on the playing view angle.

Description

Multimedia file generating method, multimedia file playing method, multimedia file generating device and multimedia file playing device
Technical Field
The present invention relates to a video playing technology, and more particularly, to a method and an apparatus for generating a multimedia file, and a method and an apparatus for playing a multimedia file.
Background
Nowadays, 360-degree films and panoramic cameras are becoming increasingly popular, and users can watch 360-degree films (also called panoramic films) on a computer screen. Images captured at different angles by a plurality of lenses are stitched together in post-processing to produce a 360-degree film with no blind spots, giving the user an immersive viewing experience.
When a user views a 360-degree film on a computer screen, the user can see only one part of the entire 360-degree scene at a time and must adjust the viewing angle to see the other parts. Therefore, the user needs to manually adjust the playing view angle to search for an object of interest, which greatly reduces the convenience of watching the 360-degree film. Furthermore, because of the limited performance of general consumer electronics, it is difficult for such devices to perform real-time image object recognition and tracking on 360-degree films. Therefore, when a user wants to keep an object of interest in view, the user must continually and manually adjust the playing view angle as the object moves.
Disclosure of Invention
In view of the foregoing, the present invention provides a multimedia file generating method and a multimedia file generating apparatus, which can create a specific track of a multimedia file based on location information of an image object to generate the multimedia file including a panoramic film and having location information recorded therein.
The invention also provides a multimedia file playing method and a multimedia file playing device, which can acquire the position information of an image object from a specific data track in the multimedia file, so as to dynamically adjust the playing view angle according to the position information of the image object that interests a user.
An embodiment of the invention provides a multimedia file generating method, which is suitable for a multimedia file generating device and comprises the following steps: obtaining a panoramic film associated with a time axis, wherein the panoramic film comprises at least one image object; obtaining a plurality of object positions of the image object relative to the time axis; compiling the object positions into an object position file; and generating at least one data track of a multimedia file according to the object position file, so as to generate the multimedia file that comprises the panoramic film and has the object positions recorded therein.
An embodiment of the invention provides a multimedia file generating device, which comprises a storage device and a processor. The storage device stores a plurality of modules. The processor is coupled to the storage device and loads and executes the modules in the storage device. The modules comprise a film obtaining module, a position obtaining module, a file making module and a file embedding module. The film obtaining module obtains a panoramic film associated with a time axis, wherein the panoramic film comprises at least one image object. The position obtaining module obtains a plurality of object positions of the image object relative to the time axis. The file making module compiles the object positions into an object position file. The file embedding module generates at least one data track of a multimedia file according to the object position file, so as to generate the multimedia file that comprises the panoramic film and has the object positions recorded therein.
Correspondingly, an embodiment of the present invention provides a multimedia file playing method, which is suitable for a multimedia file playing device and includes the following steps: receiving a multimedia file including a panoramic film associated with a time axis; extracting a first data track of the multimedia file to obtain a plurality of first object positions of a first image object in the panoramic film relative to the time axis; when the panoramic film is played, displaying an icon corresponding to the first image object in a frame on a screen; and, in response to detecting a selection operation applied to the icon, determining a playing view angle for playing the panoramic film according to the first object positions recorded in the first data track, and playing a frame including the first image object based on the playing view angle.
Correspondingly, an embodiment of the present invention provides a multimedia file playing device, which includes a screen, a storage device storing a plurality of modules, and a processor. The processor is coupled to the storage device and the screen, and loads and executes the modules in the storage device. The modules comprise a film receiving module, a data track extraction module, an interface providing module and a film playing module. The film receiving module receives a multimedia file including a panoramic film associated with a time axis. The data track extraction module extracts a first data track of the multimedia file to obtain a plurality of first object positions of a first image object in the panoramic film relative to the time axis. When the panoramic film is played, the interface providing module displays an icon corresponding to the first image object in a frame on the screen. In response to detecting a selection operation applied to the icon, the film playing module determines a playing view angle for playing the panoramic film according to the first object positions recorded in the first data track, and plays a frame including the first image object based on the playing view angle.
Based on the above, the multimedia file generating device can establish a specific data track of the multimedia file according to the object positions of an image object appearing in the panoramic film. When playing the panoramic film, the multimedia file playing device can then acquire the object positions of the image object from that data track and dynamically determine the playing view angle according to them. Therefore, a user can keep a specific image object in the panoramic film in view without having to continually adjust the playing view angle by hand.
In order to make the aforementioned and other features and advantages of the invention more comprehensible, embodiments accompanied with figures are described in detail below.
Drawings
FIG. 1 is a block diagram of a multimedia file generating apparatus according to an embodiment of the present invention.
Fig. 2 is a flowchart illustrating a method of generating a multimedia file according to an embodiment of the present invention.
Fig. 3A and 3B are schematic diagrams illustrating object positions corresponding to time intervals according to an embodiment of the invention.
FIG. 4 is an exemplary object position file according to an embodiment of the present invention.
FIG. 5 is a diagram illustrating a multimedia file architecture according to an embodiment of the present invention.
FIG. 6 is a block diagram of a multimedia file playback apparatus according to an embodiment of the present invention.
Fig. 7 is a flowchart illustrating a method for playing a multimedia file according to an embodiment of the present invention.
Fig. 8A and 8B are schematic diagrams illustrating an exemplary playing of a multimedia file according to an embodiment of the invention.
[Description of reference numerals]
10: multimedia file generating device
60: multimedia file playing device
110, 610: processor
120, 620: storage device
630: screen
121: film obtaining module
122: position obtaining module
123: file making module
124: file embedding module
621: film receiving module
622: data track extraction module
623: interface providing module
624: film playing module
P1-P3: time interval
40: object position file
50: multimedia file
51: header
52: multimedia data
521: video data track
522: audio data track
523: subtitle data track
524: object position data track
80, 86: frame
82: virtual control button
I1-I3: icon
83, 84: image object
S201 to S204, S701 to S704: steps
Detailed Description
Some embodiments of the invention will now be described in detail with reference to the drawings, wherein like reference numerals are used to refer to like or similar elements throughout the several views. These examples are only a part of the present invention and do not disclose all possible embodiments of the present invention. Rather, these embodiments are merely exemplary of the method and multimedia file generating apparatus of the present invention as set forth in the claims.
FIG. 1 is a block diagram of a multimedia file generating apparatus according to an embodiment of the present invention, which is for convenience of illustration only and is not intended to limit the present invention. Fig. 1 first introduces all the components of the multimedia file generating apparatus and their configuration; their detailed functions are disclosed together with fig. 2.
Referring to fig. 1, the multimedia file generating apparatus 10 may be any electronic apparatus with computing capability, such as a desktop computer, a notebook computer, a server, etc., but the present invention is not limited thereto. The multimedia file generating apparatus 10 includes a processor 110 and a storage device 120, and the functions thereof are as follows:
The storage device 120 is, for example, any type of fixed or removable Random Access Memory (RAM), read-only memory (ROM), flash memory, or the like, or a combination thereof. In the present embodiment, the storage device 120 stores a film obtaining module 121, a position obtaining module 122, a file making module 123, and a file embedding module 124.
The Processor 110 is, for example, a Central Processing Unit (CPU), or other Programmable general purpose or special purpose Microprocessor (Microprocessor), Digital Signal Processor (DSP), Programmable controller, Application Specific Integrated Circuit (ASIC), Programmable Logic Device (PLD), or other similar devices or combinations thereof, and is connected to the storage Device 120.
In the present embodiment, the module stored in the storage device 120 is, for example, a computer program, and can be loaded by the processor 110 to execute the method for generating a multimedia file according to the present embodiment.
Fig. 2 is a flowchart illustrating a multimedia file generating method according to an embodiment of the present invention, and the method flow of fig. 2 can be implemented by the elements of the multimedia file generating apparatus 10 of fig. 1. Referring to fig. 1 and fig. 2, the following describes detailed steps of the method for generating a multimedia file according to the present embodiment in combination with various elements and devices of the multimedia file generating device 10 in fig. 1.
In step S201, the film obtaining module 121 obtains a panoramic film associated with a time axis, wherein the panoramic film includes at least one image object. Here, the film obtaining module 121 may obtain the panoramic film from an image acquiring module (not shown) of the multimedia file generating apparatus 10 itself or from another electronic device. A panoramic film, which may also be referred to as a 360-degree film, is composed of video frames corresponding to different timestamps on the time axis, and the video frames are 360-degree images stored in a specific format. The specific format is, for example, an Equiangular format or the like. It should be noted that, in the embodiment of the present invention, the panoramic film includes at least one image object produced by shooting at least one physical object; that is, the image object appears in the video frames of the panoramic film. The image object in the panoramic film is, for example, a human face, but the present invention is not limited to this, and other kinds of image objects are possible.
In step S202, the position obtaining module 122 obtains a plurality of object positions of the image object relative to the time axis. In one embodiment, the object positions of the image object can be produced in advance by a film editor who watches the panoramic film and labels the positions manually. In other words, the position obtaining module 122 can obtain a plurality of object positions of the image object in a three-dimensional coordinate system from the labels that the film editor assigns while viewing the panoramic film. Alternatively, in one embodiment, the object positions of the image object relative to the time axis may be generated automatically by an object detection and recognition algorithm. In other words, by using the object detection and recognition algorithm to track a specific image object in the panoramic film, the position obtaining module 122 can obtain a plurality of object positions of the image object for different time intervals in a three-dimensional coordinate system. The object position of the image object may be represented, for example, by spherical coordinates.
In an embodiment, the object positions of the image object respectively correspond to a plurality of time intervals on the time axis. That is, the object positions of the image object can be sampled at fixed or variable time intervals. Referring to fig. 3A, fig. 3A is a schematic diagram illustrating a plurality of object positions corresponding to a plurality of time intervals according to an embodiment of the invention. For an image object, the position obtaining module 122 can obtain the object position (r1, θ1, ψ1) corresponding to the time interval P1, the object position (r2, θ2, ψ2) corresponding to the time interval P2, and the object position (r3, θ3, ψ3) corresponding to the time interval P3. It should be noted that the time lengths of the time intervals P1-P3 may be the same or different, and the invention is not limited thereto.
In addition, in an embodiment, the panoramic film may contain two or more image objects. Thus, the at least one image object in the panoramic film may include a first image object and a second image object. Correspondingly, the object positions relative to the time axis include a plurality of first object positions of the first image object and a plurality of second object positions of the second image object. Referring to fig. 3B, fig. 3B is a schematic diagram illustrating a plurality of object positions corresponding to a plurality of time intervals according to an embodiment of the invention. For the first image object, the position obtaining module 122 may obtain the object position (r4, θ4, ψ4) corresponding to the time interval P1 and the object position (r5, θ5, ψ5) corresponding to the time interval P2. For the second image object, the position obtaining module 122 may obtain the object position (r6, θ6, ψ6) corresponding to the time interval P1 and the object position (r7, θ7, ψ7) corresponding to the time interval P2.
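As an illustration only, the per-interval object positions described above can be pictured as a small lookup structure. This is a minimal sketch under stated assumptions: the names `ObjectPosition`, `PositionTrack`, and `position_at` are hypothetical and do not come from the source, the intervals are assumed to be half-open and non-overlapping, and the numeric coordinates are invented for the example.

```python
from bisect import bisect_right
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class ObjectPosition:
    """Spherical coordinates (r, theta, psi) of an image object."""
    r: float
    theta: float
    psi: float


@dataclass
class PositionTrack:
    """Positions of one image object, one entry per time interval."""
    name: str
    # Each entry is (start_seconds, end_seconds, position). Intervals may
    # have different lengths, as the description allows.
    entries: List[Tuple[float, float, ObjectPosition]] = field(default_factory=list)

    def add(self, start: float, end: float, pos: ObjectPosition) -> None:
        self.entries.append((start, end, pos))
        self.entries.sort(key=lambda e: e[0])

    def position_at(self, t: float) -> Optional[ObjectPosition]:
        """Return the position whose interval [start, end) contains t."""
        starts = [e[0] for e in self.entries]
        i = bisect_right(starts, t) - 1
        if i < 0:
            return None
        start, end, pos = self.entries[i]
        return pos if start <= t < end else None


track = PositionTrack("first image object")
track.add(0.0, 1.0, ObjectPosition(1.0, 90.0, 10.0))  # interval P1
track.add(1.0, 2.5, ObjectPosition(1.0, 85.0, 40.0))  # interval P2 (longer)
```

One `PositionTrack` per image object mirrors the first and second object positions of fig. 3B; a time that falls outside every interval simply yields no position.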
Then, returning to the flow of fig. 2, in step S203, the file making module 123 compiles the object positions into an object position file. Specifically, the file making module 123 may compile the object positions corresponding to the time intervals on the time axis into an object position file in a preset file format. In one embodiment, the object position file may be generated in a manner similar to the way a film subtitle file is generated. Referring to fig. 4, fig. 4 is a diagram illustrating an example of an object position file according to an embodiment of the invention. The object position file 40 records the object positions of two image objects in the panoramic film, named "object name A" and "object name B" respectively, and the object positions are recorded at regular time intervals. The example shown in fig. 4 uses a time interval of 1 second, but the invention is not limited thereto. For example, at time 00:01.000, the object position of the image object named "object name A" is (r6, θ6, ψ6), and the object position of the image object named "object name B" is (r7, θ7, ψ7). At time 00:02.000, the object position of the image object named "object name A" is (r8, θ8, ψ8), and the object position of the image object named "object name B" is (r9, θ9, ψ9).
In addition, in an embodiment, the file making module 123 may map the object positions recorded as three-dimensional position coordinates into two-dimensional position coordinates, and record the two-dimensional position coordinates in the object position file. In general, each video frame in a panoramic film is stored by mapping a panoramic image into a two-dimensional image, such as in the Equiangular format. The object positions recorded as three-dimensional position coordinates (e.g., spherical coordinates) can likewise be mapped to two-dimensional position coordinates in a two-dimensional coordinate system before being stored, so as to reduce the data size of the object position file.
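The subtitle-like object position file can be sketched as follows. The source does not specify the file syntax, so the text layout used here (a timestamp line followed by one `name: r, theta, psi` line per object, with blank lines between blocks) is purely an assumption, as are the function names and the sample coordinates.

```python
from typing import Dict, List, Tuple

Position = Tuple[float, float, float]        # (r, theta, psi)
Record = Tuple[float, Dict[str, Position]]   # (timestamp in seconds, {name: position})


def format_timestamp(seconds: float) -> str:
    """Render seconds as MM:SS.mmm, in the '00:01.000' style used for fig. 4."""
    total_ms = round(seconds * 1000)
    minutes, rem_ms = divmod(total_ms, 60_000)
    secs, ms = divmod(rem_ms, 1000)
    return f"{minutes:02d}:{secs:02d}.{ms:03d}"


def parse_timestamp(text: str) -> float:
    minutes, rest = text.split(":")
    return int(minutes) * 60 + float(rest)


def dump_position_file(records: List[Record]) -> str:
    """Serialise records into the assumed subtitle-like text layout."""
    blocks = []
    for ts, objects in records:
        lines = [format_timestamp(ts)]
        for name, (r, theta, psi) in objects.items():
            lines.append(f"{name}: {r}, {theta}, {psi}")
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


def load_position_file(text: str) -> List[Record]:
    """Parse the assumed layout back into records."""
    records: List[Record] = []
    for block in text.strip().split("\n\n"):
        lines = block.splitlines()
        objects: Dict[str, Position] = {}
        for line in lines[1:]:
            name, coords = line.rsplit(":", 1)
            r, theta, psi = (float(v) for v in coords.split(","))
            objects[name.strip()] = (r, theta, psi)
        records.append((parse_timestamp(lines[0]), objects))
    return records


records = [
    (1.0, {"object name A": (1.0, 90.0, 30.0), "object name B": (1.0, 80.0, 200.0)}),
    (2.0, {"object name A": (1.0, 88.0, 35.0), "object name B": (1.0, 80.0, 190.0)}),
]
text = dump_position_file(records)
```

A plain-text layout keeps the file easy to edit by hand, which fits the manual labeling workflow described for step S202; a binary layout would serve equally well for automatically generated positions.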
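The mapping from three-dimensional to two-dimensional position coordinates can be illustrated with a short sketch. It assumes an equirectangular frame layout, which is a common choice but is not confirmed by the source, and it assumes that θ is a polar angle in [0, 180] degrees and ψ an azimuth in [0, 360) degrees. The function names are hypothetical.

```python
from typing import Tuple


def spherical_to_equirect(theta_deg: float, psi_deg: float,
                          width: int, height: int) -> Tuple[float, float]:
    """Map a direction on the sphere to 2-D frame coordinates.

    Assumed convention (not taken from the source): theta is the polar
    angle in [0, 180] measured from the top of the sphere, and psi is the
    azimuth in [0, 360). The radius r is dropped because every pixel of
    the frame is treated as lying on the same sphere.
    """
    u = (psi_deg % 360.0) / 360.0 * width
    v = min(max(theta_deg, 0.0), 180.0) / 180.0 * height
    return u, v


def equirect_to_spherical(u: float, v: float,
                          width: int, height: int) -> Tuple[float, float]:
    """Inverse mapping, recovering (theta, psi) in degrees."""
    return v / height * 180.0, u / width * 360.0


u, v = spherical_to_equirect(90.0, 180.0, width=3840, height=1920)
```

Storing two values per sample instead of three is where the size reduction mentioned above would come from under this assumption.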
Then, in step S204, the file embedding module 124 generates at least one data track of the multimedia file according to the object location file to generate a multimedia file including the panoramic film and having the object location recorded therein. Specifically, fig. 5 is a schematic diagram of a multimedia file architecture according to an embodiment of the present invention. The multimedia file 50 includes a header 51 and multimedia data 52, and the multimedia data 52 includes multimedia data that can be classified into a plurality of data tracks. In other words, the multimedia file 50 may include a plurality of data tracks. The header 51 records therein a description of the characteristics of these tracks and the number of these tracks, which may include a video track 521, an audio track 522, a subtitle track 523, and an object position track 524. Wherein, the video data track is used for classifying the video data; the audio data tracks are used for classifying the audio data, and different audio data tracks can represent different languages; the subtitle data track is used to classify subtitle data, and different subtitle data tracks may represent subtitles in different languages.
In one embodiment, when the object position file includes a plurality of first object positions of the first image object and a plurality of second object positions of the second image object (as shown in the example of fig. 4), the file embedding module 124 may generate a first data track corresponding to the first image object and embed the first object positions in the object position file (e.g., (r4, θ4, ψ4), (r6, θ6, ψ6), (r8, θ8, ψ8) of fig. 4) into the first data track. Likewise, the file embedding module 124 may generate a second data track corresponding to the second image object and embed the second object positions in the object position file (e.g., (r5, θ5, ψ5), (r7, θ7, ψ7), (r9, θ9, ψ9) of fig. 4) into the second data track. That is, the number of object position data tracks is determined by the number of labeled image objects, and the object positions of each image object are recorded in the corresponding object position data track. In other words, different object position data tracks represent position information of different image objects.
It is noted that, compared to the conventional multimedia file, the multimedia file 50 of the present embodiment further includes an object position data track 524 for recording the object position. The file embedding module 124 can establish at least one data track (i.e., the object location data track 524) of the multimedia file 50 according to the object location file, such as embedding the data in the object location file 40 shown in fig. 4 into the object location data track 524 of the multimedia file 50. Herein, embedding specific data into at least one data track of the multimedia file 50 represents embedding specific data into data blocks of the data track in the multimedia file 50. Furthermore, the header 51 records the description of the characteristics of the object location data tracks and the number of the object location data tracks. In this way, the player for playing the multimedia file 50 can obtain the position information of one or more image objects in the panorama from the object position data track 524, in addition to playing the panorama in the multimedia file 50.
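The relationship between the header, the data tracks, and the embedding step can be sketched with in-memory structures. This is not a real container format: the class names `DataTrack` and `MultimediaFile`, the function `embed_object_positions`, and the header fields are all hypothetical, chosen only to mirror the description of header 51 and object position data track 524.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Position = Tuple[float, float, float]   # (r, theta, psi)
Sample = Tuple[float, Position]         # (timestamp in seconds, position)


@dataclass
class DataTrack:
    kind: str          # "video", "audio", "subtitle" or "object_position"
    description: str   # track characteristics recorded in the header
    samples: list = field(default_factory=list)


@dataclass
class MultimediaFile:
    tracks: List[DataTrack] = field(default_factory=list)

    def header(self) -> Dict[str, object]:
        """Summarise the tracks, as header 51 is described as doing."""
        return {
            "track_count": len(self.tracks),
            "object_position_track_count": sum(
                t.kind == "object_position" for t in self.tracks),
            "tracks": [(t.kind, t.description) for t in self.tracks],
        }


def embed_object_positions(media: MultimediaFile,
                           position_file: List[Tuple[float, Dict[str, Position]]]) -> None:
    """Create one object position track per labeled image object."""
    per_object: Dict[str, List[Sample]] = {}
    for timestamp, objects in position_file:
        for name, pos in objects.items():
            per_object.setdefault(name, []).append((timestamp, pos))
    for name, samples in per_object.items():
        media.tracks.append(DataTrack("object_position", name, samples))


media = MultimediaFile([DataTrack("video", "panoramic film"),
                        DataTrack("audio", "language 1")])
embed_object_positions(media, [
    (0.0, {"A": (1.0, 90.0, 10.0), "B": (1.0, 70.0, 200.0)}),
    (1.0, {"A": (1.0, 90.0, 20.0), "B": (1.0, 70.0, 190.0)}),
])
```

Because the header reports how many object position tracks exist, a player could decide how many icons to show before reading any position samples.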
After describing how to generate the multimedia file recorded with the object positions of the image objects in the panoramic film, the following embodiments will describe how to play the panoramic film according to the multimedia file of the present disclosure.
Fig. 6 is a block diagram of a multimedia file playing apparatus according to an embodiment of the present invention, which is for convenience of illustration only and is not intended to limit the present invention. Fig. 6 first introduces all the components of the multimedia file playing apparatus and their configuration; their detailed functions are disclosed together with fig. 7.
Referring to fig. 6, the multimedia file playing device 60 may be any electronic device with computing capability and image display capability, such as a desktop computer, a notebook computer, a smart phone, a tablet, and the like, which is not limited in the present invention. The multimedia file playing device 60 includes a processor 610, a storage device 620 and a screen 630.
The storage device 620 can be any type of fixed or removable random access memory, read only memory, flash memory, or the like, or any combination thereof. In the present embodiment, the storage device 620 is used for recording a movie receiving module 621, a data track extracting module 622, an interface providing module 623, and a movie playing module 624. In one embodiment, the module may be implemented as a software player.
The processor 610 is, for example, a central processing unit or other programmable general purpose or special purpose microprocessor, digital signal processor, programmable controller, application specific integrated circuit, programmable logic device, or the like, or a combination thereof, coupled to the storage device 620.
The screen 630 is used for displaying the image output by the multimedia file playing apparatus 60 for the user to watch. In the present embodiment, the screen 630 is, for example, a Liquid Crystal Display (LCD), a Light-Emitting Diode (LED) display, a Field Emission Display (FED), or another type of display.
In the present embodiment, the module stored in the storage device 620 is, for example, a computer program, and can be loaded by the processor 610 to execute the method for playing a multimedia file according to the present embodiment.
Fig. 7 is a flowchart illustrating a method for playing a multimedia file according to an embodiment of the present invention, and the method flow of fig. 7 can be implemented by the elements of the multimedia file playing apparatus 60 of fig. 6. Referring to fig. 6 and fig. 7, the following describes detailed steps of the method for playing a multimedia file according to the present embodiment in conjunction with the elements of the multimedia file playing apparatus 60 in fig. 6.
In step S701, the movie receiving module 621 receives a multimedia file including a panorama movie associated with a time axis. The movie receiving module 621 may receive a multimedia file including a panoramic movie via a wired or wireless network, and may also read the multimedia file stored in the storage device 620 or other external storage devices. In step S702, the track extraction module 622 extracts a first track of the multimedia file to obtain a plurality of first object positions of a first image object in the panorama film relative to a time axis. Specifically, the track extraction module 622 can demultiplex (demux) the multimedia file to obtain the multimedia data corresponding to each track. In one embodiment, the tracks of the multimedia file may include a video track, an audio track, a subtitle track, and an object position track. The track extraction module 622 can extract multimedia data classified into an object position track from the multimedia file, where the multimedia data classified into the object position track is a plurality of first object positions of a first image object in the panorama relative to a time axis. The object positions in the object position data track are described in detail in the foregoing embodiments, and are not described herein again. Similarly, the track extraction module 622 can also extract the video data classified into the video tracks from the multimedia file and decode the video data to obtain a plurality of video frames of the panorama film.
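The demultiplexing step can be sketched as grouping interleaved packets by track. The packet layout `(track id, timestamp, payload)` and the `header` mapping below are hypothetical stand-ins, not a real container format or a real library API.

```python
from collections import defaultdict
from typing import Any, Dict, List, Tuple

Packet = Tuple[int, float, Any]   # (track id, timestamp in seconds, payload)


def demux(packets: List[Packet]) -> Dict[int, List[Tuple[float, Any]]]:
    """Group interleaved packets by track id, keeping timestamp order."""
    tracks: Dict[int, List[Tuple[float, Any]]] = defaultdict(list)
    for track_id, timestamp, payload in packets:
        tracks[track_id].append((timestamp, payload))
    for samples in tracks.values():
        samples.sort(key=lambda s: s[0])
    return dict(tracks)


# Hypothetical header: maps each track id to the kind of data it carries.
header = {0: "video", 1: "audio", 2: "object_position"}

packets: List[Packet] = [
    (0, 0.0, b"<video frame 0>"),
    (2, 0.0, (1.0, 90.0, 10.0)),
    (1, 0.0, b"<audio chunk 0>"),
    (0, 1.0, b"<video frame 1>"),
    (2, 1.0, (1.0, 90.0, 20.0)),
]

tracks = demux(packets)
# Select the first object position track announced by the header.
first_object_positions = [
    samples for track_id, samples in tracks.items()
    if header[track_id] == "object_position"
][0]
```

The video samples would go on to a decoder, while the object position samples are already usable as timestamped coordinates.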
Then, in step S703, when the panorama film is played, the interface providing module 623 displays a picture corresponding to the first image object and shown on the screen 630. Specifically, the interface providing module 623 can provide a user interface of the player, which can include a frame playing area and a playing control column. It should be noted that by parsing the number of object location tracks in the header (e.g., the header 51 shown in fig. 5) of the multimedia file, the interface providing module 623 can know how many image objects are marked in advance in the movie content of the panorama film. Thus, while playing the panoramic film, the interface providing module 623 may display the image of the image object labeled in advance on the screen 630. The graphic representations can be any shape of interactive objects, and the names or representative patterns of the corresponding image objects are presented in each graphic representation to quickly guide the user to the emphasis on the panoramic image. In addition, each icon can be displayed on the edge of the playing frame or in the playing control column of the player, so as to avoid affecting the user's watching of the panoramic film.
The processor 610 then continuously detects whether the user selects any icon, and responds to the detection of the selection operation applied to a certain icon by the user. Therefore, in step S704, in response to detecting the selection operation applied to the icon, the movie playing module 624 determines a playing angle for playing the panoramic movie according to the position of the first object recorded in the first track, and plays the frame including the first image object based on the playing angle. That is, when the user selects the icon corresponding to the first image object, the film playing module 624 can obtain the current object position of the first image object in the panorama film from the object position data track. Then, the film playing module 624 can determine the playing angle of view according to the current object position of the first image object, and the playing frame will be shifted from the preset area of the panoramic film to the first area where the first image object is located, so that the user can quickly view the selected key object.
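How a playing view angle might be derived from a recorded object position can be sketched as follows. The source does not define the relation between (r, θ, ψ) and the view angle; this sketch assumes that θ is a polar angle in [0, 180] degrees, ψ an azimuth in degrees, and that the player describes its view with a yaw and a pitch. `ViewAngle` and `view_angle_for` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ViewAngle:
    yaw_deg: float     # horizontal rotation, in [-180, 180)
    pitch_deg: float   # vertical rotation, in [-90, 90]


def view_angle_for(position: Tuple[float, float, float]) -> ViewAngle:
    """Choose the view angle that centers the object in the playing frame.

    Assumed convention (not taken from the source): position is
    (r, theta, psi), theta is the polar angle in [0, 180] and psi is the
    azimuth, both in degrees. The radius does not affect the direction.
    """
    _r, theta, psi = position
    yaw = (psi + 180.0) % 360.0 - 180.0
    pitch = 90.0 - min(max(theta, 0.0), 180.0)
    return ViewAngle(yaw, pitch)


selected = view_angle_for((1.0, 60.0, 270.0))
```

Pointing the view straight at the object places it in the middle of the frame, which matches the behavior described for the selected image object.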
It is noted that the first object position of the selected first image object may change. Taking fig. 3B as an example, the first object position of the first image object may be changed from (r4, θ 4, ψ 4) to (r6, θ 6, ψ 6). If the playing angle is not adjusted, the first image object may disappear from the playing frame. In an embodiment, the movie playing module 624 may switch the playing angle according to the changed position of the first object again in response to recognizing the change of the position of the first object. Taking fig. 3B as an example, in response to the first object position of the first image object changing from (r4, θ 4, ψ 4) to (r6, θ 6, ψ 6), movie playback module 624 switches the playback view angle from the first view angle to the second view angle. Correspondingly, the playing frame is adjusted from the original first area to the second area where the first image object is located. That is, the movie playback module 624 plays the first area of the panoramic movie at the first viewing angle during the time interval P1, and then plays the second area of the panoramic movie at the second viewing angle during the time interval P2. In this way, the user can continuously view the selected key object without manually adjusting the play angle.
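The view switching across time intervals can be sketched as a loop over frame times that changes the view only when the recorded position changes. The function name `follow_object`, the half-open interval convention, and the sample coordinates are assumptions made for illustration.

```python
from typing import Iterator, List, Optional, Tuple

Position = Tuple[float, float, float]      # (r, theta, psi)
Interval = Tuple[float, float, Position]   # (start, end, position)


def follow_object(intervals: List[Interval],
                  frame_times: List[float]
                  ) -> Iterator[Tuple[float, Optional[Position], bool]]:
    """Yield (time, position used for the view, switched?) for each frame.

    The view is switched only when the recorded position differs from the
    one currently in use; if no interval covers a frame time, the previous
    position is kept.
    """
    current: Optional[Position] = None
    for t in frame_times:
        pos = next((p for start, end, p in intervals if start <= t < end), current)
        switched = pos != current
        current = pos
        yield t, pos, switched


intervals = [
    (0.0, 1.0, (1.0, 90.0, 10.0)),   # interval P1: first view angle
    (1.0, 2.0, (1.0, 90.0, 40.0)),   # interval P2: second view angle
]
log = list(follow_object(intervals, [0.0, 0.5, 1.0, 1.5]))
```

In this run the view is set at the first frame of P1 and switched once more at the first frame of P2, and it is left alone in between.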
It should be appreciated that there may be two or more image objects. In one embodiment, in addition to extracting the first data track of the multimedia file, the data track extraction module 622 can also extract a second data track of the multimedia file to obtain a plurality of second object positions of a second image object in the panoramic movie relative to the time axis. Thus, when the panoramic movie is played, the interface providing module 623 also displays another icon, corresponding to the second image object, on the screen 630. Then, in response to detecting a selection operation applied to the other icon, the movie playing module 624 switches the playback view angle according to the second object positions recorded in the second data track, and plays a frame including the second image object based on the switched view angle.
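To illustrate how several object position data tracks could drive icon selection, the following Python sketch keeps one track per labeled object and switches the followed object when an icon is selected. The class `PanoramaPlayer`, its method names, the default view `(0.0, 0.0)`, and the `(start, theta, psi)` tuple layout are all hypothetical and are used only for illustration.

```python
class PanoramaPlayer:
    """Toy model of selecting which image object the view angle follows."""

    DEFAULT_VIEW = (0.0, 0.0)  # assumed preset view when nothing is selected

    def __init__(self, tracks):
        # tracks: dict mapping an object name to a list of (start, theta, psi)
        # tuples sorted by ascending start time (hypothetical layout).
        self.tracks = tracks
        self.selected = None

    def icons(self):
        """One icon label per object position data track."""
        return list(self.tracks)

    def select(self, name):
        """Handle a selection operation applied to the icon labeled `name`."""
        if name not in self.tracks:
            raise KeyError(name)
        self.selected = name

    def view_at(self, t):
        """Return the (theta, psi) view angle to use at playback time t."""
        if self.selected is None:
            return self.DEFAULT_VIEW
        view = None
        for start, theta, psi in self.tracks[self.selected]:
            if start > t:
                break
            view = (theta, psi)
        return view if view is not None else self.DEFAULT_VIEW
```

Selecting a second icon simply replaces `selected`, so the next call to `view_at` reads from the second data track.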
For example, FIGS. 8A and 8B are schematic diagrams illustrating playback of a multimedia file according to an embodiment of the invention. Referring to FIG. 8A, when the multimedia file playing apparatus 60 plays a multimedia file generated according to the present disclosure, the panoramic movie is played along the time axis. The user can adjust the playback view angle of the panoramic movie by operating the virtual control button 82. From the header of the multimedia file, the multimedia file playing apparatus 60 can obtain the description of the characteristics of the object position data tracks and the number of object position data tracks, and thereby obtain the number of pre-labeled image objects, their object names, and so on. In this example, assuming that three image objects were labeled in advance, the multimedia file playing apparatus 60 displays three icons I1-I3 on the frame 80, and the three icons I1-I3 respectively show the representative names 'A', 'B' and 'C' of the three image objects.
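The step of deriving icons from the header can be sketched as below. The disclosure does not specify the container or header encoding in this passage, so the JSON layout and the field names `tracks`, `type`, `object_name`, and `object_position_track_count` are purely hypothetical stand-ins; the function `icon_labels` is likewise an illustrative name.

```python
import json


def icon_labels(header_json):
    """Return one icon label per object position data track in the header.

    The header is assumed (for illustration only) to be a JSON object that
    lists every data track and also records how many of them are object
    position data tracks.
    """
    header = json.loads(header_json)
    object_tracks = [
        track for track in header["tracks"]
        if track.get("type") == "object_position"
    ]
    expected = header["object_position_track_count"]
    if len(object_tracks) != expected:
        raise ValueError(
            f"header declares {expected} object position tracks, "
            f"found {len(object_tracks)}"
        )
    return [track["object_name"] for track in object_tracks]
```

For a header that lists one video track and three object position tracks named A, B, and C, the function returns the three names in header order, which a player could use as the labels of icons I1-I3.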
Assume that the user wants to watch the image object 83 corresponding to the icon I1 (i.e., the key character A). In response to detecting the user's selection operation on the icon I1, the multimedia file playing apparatus 60 determines the playback view angle for the panoramic movie according to the object position of the image object 83 recorded in the object position data track, and plays the frame 80 including the image object 83 at the view angle just determined. In this example, the selected image object 83 is located in the middle of the frame 80. Next, assume that the user wants to view the image object 84 corresponding to the icon I2 (i.e., the key character B). In response to detecting the user's selection operation on the icon I2, the multimedia file playing apparatus 60 switches the playback view angle according to the object position of the image object 84 recorded in the object position data track, and plays the frame 86 including the image object 84 based on the switched view angle. In this example, after the playback view angle is switched, the selected image object 84 is located in the middle of the frame 86.
In summary, in the embodiments of the present invention, the multimedia file including the panoramic movie further includes an object position data track in which the position information of an image object is recorded. The multimedia file generating device embeds the object positions of the image object into the multimedia file, so that the multimedia file playing device can immediately obtain the object position of a specific image object from the object position data track while playing the panoramic movie. Therefore, the user's multimedia file playing device does not need strong computing capability to recognize and track the image object. In addition, after the user selects an image object of interest, the multimedia file playing device can dynamically adjust the playback view angle of the panoramic movie according to the object position of that image object, thereby providing a playback function that tracks the specific image object. The user therefore does not need to manually adjust the playback view angle to keep the image object of interest in view, which greatly improves the convenience of watching a 360-degree movie. The invention also enables the user to quickly browse the key points of the panoramic movie, giving the user a direct and fast operating and viewing experience.
Although the present invention has been described with reference to the above embodiments, it should be understood that various changes and modifications can be made therein by those skilled in the art without departing from the spirit and scope of the invention.

Claims (20)

1. A method for generating a multimedia file, applicable to a multimedia file generating device, the method comprising:
obtaining a panoramic movie associated with a time axis, wherein the panoramic movie includes at least one image object;
obtaining a plurality of object positions of the at least one image object relative to the time axis;
compiling the plurality of object positions into an object position file; and
generating at least one data track of a multimedia file according to the object position file, so as to generate the multimedia file that includes the panoramic movie and records the plurality of object positions.

2. The method of claim 1, wherein the at least one image object includes a first image object and a second image object, the plurality of object positions relative to the time axis include a plurality of first object positions of the first image object and a plurality of second object positions of the second image object, and the step of generating the at least one data track of the multimedia file according to the object position file comprises:
embedding the plurality of first object positions in the object position file into a first data track; and
embedding the plurality of second object positions in the object position file into a second data track.

3. The method of claim 1, wherein the multimedia file includes a header and a plurality of data tracks, the plurality of data tracks include a video data track and at least one object position data track for recording the plurality of object positions, and the header records a description of the characteristics of the at least one object position data track and the number of the at least one object position data track.

4. The method of claim 1, wherein the plurality of object positions of the at least one image object respectively correspond to a plurality of time intervals on the time axis.

5. The method of claim 4, wherein the step of compiling the plurality of object positions into the object position file comprises:
mapping the plurality of object positions, recorded as a plurality of three-dimensional position coordinates, into a plurality of two-dimensional position coordinates, and recording the plurality of two-dimensional position coordinates in the object position file.

6. A multimedia file generating device, comprising:
a storage device storing a plurality of modules; and
a processor coupled to the storage device, which loads and executes the modules in the storage device, the modules comprising:
a movie obtaining module, obtaining a panoramic movie associated with a time axis, wherein the panoramic movie includes at least one image object;
a position obtaining module, obtaining a plurality of object positions of the at least one image object relative to the time axis;
a file making module, compiling the plurality of object positions into an object position file; and
a file embedding module, generating at least one data track of a multimedia file according to the object position file, so as to generate the multimedia file that includes the panoramic movie and records the plurality of object positions.

7. The multimedia file generating device of claim 6, wherein the at least one image object includes a first image object and a second image object, and the plurality of object positions relative to the time axis include a plurality of first object positions of the first image object and a plurality of second object positions of the second image object,
wherein the file embedding module embeds the plurality of first object positions in the object position file into a first data track, and embeds the plurality of second object positions in the object position file into a second data track.

8. The multimedia file generating device of claim 6, wherein the multimedia file includes a header and a plurality of data tracks, the plurality of data tracks include a video data track and at least one object position data track for recording the plurality of object positions, and the header records a description of the characteristics of the at least one object position data track and the number of the at least one object position data track.

9. The multimedia file generating device of claim 6, wherein the plurality of object positions of the at least one image object respectively correspond to a plurality of time intervals on the time axis.

10. The multimedia file generating device of claim 6, wherein the file making module further maps the plurality of object positions, recorded as a plurality of three-dimensional position coordinates, into a plurality of two-dimensional position coordinates, and records the plurality of two-dimensional position coordinates in the object position file.

11. A method for playing a multimedia file, applicable to a multimedia file playing device, the method comprising:
receiving a multimedia file including a panoramic movie associated with a time axis;
extracting a first data track of the multimedia file to obtain a plurality of first object positions of a first image object in the panoramic movie relative to the time axis;
when the panoramic movie is played, displaying an icon corresponding to the first image object in a frame on a screen; and
in response to detecting a selection operation applied to the icon, determining a playback view angle for playing the panoramic movie according to the plurality of first object positions recorded in the first data track, and playing a frame including the first image object based on the playback view angle.

12. The method of claim 11, wherein the step of, in response to detecting the selection operation applied to the icon, determining the playback view angle for playing the panoramic movie according to the plurality of first object positions recorded in the data track comprises:
switching the playback view angle in response to recognizing a change in the plurality of first object positions.

13. The method of claim 11, further comprising:
extracting a second data track of the multimedia file to obtain a plurality of second object positions of a second image object in the panoramic movie relative to the time axis; and
when the panoramic movie is played, displaying another icon corresponding to the second image object in the frame on the screen.

14. The method of claim 13, wherein after the step of, in response to detecting the selection operation applied to the icon, determining the playback view angle for playing the panoramic movie according to the plurality of first object positions recorded in the first data track and playing the frame including the first image object based on the playback view angle, the method further comprises:
in response to detecting a selection operation applied to the other icon, switching the playback view angle according to the plurality of second object positions recorded in the second data track, and playing a frame including the second image object based on the switched playback view angle.

15. The method of claim 11, wherein the multimedia file includes a header and a plurality of data tracks, the plurality of data tracks include a video data track and at least one object position data track for recording the plurality of object positions, and the header records a description of the characteristics of the at least one object position data track and the number of the at least one object position data track.

16. A multimedia file playing device, comprising:
a screen;
a storage device storing a plurality of modules; and
a processor coupled to the storage device and the screen, which loads and executes the modules in the storage device, the modules comprising:
a movie receiving module, receiving a multimedia file including a panoramic movie associated with a time axis;
a data track extraction module, extracting a first data track of the multimedia file to obtain a plurality of first object positions of a first image object in the panoramic movie relative to the time axis;
an interface providing module, displaying, when the panoramic movie is played, an icon corresponding to the first image object in a frame on the screen; and
a movie playing module, in response to detecting a selection operation applied to the icon, determining a playback view angle for playing the panoramic movie according to the plurality of first object positions recorded in the first data track, and playing a frame including the first image object based on the playback view angle.

17. The multimedia file playing device of claim 16, wherein the movie playing module switches the playback view angle in response to recognizing a change in the plurality of first object positions.

18. The multimedia file playing device of claim 16, wherein the data track extraction module extracts a second data track of the multimedia file to obtain a plurality of second object positions of a second image object in the panoramic movie relative to the time axis; and when the panoramic movie is played, the interface providing module displays another icon corresponding to the second image object in the frame on the screen.

19. The multimedia file playing device of claim 18, wherein the movie playing module, in response to detecting a selection operation applied to the other icon, switches the playback view angle according to the plurality of second object positions recorded in the second data track, and plays a frame including the second image object based on the switched playback view angle.

20. The multimedia file playing device of claim 16, wherein the multimedia file includes a header and a plurality of data tracks, the plurality of data tracks include a video data track and at least one object position data track for recording the plurality of object positions, and the header records a description of the characteristics of the at least one object position data track and the number of the at least one object position data track.
CN201811091556.0A 2018-09-19 2018-09-19 Multimedia file generating method, multimedia file playing method, multimedia file generating device and multimedia file playing device Active CN110929056B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811091556.0A CN110929056B (en) 2018-09-19 2018-09-19 Multimedia file generating method, multimedia file playing method, multimedia file generating device and multimedia file playing device


Publications (2)

Publication Number Publication Date
CN110929056A true CN110929056A (en) 2020-03-27
CN110929056B CN110929056B (en) 2023-04-07

Family

ID=69855054

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811091556.0A Active CN110929056B (en) 2018-09-19 2018-09-19 Multimedia file generating method, multimedia file playing method, multimedia file generating device and multimedia file playing device

Country Status (1)

Country Link
CN (1) CN110929056B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006004544A (en) * 2004-06-18 2006-01-05 Inventec Multimedia & Telecom Corp Multifunctional multimedia recording and reproducing apparatus
US20080117287A1 (en) * 2006-11-16 2008-05-22 Park Michael C Distributed video sensor panoramic imaging system
CN101369440A (en) * 2007-08-15 2009-02-18 凌阳科技股份有限公司 Multimedia file generating and playing method and recording medium for storing the file
US20120045149A1 (en) * 2010-03-18 2012-02-23 Panasonic Corporation Omnidirectional image processing device and omnidirectional image processing method
CN106331732A (en) * 2016-09-26 2017-01-11 北京疯景科技有限公司 Method for generating panoramic content, method for displaying panoramic content and corresponding devices
CN106445437A (en) * 2016-09-08 2017-02-22 深圳市金立通信设备有限公司 Terminal and view angle switching method thereof
US20170195576A1 (en) * 2016-01-05 2017-07-06 360fly, Inc. Dynamic field of view adjustment for panoramic video content
CN106954095A (en) * 2017-04-17 2017-07-14 腾讯科技(深圳)有限公司 The player method and device of a kind of multimedia file
CN107147824A (en) * 2016-06-22 2017-09-08 深圳市量子视觉科技有限公司 The output intent and device of multi-angle video
CN107633241A (en) * 2017-10-23 2018-01-26 三星电子(中国)研发中心 A kind of method and apparatus of panoramic video automatic marking and tracking object
CN107888987A (en) * 2016-09-29 2018-04-06 华为技术有限公司 A kind of panoramic video player method and device


Also Published As

Publication number Publication date
CN110929056B (en) 2023-04-07


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant