WO2004100545A1 - 情報処理装置及び方法,並びにプログラム及び記録媒体 - Google Patents
情報処理装置及び方法,並びにプログラム及び記録媒体 Download PDFInfo
- Publication number
- WO2004100545A1 WO2004100545A1 PCT/JP2004/005782 JP2004005782W WO2004100545A1 WO 2004100545 A1 WO2004100545 A1 WO 2004100545A1 JP 2004005782 W JP2004005782 W JP 2004005782W WO 2004100545 A1 WO2004100545 A1 WO 2004100545A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- stream
- video
- time
- buffer
- Prior art date
Links
- 230000010365 information processing Effects 0.000 title claims description 41
- 238000000034 method Methods 0.000 title claims description 33
- 239000000872 buffer Substances 0.000 claims abstract description 176
- 230000003139 buffering effect Effects 0.000 claims abstract description 31
- 101000701446 Homo sapiens Stanniocalcin-2 Proteins 0.000 claims description 43
- 102100030510 Stanniocalcin-2 Human genes 0.000 claims description 43
- 238000003672 processing method Methods 0.000 claims description 10
- 230000001360 synchronised effect Effects 0.000 claims description 10
- 210000001015 abdomen Anatomy 0.000 claims 1
- 101150103904 Clip2 gene Proteins 0.000 description 22
- 238000010586 diagram Methods 0.000 description 18
- 230000007704 transition Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 235000012976 tarts Nutrition 0.000 description 3
- 101150103583 ATC1 gene Proteins 0.000 description 1
- 102100037293 Atrial natriuretic peptide-converting enzyme Human genes 0.000 description 1
- 101000952934 Homo sapiens Atrial natriuretic peptide-converting enzyme Proteins 0.000 description 1
- 101100195263 Marchantia polymorpha rpl36 gene Proteins 0.000 description 1
- 235000018936 Vitellaria paradoxa Nutrition 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000009191 jumping Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- SRVJKTDHMYAMHA-WUXMJOGZSA-N thioacetazone Chemical compound CC(=O)NC1=CC=C(\C=N\NC(N)=S)C=C1 SRVJKTDHMYAMHA-WUXMJOGZSA-N 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/4302—Content synchronisation processes, e.g. decoder synchronisation
- H04N21/4305—Synchronising client clock from received content stream, e.g. locking decoder clock with encoder clock, extraction of the PCR packets
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23406—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving management of server-side video buffer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23424—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/4302—Content synchronisation processes, e.g. decoder synchronisation
- H04N21/4307—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
- H04N21/43072—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4392—Processing of audio elementary streams involving audio buffer management
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44004—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving video buffer management, e.g. video decoder buffer or video display buffer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44016—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/91—Television signal processing therefor
- H04N5/92—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
Definitions
- the present invention edits a multiplexed stream of a video stream and an audio stream with video frame accuracy, and reproduces editing points continuously (seamlessly).
- FIG. 1 is a block diagram showing a conventional DVR-STD model (DVR MPEG2 transport stream player model) (hereinafter referred to as a player) 101.
- DVR-STD is a conceptual model for modeling the decoding processing in the generation and verification of shea one Seamless two connected Tsu 1) 13 116 111 referenced ⁇ string Ichimu I Nyo.
- player 1 0 In 1 the reading unit (DVRdrive) 1 1 1 or et bitrate R UD in read TS (Transport Stream) file, buffers the re one Dobaffa 1 1 2
- the source packet is read from the read buffer 1 1 2 to the source depacketizer 1 1 3 Read at the rate R MAX .
- the pulse generator (27 MHz X-1) 114 generates a 27 MHz pulse.
- Arrival time clock counter (Arrival time clock counter) 115 is a binary counter that counts the pulses of the frequency of 27 MHz.
- the source depacketizer 113 generates an Arrival time clock at time t (i). Provides the count value of the counter Arrival— time— clock (i).
- One source packet has one transport packet and its Ume-stamp.
- the current source When the ar rival—time—stamp of the packet is equal to the value of the least significant bit (LSB) of the arrival—time_clock (i) 30 bits, the source depacketizer section 113 sends the transformer. Port packets are output.
- TS_recording_rate is the bit rate of the transport stream (hereinafter referred to as TS).
- TS bit rate of the transport stream
- T-STD transport stream system target decoder specified in ISO / IEC 13818-1 of (MPEG2 Systems standard).
- the decoding process when playing one DVR MPEG2 TS will be described.
- the timing for outputting a transport packet from the output unit 110 and inputting it to the TB1, TBn, or TBsys buffer of the DVR-STD, which is the decoder 120 is Arriva time of the source packet, determined by the time stamp.
- the specification of buffering operation of TB1, MB1, EB1, TBn, Bn, TBsys and TBsys is the same as T-STD specified in IS0 / IEC 13818-1.
- the specification of decryption operation and display operation is also the same as T-STD specified in IS0 / IEC 13818-1.
- the time axis of TS2's initial time base is not the same as the time axis of TS1's arrival time base.
- the time axis of the system time base of TS2 is not the same as the time axis of the system time base of TS1.
- Video display must be seamless and continuous.
- the presentation time of the audio presentation unit may overlap.
- the time until time T1 is determined by the arrival timing of the source packet of TS1 at the timing of input to the buffer of TB1, TBn or TBsys of the DVR-STD.
- TS_recording-rate (TSl) (the maximum bit rate of TS1).
- TS_recording-rate (TSl) is a value of TS_recording-rate defined in CI iplnfo 0 corresponding to CI ip 1.
- the time when the last byte of TS1 enters the buffer is time T2. Therefore, in the section from time T 1 ′ to T 2, arrival-Ume_stamp of the source packet is ignored.
- T 1 Is the number of bytes of the transport packet of TS1 following the last video packet of TS1, and the time from time T1 to T2 2—T 1 is the time required for a byte to finish inputting at a bit rate of TS—recording—rate (TSl), and is calculated as in equation (1) below. From time T1 to ⁇ 2, the values of Rxn and Rxsys shown in FIG. 1 both change to the value of TS_recording rate (TSl). Buffering operations other than this rule are T-STD and Is the same.
- the audio decoder can process the input data in the section from time T1 to T2, that is, between time T1 and T2, the values of Rxn and Rxsys shown in FIG. Since TS_recording changes to the value of one rate (TS1), an additional buffer amount (data amount of about 1 second) is required in addition to the buffer amount defined in T-STD.
- the arrival time clock counter 1 15 is reset to the value of arrival-time-s mp of the first source packet of TS2.
- the input timing to the TB1 TBn or TBsys buffer of the DVR-STD is determined by the arrival—time_sta instant of the source packet of TS2. Both Rxn and Rxsys change to the values defined in T-STD.
- the presentation of the video presentation unit must be continuous without gaps through the connection points.
- STC2 TS2 system time base time axis (Accurately, STC2 is the first PC of TS2
- the offset value between S1 and STC2 is determined as follows.
- PTS nd PT S on STC1 corresponding to the last video presentation unit of TS1
- PTS2start PTS on STC2 corresponding to the first video presentation unit of TS2
- Tpp Display period of the last video presentation unit
- the system time clock may overlap between times T2 and T5.
- the DVR-STD switches the system time clock from the old time base value (STC1) to the new time base value (STC2).
- STC2 can be calculated as in the following equation (3).
- the following describes the encoding conditions that must be satisfied by TS1 and TS2 when moving from TS1 to the next TS2 that is seamlessly connected to TS1.
- STC vid.cend The value of the STC on the system time base STC1 when the last byte of the last video packet of TS1 arrives at the DVR-STD TBI.
- STC2 1 video_end STC1 1 vide. in terms of the value of the _e n d to the value on the system time base Ichisu STC2 value
- the arrival timing of the first video packet of TS2 at the TBI is determined by the following inequality
- the size of this buffer capacity is based on the following factors. That is, in MPEG2TS, audio data synchronized with video data at a certain byte position can exist at a multiplexed phase difference within a predetermined range, and the maximum value of the multiplexed phase difference is 1 This is equivalent to the amount of data for seconds. Therefore, the maximum value of N1 in the above equation (1) corresponds to a maximum of one second of audio data. Between time T1 and T2 Since the source packet of N1 data at the maximum bit rate of TS is input to the audio buffer, ignoring time_s mp of the received packet and the time_s mp, T- In addition to the buffer amount defined in the STD, an additional buffer amount (about one second of data) was required.
- the audio data for one second is 24bitSampleX960000 samples / secX. 8 channels: Approximately 18 Mbits, an additional buffer of about 3 Mbytes is required, and when handling such multi-channel audio data, there is a problem that this additional buffer becomes extremely large. .
- DISCLOSURE OF THE INVENTION The present invention has been proposed in view of such a conventional situation, and seamlessly decodes two multiplexed streams in which an audio stream and a video stream are multiplexed in a seamless manner.
- an information processing apparatus comprises a data string in units of a source packet having a transport packet and its corresponding time stamp, and In an information processing apparatus for decoding a multiplexed stream connected so that a second picture, which is the first picture of the second multiplexed stream, is continuously reproduced from a first picture, which is the last picture, The software according to the arrival time stamp of the multiplexed stream.
- An output means for outputting a packet, a video buffer for buffering video data among the source packets, an audio buffer for buffering audio data among the source packets, and a buffer for the video buffer 7.
- Video decoding means for decoding the ringed video data; and audio decoding means for decoding audio data buffered in the audio buffer, wherein the audio buffer converts the second picture into the video data. It has a capacity to buffer audio data for the time required for input to the buffer.
- the size of the audio buffer is set to a capacity capable of buffering audio data for a time required for inputting the second picture to the video buffer, and the first picture is Even between the time when input to the video buffer ends and the time when input of the last source packet of the first multiplexed stream ends, the original time stamp of the source packet of the multiplexed stream (arriva and time_immediate)
- the source packet is input to the buffer according to, so that it was necessary to input the source stream at the maximum bit rate of the transport stream (TS), ignoring the source packet's arrival_time_stainp. Eliminates the need for an additional buffer of one second, and adds the last stream of the first multiplex stream.
- the first decoded picture of the second multiplexed stream can be input to the video buffer by its decoding timing.
- the required capacity of the audio buffer is EBnjnax (bits), the bit amount of the second picture is Ijnax (bits), and the input bit rate to the video buffer is Rv (bps).
- the bit rate of data is Ra (bps)
- EBn_max (I_max / v) X Ra can be satisfied
- the maximum value of the bit amount of the second picture for example, I picture Is Ijnax
- the audio buffer capacity is (I_max / Rv) X Ra.
- the audio buffer has a capacity capable of buffering audio data for 100 milliseconds, so that, for example, the data amount of MPEG-2 I picture is normally transmitted in one second. Less than 10% of data volume Therefore, if the audio buffer is set to the same size and the audio data of that size is postponed, the I picture can be input to the video buffer by the decoding timing. Data encoding restrictions are reduced. That is, by setting the audio buffer to have the above-mentioned capacity, the multiplexed stream can be multiplexed so that the input of the audio data ends 100 ms earlier than the timing at which the audio data is reproduced.
- the multiplexed stream satisfies STC2 2 start > STC2 1 end In accordance with DVR—STD It can be.
- the first source packet of the second multiplexed stream is output from the output means after a predetermined time ATC-delta has elapsed.
- the predetermined time ATC_del is determined so as to satisfy the value of the time difference STC_delta, and the multiplexed stream is multiplexed so as to satisfy the value of the time difference STC_delta. This allows flexibility in the input timing of the first source packet of the second multiplexed stream and facilitates encoding of the second multiplexed stream. .
- the value of the predetermined time ATC-delta can be managed as additional information of the first multiplexed stream.
- the information processing method comprises a data string in units of a source packet having a transport packet and its arrival time stamp, and includes a first picture as a last picture of the first multiplexed stream.
- the information processing method for decoding a multiplexed stream connected so that a second picture, which is the first picture of the second multiplexed stream, is continuously reproduced, the above-described arrival time of the multiplexed stream An output step of outputting the source bucket according to the stamp; a buffering step of buffering video data of the source packets in a video buffer and buffering audio data in an audio buffer 7; Video buffered in the audio buffer and A decoding step of decoding audio data, and in the buffering step, audio data for a time required for inputting the second picture to the video buffer is stored in the video buffer. It is characterized in that buffering is performed in the audio buffer 7 before buffering in the buffer.
- a program according to the present invention causes a computer to execute the above-described information processing, and a recording medium according to the present invention stores such a program and is readable by a computer.
- Another recording medium is a recording medium on which a multiplexed stream composed of a data stream in units of a source packet having a transport packet and an arrival time stamp thereof is recorded.
- the multiplexed stream is connected so that the first picture, which is the last picture of the first multiplexed stream, is successively played with the second picture, which is the first picture of the second multiplexed stream,
- the first and second multiplexing streams can be input to a decoder based on respective arrival timestamps, and the second picture is input to a decoder
- the audio data for the time required to input the second picture to the decoder is multiplexed so that the input can be completed before the input of the second picture to the decoder starts. Since the streams are multiplexed, such a multiplexed stream has an audio buffer with a capacity capable of buffering audio data for the time required for inputting the second picture to the video buffer. After decoding by the decoder, after the last transport packet of the first multiplexed stream is input, the first decoded picture of the second multiplexed stream is input to the video buffer by the decoding timing. be able to.
- Another information processing apparatus comprises a data string in units of a source packet having a transport bucket and its corresponding time stamp, and is read and decoded by a decoder based on the corresponding time stamp.
- a first video coded stream that ends display at a first picture is generated, and display is started from a second picture that is displayed following the first picture.
- Video encoding means for generating a second video encoding stream to be encoded, and multiplexing the first video encoded stream with an audio encoded stream synchronized with the first video encoding stream. To generate a first multiplexed stream, and the second video coding stream and the second video coding stream.
- a second multiplexing stream is generated by multiplexing with the audio coding stream synchronized with the frame, and the second picture is added to the first picture, which is the last picture of the first multiplexed stream.
- Audio data for a time required for inputting a second picture to the decoder is multiplexed so that input can be completed before inputting of the second picture to the decoder is started. .
- the input of the second picture to the decoder is started, and the input of the second picture to the decoder is started for audio data for a time required, for example, 100 milliseconds.
- the decoder In order to perform multiplexing so that input can be completed beforehand, the decoder must forward the audio data to the audio buffer and allow enough time to transmit the second picture, such as an I-picture, by the decoding timing. Multiplex stream can be easily encoded.
- Another information processing method comprises a data string in units of a source packet having a transport packet and an equivalent time stamp, and is read and decoded by a decoder based on the equivalent time stamp.
- a first video coded stream ending display at a first picture is generated, and display is started from a second picture displayed following the first picture.
- the second multiplexed stream is generated by multiplexing the audio coded stream synchronized with the video stream, and the second picture is added to the first picture, which is the last picture of the first multiplexed stream.
- FIG. 1 is a block diagram showing a conventional information processing apparatus.
- FIG. 2 is a schematic diagram showing the relationship between the preceding Playltem and the current Playltem when using Bridge-Clip.
- FIG. 3 is a schematic diagram showing the relationship between the preceding Playltem and the current Playltem when Bridge-Clip is not used.
- Fig. 4 is a schematic diagram showing an example of seamless connection between Clipl and Clip2 as video streams in the picture display order (Presen Uon order).
- Fig. 5 is a schematic diagram showing the data stream in each AV stream that achieves seamless connection using BridgeSeauence, which is the first method when the display video streams (ipl and Clip2) are connected seamlessly in Fig. 4. It is.
- Fig. 6 is a schematic diagram showing the data stream of each AV stream that realizes seamless connection without using BridgeSetiuence, which is the second method, when the video streams (Clipl and Clip2) shown in Fig. 4 are connected seamlessly. is there.
- FIG. 7 is a diagram for explaining the overlap of the display of audio, and is a schematic diagram showing a presentation unit of video and a presentation unit of audio in TS1 and TS2.
- FIG. 8 is a block diagram showing the information processing device according to the embodiment of the present invention.
- Fig. 9 is a timing chart of input, decoding, and display of transport packets when moving from one AV stream (TS1) to the next AV stream (TS2) that is seamlessly connected to it.
- Figure 10 shows another example of input, decoding, and display of transport packets when moving from one AV stream (TS1) to the next AV stream (TS2) that is seamlessly connected to it. It is.
- Figure 11 shows another example of transport packet input, decoding and display when moving from one AV stream (TS1) to the next AV stream (TS2) that is seamlessly connected to it. It is.
- FIG. 12 is a diagram showing a data format of the additional information ClipInfoO for storing ATC_delta.
- FIG. 13 is a schematic diagram showing the additional information ClipInfoO when there are a plurality of next AV streams (TS2) connected to a certain AV stream (TS1).
- Fig. 14 shows the case of the conventional DVR-STD.
- the audio buffer size is 4kBy tes
- FIG. 7 is a graph showing an example of a change in the bit occupancy of a buffer.
- FIG. 15 is a diagram for explaining the effect of the embodiment of the present invention.
- the audio buffer size is 8 kBytes
- the transition from TS1 to the next TS2 seamlessly connected to TS1 is performed.
- FIG. 9 is a graph showing an example of a change in bit occupancy of a video buffer and an audio buffer of DVR-STD.
- BEST MODE FOR CARRYING OUT THE INVENTION will be described in detail with reference to the drawings.
- the present invention is applied to an information processing apparatus that seamlessly and continuously reproduces two AV streams in which a video stream and an audio stream are multiplexed.
- the DVR-STD the DVR-STD
- Clip indicates a multiplexed stream of a video stream and an audio stream.
- the Play List indicates a set of playback sections in the Clip.
- One playback section in a Clip is called Playltem, which is represented by a pair of IN point and OUT point on the time axis. Therefore, a PlayList is a collection of Playltems.
- Playing seamlessly between Playltems means that the playback device (player) can display (play) audio video data recorded on the disc without causing pauses or gaps in the playback output of the decoder. It is.
- the structure of two seamlessly connected Playltems will be described. Whether the connection between the previous Playltem and the current Playltem is guaranteed to be seamlessly displayed depends on the connection conditions defined in the current Playltem. You can judge from the field.
- the seamless connection between Playltems includes a method using Bridge-Clip (BridgeSeauence) (hereinafter referred to as the first method) and a method not using it (hereinafter referred to as the second method).
- FIG. 2 is a schematic diagram showing the relationship between the preceding Playltem PI1 and the current Playltem PI2 when Bridge-Clip is used as the first method.
- the stream data read out by the player when Bridge-Clip is used is shaded.
- the transport stream (TS) is composed of an integer number of Aligned units.
- the size of the Aligned unit is 6144 bytes (2048 x 3 bytes).
- One Aligned unit consists of 32 source packets and starts from the first byte of the source packet.
- Each source packet is 192 bytes long.
- One source packet consists of a TP_extra_header and a transport packet.
- TP_extra_lieader is 4 bytes long, and a transport packet is 188 bytes long.
- the TP—extra—header has a copy—pr emission—indicator and an arrival—time—s tamp, and the copy—p remission—indicator is an integer that indicates the copy limit of the transport packet payload.
- arrival_time_stamp (ATS) is a time stamp indicating the time at which the corresponding transport packet arrives at the decoder in the AV stream.
- the time axis created based on the arriva of each source packet constituting the AV stream based on the time stamp is called the arrival time base, and its clock is ATC.
- TS1 (first multiplexed stream) shown in Fig. 2 is a stream stream showing D1 with the shadow of Clipl (Clip AV stream) and SPlarriva of Bridge-Clip, and the shadow before timejiscon tinuity.
- Stream data D2 shown in FIG. SPN_arrival_time — discontinuity indicates the address of the source packet where there is a discontinuity in the arrival time base in the Bridge-Clip AV stream file.
- the stream data D 1 shown with the shadow of Clipl included in TS 1 is From the address of the stream needed to decode the presentation unit corresponding to IN-time of PI ay It em (indicated by IN-time 1 in Figure 2), refer to SPN-exi from- previous_Clip Stream data up to the source packet to be transmitted.
- stream data D 2 shown with a shadow before SPN—arrival—time—discontinuity of the Bridge-Clip included in TS1 is referred to by SPN_arriva and tinie_discontiiiuity from the first source packet of the Bridge-Clip. It is the stream data up to the source packet just before the source packet.
- the TS2 (second multiplex stream) shown in Fig. 2 is a stream stream D4 shown with the shadow of Clip2 (Clip AV stream) and SPN_arrival-time of Bridge-Clip. It consists of stream data D 3 shown with a shadow after discontinuity.
- the stream D3 which is indicated by the shadow after the SPN_arriva time-discontinuity of the Bridge-Clip included in TS2, is the last source of the Bridge-Clip from the source packet referenced by the SPN_arrival-tiine_discontinuity. This is a short walk to the bucket.
- stream data D4 indicated by adding a shadow of Clip2 included in TS2 is obtained from the source packet referenced by SPN_enter—current—Clip from 0UT_time of the current Playltem (OUT—time in FIG. 2). This is indicated by 2.)
- FIG. 3 is a schematic diagram showing the relationship between the preceding Playltem PI1 and the current Playltem PI2 when Bridge-Clip, which is the second method, is not used.
- the stream data read out by the player is shaded.
- TS1 (first multiplexed stream) shown in FIG. 3 is composed of stream data D5 shown with a shadow of Clipl (Clip AV stream).
- Stream data D5 shown with the shadow of Clipl included in TS1 is necessary to decode the presentation unit corresponding to IN-time (shown as IN time 1 in Fig. 3) of the preceding Playltem. It starts with the address of the best stream and ends with the last source packet of Clip 1.
- TS2 (second multiplexed stream) shown in FIG. 3 is composed of stream data D6 shown with a shadow of Clip2 (Clip AV stream).
- the stream data D6 shown with the shadow of Clip2 included in TS2 starts from the first source packet source of Clip2, and corresponds to the present Playltem's 0UT_time (indicated by OUT-time2 in Fig. 3). This is a stream stream up to the stream address required to decrypt one unit.
- TS1 and TS2 are continuous streams of source packets. Next, TS1 and TS2 stream specifications and connection conditions between them will be described.
- TS1 and TS2 are obtained by multiplexing a video stream and an audio stream.
- the limitation of the video bit stream in the encoding restriction for seamless connection will be described first.
- FIG. 4 is a schematic diagram showing an example of seamless connection between Clipl and Clip2 as a video stream in a picture display order.
- an art-point-side program which is a program temporally ahead of the out-point picture, which is the skip reproduction start point
- an in-point picture which is a point at which skip reproduction is reached
- the video stream In order to seamlessly connect the in-point side program, which is the program on the rear side in time, the video stream must be re-encoded in the decoding device.
- the GOP (group of pictures), which is a unit of a group of pictures according to the MPEG2 standard, includes at least one I (Intra) picture (reference picture) in which the picture is coded without predictive coding from other pictures.
- I Intra
- P predict iv
- B bi-directionally pictures, which are bi-directionally predicted coded images obtained by coding the images using coding.
- each number of Clipl and Clip2 indicates the display order
- I, P, B or i, p, b indicates the type of each picture.
- Fig. 5 shows an example of seamless connection using the first method, BridgeSeauence, when seamlessly connecting video streams (Clipl and Clip2) as shown in Fig. 4.
- the video stream of the Bridge-CI ip before the SPN—arrival—time one discontinuity consists of the coded video stream up to the picture corresponding to 0UT_Umel of Clipl shown in Fig. 4. Then, the video stream is connected to the preceding Clipl video stream, and is re-encoded so that one continuous elementary stream conforms to the MPEG2 standard.
- the video stream of Bridge-Clip after SPN_arrival_time-discontinuity is composed of an encoded video stream after the picture corresponding to IN_time 2 of CI ip 2 in FIG.
- the video stream can start decoding correctly, is connected to the subsequent Clip2 video stream, and is re-encoded so as to become an elementary stream according to the MPEG2 standard in one continuous stream.
- In order to create a Bridge-Clip generally, several pictures must be re-encoded, and the other pictures can be copied from the original Clip.
- FIG. 5 shows the Clipl shown in Fig. 4 in the decoding order, and the player changes the source packet number (SPN_exit_from_previous_Clip) of P5 of the preceding Clipl to the Bridge-Clip shown in (b).
- D2 shown in Fig. 2 of Bridge-CI ip that is, the stream data on the OUT-timel side of Clipl corresponding to the video data before SPN-arrival-time-discontinuity in the BridgeSeduence
- the data dl up to and including B4 consist of the data directly copied from Clipl, and subsequent data P7 and B6 originally consist of B6 and B7 of the original Clipl.
- the image data is returned to the image data that has not been processed, and then encoded again to obtain data d2 that is P7 and B6.
- D3 shown in Fig. 2 of Bridge-Clip that is, the video data after time_discontinuity after SPN-arrival in BridgeSeauence.
- b4, p5, p8, b6, and b7 of the original Clip2 are image data that has not been decoded by compressing the CI ip2 once and compressed.
- FIG. 6 shows an example of realizing a seamless connection without using BridgeSeauence, which is the second method, when seamlessly connecting video streams (Clipl and Clip2) as shown in FIG.
- Clipl and Clip2 show pictures in their decoding order.
- BridgeSeauence is not used, as in the case of using BridgeSeauence shown in Fig. 5, streams near the connection point (connection point) are not decoded and compressed once. And then re-decoded to the optimal picture type. That is, Clipl's video stream consists of an encoded video stream up to the picture corresponding to OUTLUmel in Fig. 4, and it is an original video stream that follows the MPEG2 standard in one continuous stream.
- B6 and B7 of the Clipl are re-encoded data (P7, B6) d5.
- the video stream of Clip2 consists of the coded video stream after the picture corresponding to IN_time2 of Clip2 in Fig. 4, and it is a continuous elementary stream in accordance with the MPEG2 standard.
- b4, p5, p8, b6, and b7 of the original Clip2 are re-encoded as de-even (i0, pi, 4, b2, b3) d6.
- FIG. 7 is a diagram for explaining the overlap of the audio display, and is a schematic diagram showing the video presentation units VPU1 and VPU2 and the audio presentation units ⁇ and APU2 in TS1 and TS2.
- the last audio frame A_end of the audio stream of TS1 includes an audio sample having a display time equal to the display end time (OUT-time) of the last display picture of TS1.
- the first audio frame A starU of the audio stream of TS2 or the display start of the first display picture of TS2 (IN _time2) contains an audio sample with a display time equal to
- Lap (Audio over lap) occurs.
- the TS at the connection point is a DVR MPEG2TS according to a DVR-STD (Digital Video Recording-System Target Decoder) described later.
- DVR-STD is a conceptual model for modeling the decoding process when generating and verifying an AV stream referenced by two seamlessly connected yltems.
- Figure 8 shows the DVR-STD model (DVR MPEG2 transport stream player mode 1).
- the information processing apparatus (DVR MPEG2 transport stream player model, hereinafter referred to as a player) 1 is composed of transport packets (TS) from TSs connected to be seamlessly reproduced. It comprises an output unit 10 for reading and outputting the data, and a decoder (DVR-STD) 20 for decoding the transport packet from the output unit 10. As will be described later, this decoder 20 is obtained by changing the input timing of the transport packet and the capacity of the audio buffer in the conventional DVR-STD described above.
- the output unit 10 the TS file read from the read unit (DVRdrive) 11 at the read rate R UD is buffered in the read buffer 12, and the source packets (Source packets) are read from the read buffer 12.
- R MAX is the bit rate of the source packet stream.
- the pulse generator (27 MHz X-tal) 14 generates a 27 MHz pulse.
- Arrival time clock counter (Arrival time clock counter) 15 is a binary counter that counts the pulses of the frequency of 27 MHz.
- the source depacketizer section 13 counts the Arrival time clock counter at time t (i). Supplies the value Arriva one time one clock one clock (i).
- one source packet has one transport packet and its arrival time_stamp.
- Arrival time—stamp of the current source packet
- TS recording_rate is the bit rate of the TS.
- n, TBn, MBn, EBn, TBsys, Bsys, Rxn, Rbxn, Rxsys, Dn, Dsys, On and Pn (k) shown in Fig. 8 is IS0 / IEC13818-1 (MPEG2 Systems standard).
- n Index number of the elementary stream
- MBn Multiplex buffer for elementary stream n (exists only for video stream)
- TBsys Input buffer for system information of the program being decrypted
- Bsys Main buffer in the system target decoder for system information of the program being decoded
- Rxn Transmission rate at which data is stripped from TBn
- Rbxn Transmission rate at which PES packet packets are stripped from MBn (only present for video streams)
- Rxsys Transmission rate at which data is removed from TBsys
- Dsys Decoder for system information of the program being decrypted
- the timing at which transport packets are input to the TB1, TBn or TBsys buffer is determined by the arrival_time_st a of the source packet.
- the specification of the buffering operation of TB1, MB1, EB1, TBn, Bn, TBsys and TBsys is the same as T-STD specified in IS 0 / IEC 13818-1.
- the specification of decoding operation and display operation is also the same as T-STD specified in IS0 / IEC 13818-1.
- Fig. 9 is a timing chart of input, decoding, and display of a transport packet when moving from one AV stream (TS1) to the next AV stream (TS2) that is seamlessly connected to it. is there.
- TS1 is the preceding stream
- TS2 is the current stream.
- Each packet divided by TS1 and TS2 represents source packets SP1 and SP2 of TS1 and TS2.
- the time axis of the TS2's initial time base (indicated by ATC2 in Figure 9) Is not the same as the time axis of TS1's arrival time base (indicated by ATC1 in Figure 9). Also, the time axis of the system time base of TS2 (indicated by STC2 in FIG. 9) is not the same as the time axis of the system time base of TS1 (indicated by STC1 in FIG. 9).
- Video display is required to be seamless and continuous. The display time of the audio presentation unit may overlap.
- the audio buffer is made to have an optimum capacity by changing the following two points with respect to the player 101 described in Japanese Patent Application Publication No. 2002-1585894.
- the first change is that when moving from one AV stream (TS1) to the next AV stream (TS2) that is seamlessly connected to it, the decoder 20 up to the last packet of TS1 is transferred.
- the input is determined by the arrival_l; ime_stamp of those source packets.
- the arrival_Ume_stamp is ignored from the time T1 when the last video packet of TS1 ends to be input to the TBI until the time T2 when the last bit of TS1 is completely input to the TBI.
- the input of the source packet between T1 and T2 is performed at time T1
- the source packet of TS1 is determined according to & 1 "1: 31_1: 11116-513 ⁇ .
- the maximum bit rate of TS is ignored, ignoring the arrival-time_stamp of the conventional source packet.
- additional buffers of one second which is required to enter in R MAX becomes unnecessary.
- the input of the decoder 20, TB1, TBn or TBsys to the buffer of the TS1 is performed by the source packet of the TS1. Determined by the time_stamp of the SP1 arriva.
- the timing at which the remaining packets of TS1 are input to the decoder 20 is also determined by Ume_stamp of the source packet SP1 of TS1.
- the time when the last byte of TS1 enters the buffer is time T2.
- the initial time clock counter 15 is reset to the value immediately after arrival_time_sta of the first source packet of TS2.
- the input timing to the buffer of TB1, TBn or TBsys of the decoder 20 is determined by the arrival_time stamp of the source packet SP2 of TS2.
- the input timing to the decoder 20 is such that the input timing to the TB1, TBn or TBsys buffer of the decoder 20 is the source packet SP1 of the TS1 until the time T2 when the last byte of the TS1 is input to the buffer.
- the time_sta is determined by the time_sta immediately, and the time after T 2 is determined by the arrival-time-stamp of the source bucket SP2 of the TS2.
- connection point As shown in FIG. 2 or FIG. (Points) without gaps. That is,
- the last video data of TS1 (first picture) is added to the first video data of TS2 (first picture).
- STC1 Time axis of the system time base of TS1 (shown as STC1 in Figure 9)
- STC2 The time axis of the system time base of TS2 (shown as STC2 in Figure 9).
- STC2 starts from the time when the first PCR ((Program Clock Refarence)) of TS2 is input to T-STD.
- the offset between STC1 and STC2 is determined as follows. That is, PTS nd: PTS on STC1 corresponding to the last video presentation unit of TS1
- PTS2start PTS on STC2 corresponding to the first video presentation unit of TS2
- T PP Display period of the last video presentation unit of TS1
- the offset value SK_delta between the two system time bases is calculated as in the following equation (6).
- the process performed by the player for controlling the system time clock of the decoder 20 when moving from TS1 to the next TS2 seamlessly connected to TS1 will be described.
- the last audio presentation unit of TS1 is displayed.
- the system time clock may overlap between times T2 and T5.
- the decoder 20 switches the system time clock from the old time base value (STC1) to the new time base value (STC2).
- the value of ST C2 can be calculated as in the following equation (7).
- STCPend The value of the STC on the system time base STC1 when the last byte of the last packet of TS1 arrives at the decoder 20
- TS2 of the first of the first byte is the system time base Ichisu STC2 on the STC value at the time of arrival to the decoder 2 0 of Baketsuto
- FIG. 10 is another example of a timing chart of input, decoding, and display of a transport packet when moving from a certain AV stream (TS1) to the next AV stream (TS2) seamlessly connected to it.
- the input to the decoder 20 up to the last packet of TS1 is determined by arriving and time_stainp of those source buckets in the same manner, but with one difference from the evening timing chart shown in FIG. Therefore, as shown in FIG. 10, a predetermined time interval (deltal: between times T2 and T2 ') is set so that it is not necessary to input the first packet of TS2 immediately after the last packet of TS1.
- This makes it possible to determine the input timing of the first packet of TS2 more flexibly than in the case of FIG. 9, and thus has an effect of facilitating the encoding of TS2.
- the input timing to the buffer of TB1, TBn or TBsys of decoder 20 is the source packet of TS1 SP1 arrival—determined by time_stamp.
- arrival time clock counter 15 is reset to the value of arrival_time_stamp of the first source bucket of TS 2.
- Decoder 20 input to TB1, TBn or TBsys buffer The arrival is determined by the arrival-time-stamp of the source packet SP2 of TS2.
- STC2 2 s tart and STC2 nd above must satisfy the following relational expression (1 0).
- FIG. 11 shows the transport when moving from one AV stream (TS1) to the next AV stream (TS2) that is seamlessly connected to it.
- 9 is another example of a timing chart for inputting, decoding, and displaying a packet.
- the input to the decoder 20 up to the last packet of TS1 is determined by the arriving of those source packets and time_sta immediately, but the difference from the evening timing chart shown in FIG. 10 is one point.
- a predetermined time interval (ATC_delta: between times T2 and T2 ') is provided. This makes it possible to determine the input timing of the first packet of TS2 more flexibly than in the case of FIG. 9, and thus has an effect of facilitating the encoding of TS2.
- the input timing of ⁇ 1, ⁇ of the decoder 20 or the buffer of TBsys is the source packet SP1 of the TS1. Arrival_time_stanip.
- Time T 2 ′ is the time at which the first packet of TS 2 is input to decoder 20.
- ATC_delta is an offset time from the arrival time stamp (time on the ATCl) of the last bucket of TS1 to the time T2 'projected on the ATCl.
- arrival time clock counter 15 is reset to the value of arrival time stamp of the first source packet of TS 2.
- Decoder 2 The input timing of 0 to the buffer of TB1, TBn or TBsys is determined by the time_stamp of the source packet SP2 of TS2.
- ATC-delta is determined so as to satisfy STC_delta in the above equation (6).
- ATC_delta is managed as additional information of the stream data.
- the value of ATC-delta is managed as information attached to TS1.
- Fig. 12 shows the data format of the additional information ClipInioO for storing ATC_delta.
- a flag indicating whether or not is-ATC-dell ⁇ , CI iplnfo 0 has the value of ATC_delta. Multiple values can be registered in Cliplnio 0. This is to allow a plurality of TS2s to be connected to TS1 as shown in FIG. If the is_ATC_delta flag is 1, number_o ATC-delta_entries indicates the number of ATC-deltas registered in this CI iplnfo0.
- following-Clip-Infomation-file-name is the name of the stream of TS2 connected to TSl.
- Clip Iniomation_file If there is more than one TS2 corresponding to one name, the value of ATC_delta for each TS2 is registered in Cliplnfo 0.
- ClipInioO contains the above-mentioned AT C-delta information, and ATC_delta is obtained by the DVR-STD model controller (not shown in FIG. 8) in a predetermined manner in switching between TS1 and TS2 as described above. Will be treated.
- the size of the audio buffer of the decoder 20 is changed to a size large enough to satisfy the following condition.
- This condition means that when moving from TS1 to the next TS2 that is seamlessly connected to this, after the last transport packet of TS1 is input, the first decoded picture (I picture) of TS2 is decoded. It is possible to input to the video buffer by evening.
- the maximum required audio buffer capacity is as follows: Value. In other words, it is a size that can store the amount of audio data having a length corresponding to "the time during which the maximum bit amount of an I picture can be input to the video buffer before its decoding timing".
- the maximum required amount of audio buffer EBnjnax can be calculated by the following equation (11).
- EBn_max (I_max / Rv) * Ra [bits] ⁇ ⁇ ⁇ (1 1)
- max is the maximum bit amount of the I picture, and this is the size of the video code buffer EB1 shown in Fig. 8.
- Size. Rv is the input bit rate to the video code buffer EB1.
- I is the bit rate of the audio stream.
- the size of the audio buffer to be obtained, EBn-max is determined by changing the buffer occupancy of the video code buffer EB1 from zero at the input bit rate to the video elementary stream buffer (EB1). It is the value obtained by multiplying the time required to reach I_max (Lmax / Rv) by Ra.
- the bit size of the I picture is generally 10% or less of the encoding bit rate. For example, if the coding pit rate is 10 Mbps, the size of the I picture is usually less than or equal to I Mbits.
- the I-picture can be input to the video buffer of the decoder 20 by its decode timing.
- the audio buffer of the decoder 20 can store 100 milliseconds of audio data
- the audio data can be stored 100 millimeters faster than the evening when it is reproduced.
- TS 1 can be multiplexed so that the input to the audio buffer is completed as early as seconds. Therefore, assuming that the audio buffer has a buffer size capable of storing at least 100 milliseconds of audio data, for the first and second reasons, transfer from TS1 to the next TS2 that is seamlessly connected to it.
- the first decoded picture of TS2 At least 100 milliseconds can be secured as the time until the input of the character (I picture) to the video buffer (code timing of the I picture).
- the following specifically calculates the capacity of the audio buffer that can store audio data for 100 milliseconds.
- the size of the audio buffer of the DVR-STD is changed by inputting the first decoded picture (I picture) of TS2 to the video buffer by the decoding timing.
- the effect of changing to “a size that can store the amount of audio data corresponding to the available time” will be described in further detail with reference to FIGS. 14 and 15.
- FIG. 14 and 15 The effect of changing to “a size that can store the amount of audio data corresponding to the available time” will be described in further detail with reference to FIGS. 14 and 15.
- the audio stream is an AC3 audio stream with a bit rate of 640 kbps and a sampling frequency of 48 kHz. Since the number of samples in one audio frame of the AC3 audio stream is 1,536 samples, the time length is 32 milliseconds. The byte size of one audio frame is 2560 Bytes.
- FIG. 7 is a graph showing an example of a change in bit occupancy of a video buffer and an audio buffer of the DVR-STD of FIG.
- the buffer transition of the video / audio data of TS1 is indicated by a broken line
- the buffer transition of the video / audio data of TS2 is indicated by a solid line.
- the 4 kBytes audio buffer stores 50 milliseconds of audio data overnight. Therefore, the time when the last byte of the last audio packet of TS 1 arrives at DVR-STD In TS1, TS1 can be multiplexed so that the input of audio data at the end of the playback is terminated 50 ms earlier. However, 50 milliseconds is not enough to input the first decoded picture (I-picture) of TS2 to the video buffer before its decoding timing. For this reason, encoding is restricted so that the size of the picture (I picture) decoded first in TS2 is reduced, and there is a problem that the image quality is deteriorated.
- the amount of time that can be advanced is 50 ms, so the time required to input the first I picture of TS2 to the video buffer is shown in Fig. 14.
- the start-up delay (t1) is small, up to 50 ms. Therefore, the time for inputting the first I picture of TS2 cannot be taken long enough, and the I picture size S1 becomes small, encoding is restricted, and the picture quality of the I picture deteriorates.
- an additional buffer of about 1 second is provided in addition to 4 kBytes, and the maximum rate of TS is between T1 and T2. I had to type in RMAX.
- the AC3 audio stream with a bit rate of 640 kbps is described. Becomes extremely large.
- the audio buffer size of the DVR-STD is changed to, for example, 8 kBytes as in the decoder 20 in the present embodiment.
- (A) and (b) in Fig. 15 show an example of optimizing the capacity of the audio buffer.
- the audio buffer size is 8 kBytes
- the next connection from TS1 to this is seamless.
- FIG. 8 is a graph showing an example of a change in the bit occupancy of the video buffer and the audio buffer of the DVR-STD according to the present embodiment when shifting to TS2 of FIG.
- the transition of the video / audio data buffer of TS1 is indicated by a broken line
- the transition of the video Z audio data of TS2 is indicated by a solid line.
- the 8 kBytes audio buffer can store 100 milliseconds of audio data. Therefore, the last byte of the last audio packet of TS1 is DVR-ST
- TS1 is multiplexed so that the input of the audio data ends 100 ms earlier than the timing at which it is played. Wear. At least 100 ms, first decoded picture of TS2
- I picture can be input to the video buffer before the decoding timing. That is, the time for inputting the first I picture of TS2 (startup delay) t2 can be sufficient, so that the first decoded picture (I picture) size S2 of TS2 can be increased, Therefore, the I-picture encoding restrictions are small and high picture quality can be achieved.
- the data is composed of a data string in units of a source packet having a transport packet and its corresponding time stamp, and is read and decoded by a decoder based on the corresponding time stamp.
- the generated TS can be generated and recorded in the multiplexing device (information processing device).
- the multiplexing device generates a re-encoded Clipl (first video encoded stream) so as to end the display at a predetermined picture, as described with reference to FIGS. 4 to 6 above, for example.
- a video encoding unit that generates a Cl ip2 (second video encoded stream) that is displayed following this picture and that is re-encoded so that display can be started;
- a multiplexing unit that multiplexes the coded stream to generate TS1, multiplexes Cl ip2 and an audio coding stream synchronized with the Cl ip2 to generate TS2, and a multiplexing unit including TS1 and TS2.
- a recording unit for recording the encrypted stream.
- the input of the I picture, which is the second picture, to the video buffer of the decoder 20 is started by inputting audio data for the time required for the I picture to the decoder 20. Multiplex so that input can be completed before.
- the coding unit may generate the Bridge-Clip and the multiplexing unit may also multiplex the Bridge-Clip.
- the recording medium on which the multiplexed stream generated by such a multiplexing device is recorded includes a TS 1 that ends the display with the first picture and a second picture that is reproduced following the first picture.
- the first picture of TS2, which is generated from the display start TS2 can be input to the decoder 20 based on the respective competition time stamps, and the first picture of the second picture TS2 is input to the decoder.
- the time it takes to A multiplexed stream is recorded in which the audio data of the minutes is multiplexed so that the input can be completed before the input of the second picture to the decoder 20 is started.
- the TS 1 and the TS 2 are transmitted after the last video packet of the TS 1 has been input to the TB I of the decoder 20. Even before the rest of the packets enter the decoder 20, the transport packets are entered according to the arrival time stamp, and the size of the audio buffer is reduced from 4 kBytes in the conventional DVR-STD to I picture.
- the last bucket of TS 1 is input Is the time (start-up delay) for inputting the I picture, which is the first picture of TS2, up to its decoding timing. It is possible to correspondingly ensure, comb small the coding constraints of the I-picture can be a high quality.
- the capacity of the audio buffer is changed as described above, and the transport packets are input in accordance with the arrival time stamp. It is possible to eliminate the need for a special buffer.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Television Signal Processing For Recording (AREA)
- Television Systems (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP04728909.5A EP1515553B1 (en) | 2003-05-08 | 2004-04-22 | Information processing device and method, program, and recording medium |
US10/520,488 US8086087B2 (en) | 2003-05-08 | 2004-04-22 | Information processing device and method, program, and recording medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2003-130661 | 2003-05-08 | ||
JP2003130661A JP4902935B2 (ja) | 2003-05-08 | 2003-05-08 | 情報処理装置、情報処理方法、プログラム、及び記録媒体 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004100545A1 true WO2004100545A1 (ja) | 2004-11-18 |
Family
ID=33432113
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2004/005782 WO2004100545A1 (ja) | 2003-05-08 | 2004-04-22 | 情報処理装置及び方法,並びにプログラム及び記録媒体 |
Country Status (7)
Country | Link |
---|---|
US (1) | US8086087B2 (ja) |
EP (1) | EP1515553B1 (ja) |
JP (1) | JP4902935B2 (ja) |
KR (1) | KR100975175B1 (ja) |
CN (1) | CN100380959C (ja) |
TW (1) | TWI246330B (ja) |
WO (1) | WO2004100545A1 (ja) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070226432A1 (en) * | 2006-01-18 | 2007-09-27 | Rix Jeffrey A | Devices, systems and methods for creating and managing media clips |
KR100788685B1 (ko) * | 2006-03-10 | 2007-12-26 | 삼성전자주식회사 | 데이터 스트림 포맷의 변환 방법 및 장치, 이를 이용한데이터 스트림 기록 방법 및 장치 |
KR101488729B1 (ko) * | 2008-05-13 | 2015-02-06 | 삼성전자주식회사 | 디지털 방송 송신장치 및 수신장치와 그 방법들 |
JP4719250B2 (ja) * | 2008-06-20 | 2011-07-06 | 株式会社ソニー・コンピュータエンタテインメント | 画面記録装置、画面記録方法、画面記録プログラム及び情報記憶媒体 |
US8817725B2 (en) * | 2012-01-13 | 2014-08-26 | Blackberry Limited | Scheduling transmission of traffic treated less preferentially despite timing requirements |
US20130336379A1 (en) * | 2012-06-13 | 2013-12-19 | Divx, Llc | System and Methods for Encoding Live Multimedia Content with Synchronized Resampled Audio Data |
US10231001B2 (en) | 2016-05-24 | 2019-03-12 | Divx, Llc | Systems and methods for providing audio content during trick-play playback |
SE541208C2 (en) * | 2016-07-04 | 2019-04-30 | Znipe Esports AB | Methods and nodes for synchronized streaming of a first and a second data stream |
US10148722B2 (en) * | 2016-07-04 | 2018-12-04 | Znipe Esports AB | Methods and nodes for synchronized streaming of a first and a second data stream |
KR102090070B1 (ko) * | 2018-10-31 | 2020-03-17 | 카테노이드 주식회사 | 스트리밍 서버, 클라이언트 단말 및 이를 이용한 av 라이브 스트리밍 시스템 |
JP6992104B2 (ja) * | 2020-02-26 | 2022-01-13 | 株式会社Jストリーム | コンテンツ編集装置、コンテンツ編集方法およびコンテンツ編集プログラム |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000057691A (ja) | 1998-08-10 | 2000-02-25 | Sony Corp | システムターゲットデコーダのオーディオバッファ |
JP2000175152A (ja) | 1998-12-04 | 2000-06-23 | Sony Corp | 多重化装置、多重化方法及び記録媒体 |
JP2000183957A (ja) * | 1998-12-16 | 2000-06-30 | Samsung Electronics Co Ltd | デ―タ列間の連続再生を保障するための付加情報生成方法、この情報を貯蔵する記録媒体及び記録、編集及び/または再生装置 |
JP2001054118A (ja) | 1999-06-01 | 2001-02-23 | Sony Corp | 符号化装置及び符号化方法並びに多重化装置及び多重化方法 |
JP2002158974A (ja) * | 2000-04-21 | 2002-05-31 | Sony Corp | 情報処理装置および方法、記録媒体、プログラム、並びに記録媒体 |
JP2003130661A (ja) | 2001-10-24 | 2003-05-08 | Matsushita Electric Ind Co Ltd | ナビゲーション装置 |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB9424429D0 (en) * | 1994-12-02 | 1995-01-18 | Philips Electronics Uk Ltd | Audio/video timing discrepancy management |
EP0847198B1 (en) * | 1995-09-29 | 1999-04-28 | Matsushita Electric Industrial Co., Ltd. | Method, device and disc for recording and reproducing interleaved bit stream on and from the disk |
US6181383B1 (en) * | 1996-05-29 | 2001-01-30 | Sarnoff Corporation | Method and apparatus for preserving synchronization of audio and video presentation when splicing transport streams |
GB9813831D0 (en) * | 1998-06-27 | 1998-08-26 | Philips Electronics Nv | Frame-accurate editing of encoded A/V sequences |
EP0971542A3 (en) * | 1998-07-10 | 2001-08-01 | Tektronix, Inc. | Readjustment of bit rates when switching between compressed video streams |
US6507592B1 (en) * | 1999-07-08 | 2003-01-14 | Cisco Cable Products And Solutions A/S (Av) | Apparatus and a method for two-way data communication |
GB9930788D0 (en) * | 1999-12-30 | 2000-02-16 | Koninkl Philips Electronics Nv | Method and apparatus for converting data streams |
US6678332B1 (en) * | 2000-01-04 | 2004-01-13 | Emc Corporation | Seamless splicing of encoded MPEG video and audio |
GB0007868D0 (en) * | 2000-03-31 | 2000-05-17 | Koninkl Philips Electronics Nv | Methods and apparatus for editing digital video recordings and recordings made by such methods |
JP4599740B2 (ja) * | 2000-04-21 | 2010-12-15 | ソニー株式会社 | 情報処理装置および方法、記録媒体、プログラム、並びに記録媒体 |
JP4355988B2 (ja) * | 2000-04-21 | 2009-11-04 | ソニー株式会社 | 情報処理装置、情報処理方法、プログラム記録媒体、プログラム、および情報記録媒体 |
JP4682434B2 (ja) * | 2000-04-21 | 2011-05-11 | ソニー株式会社 | 情報処理装置および方法、記録媒体、並びにプログラム |
EP1198133A4 (en) * | 2000-04-21 | 2004-10-06 | Sony Corp | INFORMATION PROCESSING DEVICE AND METHOD, PROGRAM AND RECORDED MEDIUM |
EP1164796B1 (en) * | 2000-06-14 | 2006-12-20 | Eads Astrium Sas | Process and system for video on demand |
WO2003019512A2 (en) * | 2001-08-22 | 2003-03-06 | Gary Alfred Demos | Method and apparatus for providing computer-compatible fully synchronized audio/video information |
US7386223B2 (en) * | 2001-11-30 | 2008-06-10 | Matsushita Electric Industrial Co., Ltd. | Method and an apparatus for stream conversion a method and an apparatus for data recording and data recording medium |
WO2004019530A1 (en) * | 2002-02-15 | 2004-03-04 | Visible World, Inc. | System and method for seamless switching through buffering |
-
2003
- 2003-05-08 JP JP2003130661A patent/JP4902935B2/ja not_active Expired - Lifetime
-
2004
- 2004-04-22 CN CNB2004800004774A patent/CN100380959C/zh not_active Expired - Lifetime
- 2004-04-22 WO PCT/JP2004/005782 patent/WO2004100545A1/ja active Application Filing
- 2004-04-22 US US10/520,488 patent/US8086087B2/en active Active
- 2004-04-22 EP EP04728909.5A patent/EP1515553B1/en not_active Expired - Lifetime
- 2004-04-22 KR KR20057000185A patent/KR100975175B1/ko not_active IP Right Cessation
- 2004-05-06 TW TW93112789A patent/TWI246330B/zh not_active IP Right Cessation
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000057691A (ja) | 1998-08-10 | 2000-02-25 | Sony Corp | システムターゲットデコーダのオーディオバッファ |
JP2000175152A (ja) | 1998-12-04 | 2000-06-23 | Sony Corp | 多重化装置、多重化方法及び記録媒体 |
JP2000183957A (ja) * | 1998-12-16 | 2000-06-30 | Samsung Electronics Co Ltd | デ―タ列間の連続再生を保障するための付加情報生成方法、この情報を貯蔵する記録媒体及び記録、編集及び/または再生装置 |
JP2001054118A (ja) | 1999-06-01 | 2001-02-23 | Sony Corp | 符号化装置及び符号化方法並びに多重化装置及び多重化方法 |
JP2002158974A (ja) * | 2000-04-21 | 2002-05-31 | Sony Corp | 情報処理装置および方法、記録媒体、プログラム、並びに記録媒体 |
JP2003130661A (ja) | 2001-10-24 | 2003-05-08 | Matsushita Electric Ind Co Ltd | ナビゲーション装置 |
Also Published As
Publication number | Publication date |
---|---|
EP1515553B1 (en) | 2015-06-03 |
US8086087B2 (en) | 2011-12-27 |
JP4902935B2 (ja) | 2012-03-21 |
TW200425740A (en) | 2004-11-16 |
JP2004336488A (ja) | 2004-11-25 |
CN100380959C (zh) | 2008-04-09 |
KR20060009807A (ko) | 2006-02-01 |
KR100975175B1 (ko) | 2010-08-10 |
CN1698374A (zh) | 2005-11-16 |
TWI246330B (en) | 2005-12-21 |
EP1515553A4 (en) | 2012-05-09 |
EP1515553A1 (en) | 2005-03-16 |
US20050249202A1 (en) | 2005-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6795499B1 (en) | Encoding apparatus and method, and multiplexing apparatus and method | |
JP3900050B2 (ja) | データ処理装置、ビデオカメラ及びデータ処理方法 | |
JP2002158974A (ja) | 情報処理装置および方法、記録媒体、プログラム、並びに記録媒体 | |
JP2002158972A (ja) | 情報処理装置および方法、記録媒体、プログラム、並びに記録媒体 | |
KR20030012761A (ko) | 데이터 다중화 방법, 데이터 기록 매체, 데이터 기록 장치및 데이터 기록 프로그램 | |
KR20020020916A (ko) | 정보 처리 장치와 방법, 프로그램 및 기록매체 | |
JP2002519917A (ja) | 符号化されたav系列のフレームの正確な編集 | |
JP2010061803A (ja) | オーディオ/ビデオ記録装置、記録方法、再生装置、再生方法、再生プログラム及び記録プログラム | |
JP2006260762A (ja) | データ列間の連続再生を保障するための付加情報生成方法、この情報を貯蔵する記録媒体及び記録、編集及び/または再生装置 | |
US7974281B2 (en) | Multiplexer and multiplexing method, program, and recording medium | |
WO2004100545A1 (ja) | 情報処理装置及び方法,並びにプログラム及び記録媒体 | |
JP2002159004A (ja) | 符号化装置および方法、記録媒体、並びにプログラム | |
JP3918332B2 (ja) | 多重化装置、多重化方法及び記録媒体 | |
KR100677110B1 (ko) | 데이터열간의 연속 재생을 보장하는 데이터의 기록및/또는 편집 장치 | |
KR100657262B1 (ko) | 데이터열간의 연속 재생을 보장하기 위한 부가 정보를저장하는 기록 매체 | |
KR100532113B1 (ko) | 데이터열간의 연속 재생을 보장하는 데이터의 기록및/또는 재생 장치 | |
Kelly et al. | Virtual editing of MPEG-2 streams | |
JP2009100303A (ja) | 動画像符号化装置およびそのデータ処理方法 | |
WO2004086397A1 (en) | Method and apparatus for guaranteeing seamless reproduction of a plurality of data streams |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004728909 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020057000185 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10520488 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20048004774 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 2004728909 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1020057000185 Country of ref document: KR |