WO2004030358A1

WO2004030358A1 - Data processing device

Info

Publication number: WO2004030358A1
Application number: PCT/JP2003/012449
Authority: WO
Inventors: Masanori Ito; Hiroshi Yahata
Original assignee: Matsushita Electric Industrial Co., Ltd.
Priority date: 2002-09-30
Filing date: 2003-09-29
Publication date: 2004-04-08
Also published as: AU2003266691A1; JPWO2004030358A1; JP4566745B2

Abstract

A data processing device includes: a signal input section for receiving at least one of the video and audio signal containing a plurality of frames; a compression section for compressing/encoding the received signal and generating encoded data; and a stream constructing section for dividing the encoded data to generate at least one packet having a predetermined data size and adding a control packet containing information on controlling reproduction of the encoded data so as to generate a data stream. The stream constructing section generates position information indicating the storage position of each frame in the data stream and describes it as auxiliary information in the control packet. Thus, it is possible to obtain a program stream having a data structure which can easily be converted, for example, into a transport stream.

Description

Data processing device

Technical field

The present invention relates to a data processing apparatus and method for recording a moving image data stream in real time on a recording medium such as an optical disk.

Background art

Various data streams have been standardized that compress and encode video and audio signals at low bit rates. As an example of such a data stream, the system stream of the MPEG2 system standard (ISO/IEC 13818-1) is known. The system stream includes three types: a program stream (PS), a transport stream (TS), and a PES stream.

In recent years, optical disks such as phase-change optical disks and MOs have been attracting attention as recording media for recording data streams in place of magnetic tapes. Currently, the "Video Recording Standard" (DVD Specifications for Rewritable/Re-recordable Discs Part 3 VIDEO RECORDING version 1.0 September 1999) is specified as a standard for recording a data stream on a phase-change optical disk.

FIG. 1 shows the functional block configuration of a conventional data processing device 90. The data processing device 90 can record a data stream in real time on a phase-change optical disk 131, such as a DVD-RAM disk or a Blu-ray disk (BD), and can reproduce the recorded data stream.

The data recording process of the data processing device 90 is performed as follows. First, a video signal input to the video signal input unit 100 is compression-encoded by the video compression unit 101. At the same time, the audio signal input to the audio signal input unit 102 is compression-encoded by the audio compression unit 103. The program stream assembling section 244 multiplexes them to generate an MPEG2 program stream (hereinafter referred to as "program stream" or "PS"). These processes are performed in the MPEG2-PS encoder 170. Next, the recording unit 120 and the pickup 130 write the generated PS on the optical disk 131. At this time, physically continuous free areas on the optical disk 131 are detected through the processing of the recording control section 161, the continuous data area detection section 160, the logical block management section 163, and so on, and the PS is recorded there.

The reproduction process of the data stream is performed as follows. The program stream decomposing unit 114 separates the program stream reproduced via the pickup 130 and the reproducing unit 121 into a video signal and an audio signal. The video decompression unit 111 and the audio decompression unit 113 decode the video and audio signals, respectively, and the resulting video and audio data are displayed and output by the video display unit 110 and the audio output unit 112.

FIG. 2 shows an example of the data structure of the program stream 20. The program stream 20 includes a plurality of video object units (VOBUs) 21. Each VOBU 21 includes a plurality of video packs (V-PCK) 22 storing video data and a plurality of audio packs (A-PCK) storing audio data. One VOBU holds data corresponding to 0.4 to 1 second of video playback time. The video pack 22 includes a pack header 22a, a packet header 22b, and compressed video data 22c. The audio pack, on the other hand, includes audio data instead of the video data 22c of the video pack 22. Note that the data size of one VOBU fluctuates within the range permitted by the maximum recording/reproduction rate if the video data has a variable bit rate. If the video data has a fixed bit rate, the VOBU data size is almost constant. Generally, a "pack" is known as one exemplary form of a packet.

FIG. 3 shows the relationship between the program stream 20 and the recording area of the optical disk 131. The VOBUs of the program stream 20 are recorded in the continuous data area 24 of the optical disk 131. The continuous data area 24 is composed of physically continuous logical blocks. This area holds data corresponding to 17 seconds or more of playback at the maximum rate. The data processing device 90 assigns an error correction code to each logical block. The data size of a logical block is 32 kbytes. Each logical block contains 16 sectors of 2 kbytes each.
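The sector and logical block sizes above imply a simple address arithmetic, which the following minimal sketch illustrates. The function names are hypothetical, and it assumes that "kbyte" means 1024 bytes; neither is taken from the specification.

```python
# Sketch of the block/sector arithmetic described above.
# Assumption: 1 kbyte = 1024 bytes. Function names are illustrative only.
SECTOR_SIZE = 2 * 1024                          # one 2-kbyte sector
SECTORS_PER_BLOCK = 16                          # sectors per logical block
BLOCK_SIZE = SECTOR_SIZE * SECTORS_PER_BLOCK    # 32 kbytes, the ECC unit


def blocks_needed(n_bytes: int) -> int:
    """Number of logical blocks needed to hold n_bytes (rounded up)."""
    return -(-n_bytes // BLOCK_SIZE)


def locate_sector(sector_number: int) -> tuple:
    """Map a sector number to (logical block index, sector index in block)."""
    return divmod(sector_number, SECTORS_PER_BLOCK)


if __name__ == "__main__":
    print(BLOCK_SIZE)                 # size of one logical block in bytes
    print(blocks_needed(100_000))     # blocks for a 100,000-byte VOBU
    print(locate_sector(35))          # block and in-block sector for sector 35
```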

FIG. 4 shows how the recorded data is managed in the file system of the optical disk 131. For example, a file system conforming to the UDF (Universal Disk Format) standard or to ISO/IEC 13346 (Volume and file structure of write-once and rewritable media using non-sequential recording for information interchange) is used. In FIG. 4, a continuously recorded program stream is recorded under the file name VR_MOVIE.VRO. The start sector number is set as the position of the file entry that constitutes the file. The file entry includes allocation descriptors a to c for managing each of the continuous data areas (CDA: Contiguous Data Area) a to c. One file is divided into a plurality of areas a to c because a bad logical block, a non-writable PC file, or the like existed in the middle of the area a.

The UDF standard is equivalent to a subset of the ISO/IEC 13346 standard. Also, by connecting the optical disk drive (data processing device 90) to a PC or the like via the 1394 interface and SBP-2 (Serial Bus Protocol 2), the recorded file can be handled from the PC as a single file.

FIG. 5 shows an example of the data structure of a program stream 25 conforming to the DVD-VR standard. The difference between program stream 25 and program stream 20 is that an RDI pack (RDI-PCK) 27 is always added at the beginning of each VOBU 26 of program stream 25. The RDI pack 27 contains control information for controlling playback of the PS, and includes a pack header 27a, a system header 27b, a PES header 27c, RDI data 27d, and a manufacturer extension field 27e. Manufacturer-specific information (manufacturer extension information) can be described in the manufacturer extension field 27e.

The data processing device 90 records and reproduces the program streams 20 and 25, and can also output the data stream from the IEEE 1394 interface unit 140 to a D-VHS deck, a set-top box (STB), or the like. However, since the IEEE 1394 interface standard specifies only the MPEG2 transport stream (hereinafter referred to as "transport stream" or "TS") as a video synchronous communication protocol, the data processing device 90 needs to convert the PS into a TS.

Hereinafter, the transport stream will be described. FIG. 6 shows an example of the data structure of the transport stream 28. The TS 28 includes a plurality of TS object units (TOBUs) 29, and each TOBU 29 is composed of one or more transport packets (TS packets). TS packets are, for example, a video TS packet (V-TSP) 30 containing compressed video data, an audio TS packet (A-TSP) 31 containing compressed audio data, a packet (PAT-TSP) 32 containing a program association table (PAT), a packet (PMT-TSP) 33 containing a program map table (PMT), and a packet (PCR-TSP) 34 containing a program clock reference (PCR).

FIGS. 7 (a) to 7 (e) show examples of the data structure of each TS packet. As shown in the TS packet headers 30a to 34a, a different packet identifier (PID) is assigned according to the type of the TS packet. In the transport stream, different PIDs are given to the data packets of each of a plurality of programs, so only the TS packets of the required program can be acquired by using the PIDs. For example, the procedure for obtaining V-TSP 30 and A-TSP 31 is as follows. First, the PAT-TSP 32, whose PID is "0x0000", is obtained. The PAT 32c in the PAT-TSP 32 describes the PID of the PMT-TSP. From it, the PID ("0x0303") of the PMT-TSP 33 is acquired. The PMT 33c in the PMT-TSP 33 describes the PID ("0x0020") of V-TSP 30 and the PID ("0x0021") of A-TSP 31. By obtaining the packets to which those PIDs are assigned, V-TSP 30 and A-TSP 31 are obtained.
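The PID lookup chain described above (PAT, then PMT, then the video and audio packets) can be sketched as follows. This is an illustrative sketch only: the PID is read from the standard 13-bit field of the 4-byte TS packet header, but the PAT and PMT contents are modeled as plain dictionaries instead of being parsed from real table sections, and `make_packet` and `select_program` are hypothetical helpers.

```python
# Sketch of PID-based packet selection. Assumptions: PAT/PMT contents are
# given as dictionaries (real streams carry them as PSI sections), and the
# helper names are illustrative, not from the specification.
TS_PACKET_SIZE = 188
SYNC_BYTE = 0x47


def pid_of(packet: bytes) -> int:
    """Extract the 13-bit PID from a 188-byte TS packet header."""
    if len(packet) != TS_PACKET_SIZE or packet[0] != SYNC_BYTE:
        raise ValueError("not a valid TS packet")
    return ((packet[1] & 0x1F) << 8) | packet[2]


def make_packet(pid: int, payload: bytes = b"") -> bytes:
    """Build a payload-only TS packet for demonstration purposes."""
    header = bytes([SYNC_BYTE, (pid >> 8) & 0x1F, pid & 0xFF, 0x10])
    return header + payload.ljust(184, b"\xff")


def select_program(packets, pat: dict, pmts: dict, program_number: int):
    """Follow PAT -> PMT -> elementary PIDs and keep only matching packets."""
    pmt_pid = pat[program_number]            # the PAT gives the PMT's PID
    wanted = set(pmts[pmt_pid].values())     # the PMT gives video/audio PIDs
    return [p for p in packets if pid_of(p) in wanted]


if __name__ == "__main__":
    pat = {1: 0x0303}                                     # program 1 -> PMT PID
    pmts = {0x0303: {"video": 0x0020, "audio": 0x0021}}   # PMT contents
    stream = [make_packet(pid) for pid in (0x0000, 0x0303, 0x0020, 0x0021, 0x0040)]
    for p in select_program(stream, pat, pmts, 1):
        print(hex(pid_of(p)))
```

The packet with PID 0x0040 in the usage example stands in for a packet of some other program, which the filter discards.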

The transport stream is recorded on a recording medium such as a BD while maintaining its packet structure. For example, the pamphlet of WO 01/04893 describes a technique for recording a transport stream of MPEG2 video on a recording medium. In addition, in some cases an arrival time stamp is added to a received transport stream, which is then recorded on a recording medium. FIGS. 8 (a) to 8 (c) show the data structure of a TS 35 with time stamps. FIG. 8 (a) shows a video TS packet (V-TSPT) 37 and an audio TS packet (A-TSPT) 38 stored in an object unit (TTOBU) 36. FIGS. 8 (b) and 8 (c) show the data structures of the video TS packet 37 and the audio TS packet 38, respectively. In each, the first four bytes are the time stamp 37a or 38a, and the following 188 bytes are the received TS packet.
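A time-stamped stream of this kind is a sequence of 192-byte records, each consisting of a 4-byte time stamp and a 188-byte TS packet. The sketch below splits such a stream into its records. The interpretation of the 4 bytes as a big-endian unsigned integer is an assumption made for illustration; the text does not specify the internal layout of the time stamp.

```python
# Sketch: split a time-stamped TS into (time stamp, TS packet) pairs.
# Assumption: the 4-byte time stamp is read as a big-endian unsigned
# integer; the specification does not state its bit layout.
import struct

TIMESTAMP_SIZE = 4
TS_PACKET_SIZE = 188
RECORD_SIZE = TIMESTAMP_SIZE + TS_PACKET_SIZE   # 192 bytes per record


def split_timestamped(stream: bytes):
    """Yield (timestamp, ts_packet) for every 192-byte record."""
    if len(stream) % RECORD_SIZE != 0:
        raise ValueError("stream length is not a multiple of 192 bytes")
    for offset in range(0, len(stream), RECORD_SIZE):
        record = stream[offset:offset + RECORD_SIZE]
        (timestamp,) = struct.unpack(">I", record[:TIMESTAMP_SIZE])
        yield timestamp, record[TIMESTAMP_SIZE:]


if __name__ == "__main__":
    packet = bytes([0x47]) + bytes(187)          # dummy 188-byte TS packet
    stream = struct.pack(">I", 1000) + packet + struct.pack(">I", 2500) + packet
    for ts, pkt in split_timestamped(stream):
        print(ts, len(pkt))
```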

When the program stream is converted into a transport stream in the data processing device 90, the PS → PES converter 243 first converts the PS into a packetized elementary stream (PES), and then the PES → TS conversion unit 242 converts the PES into a TS. Such processing is described in, for example, Japanese Patent Application Laid-Open No. 10-243394.

FIGS. 9 (a) to 9 (c) show the stream correspondences when PS 39 is converted to a general TS 41 via PES 40. Each PES packet of the PES 40 stores one frame of data. In the general TS 41 as well, one frame of data is stored in one PES packet. For simplicity of explanation, the figure shows only the processing for video data. The PS 39 is a stream conforming to, for example, the DVD-VR standard.

First, as shown in FIG. 9 (a), the video data portions 39a to 39c are extracted from the video packs of PS 39. These are grouped in units of one video frame to constitute the payload 40b of the video PES packet shown in FIG. 9 (b). When the PES header 40a is added to the payload 40b, one video PES packet of the PES 40 is obtained.

A video TS packet of TS 41 in FIG. 9 (c) is constructed, for example, by adding a 4-byte TS header 41a to data 41b obtained by dividing PES 40 into units of 184 bytes. TS packets (not shown) including a PAT, a PMT, and the like are newly generated and inserted into the TS 41.
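The division of a PES packet into 184-byte units, each prefixed with a 4-byte TS header, can be sketched as follows. The handling of the final, shorter chunk is not described in the text; this sketch fills it with adaptation-field stuffing as defined by the MPEG2 systems standard, which is an assumption about how a converter might do it. The function name `pes_to_ts` is hypothetical.

```python
# Sketch: packetize one PES packet into 188-byte TS packets.
# Assumption: the last chunk is padded with adaptation-field stuffing
# (standard MPEG2 TS practice); the text itself only describes the
# 4-byte header + 184-byte data layout.
SYNC_BYTE = 0x47
PAYLOAD_SIZE = 184


def pes_to_ts(pes: bytes, pid: int) -> list:
    """Split a PES packet into TS packets carrying the given PID."""
    packets = []
    counter = 0                                   # 4-bit continuity counter
    for offset in range(0, len(pes), PAYLOAD_SIZE):
        chunk = pes[offset:offset + PAYLOAD_SIZE]
        start_flag = 0x40 if offset == 0 else 0x00    # payload_unit_start
        byte1 = start_flag | ((pid >> 8) & 0x1F)
        byte2 = pid & 0xFF
        if len(chunk) == PAYLOAD_SIZE:
            # Payload only (adaptation_field_control = '01').
            header = bytes([SYNC_BYTE, byte1, byte2, 0x10 | counter])
            body = chunk
        else:
            # Adaptation field plus payload ('11'); stuff to 184 bytes.
            af_length = PAYLOAD_SIZE - 1 - len(chunk)
            adaptation = bytes([af_length])
            if af_length > 0:
                adaptation += b"\x00" + b"\xff" * (af_length - 1)
            header = bytes([SYNC_BYTE, byte1, byte2, 0x30 | counter])
            body = adaptation + chunk
        packets.append(header + body)
        counter = (counter + 1) & 0x0F
    return packets


if __name__ == "__main__":
    ts_packets = pes_to_ts(bytes(range(256)) + bytes(144), pid=0x0020)
    print(len(ts_packets), {len(p) for p in ts_packets})
```

A complete converter would also interleave the newly generated PAT, PMT, and PCR packets mentioned above, which this sketch omits.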

However, the conventional device has a problem that it takes a very long time to convert a program stream into a transport stream.

The first reason is that, in the conventional conversion processing, the time information (program clock reference; PCR) used as the reference for the operation of the decoder must be newly added to the transport stream. In other words, the transport stream must be constructed as a multiplexed stream of audio/video data conforming to the transport stream system target decoder model (T-STD), which requires changing the multiplexing and calculating the value of the program clock reference (PCR). Because the stream-specific packets of the transport stream (PAT-TSP, PMT-TSP, PCR-TSP, etc.) must be inserted at intervals of, for example, 100 milliseconds or less, the timing and order of multiplexing the audio/video data have to be changed.

The second reason is that, when one frame of data is to be stored in the PES packet payload of the transport stream, each pack of the PS must be scanned during the conversion from PS to PES to search for the boundary of each frame. For example, when converting a program stream conforming to the DVD-VR standard to a general transport stream in which one frame of data is stored in one PES packet, all audio/video streams must be scanned to detect the start position and data size of each audio/video frame, and a PES packet header must be added at the start position of each frame to form a PES packet. Further, when forming the PES packets, the PTS and DTS, which are not always attached to every audio/video frame in the PS, must be added to the PES packet headers of all audio/video frames.

The present invention has been made in view of the above problems, and has as its object to provide a program stream having a data structure that can be easily converted to a transport stream.

Disclosure of the invention

A data processing apparatus according to the present invention includes: a signal input unit to which at least one of a video signal and an audio signal including a plurality of frames is input; a compression unit that compression-encodes the input signal to generate encoded data; and a stream assembling unit that divides the encoded data to generate one or more packets having a fixed data size, and adds a control packet storing control information for controlling reproduction of the encoded data to generate a data stream. The stream assembling unit generates, for each frame, position information indicating its storage position in the data stream, and describes the position information in the control packet as the control information.

In a preferred embodiment, the stream assembling unit further generates time information indicating a display timing of each frame, and describes the time information as the control information in the control packet.

In a preferred embodiment, the stream assembling unit further generates time information indicating the timing of decoding of each frame, and describes the time information in the control packet as the control information.

In a preferred embodiment, the stream assembling unit further generates size information indicating a data size of each frame, and describes the size information in the control packet as the control information.

In a preferred embodiment, the stream assembling unit describes the control information in a field in the control packet in which arbitrary information can be described.

In a preferred embodiment, the stream assembling unit generates a control pack as the control packet, and adds the control pack to generate a program stream as the data stream.

The data processing method according to the present invention includes the steps of: inputting at least one of video and audio signals including a plurality of frames; compression-encoding the signal to generate encoded data; dividing the encoded data to generate one or more packets having a fixed data size; and adding a control packet storing control information for controlling the reproduction of the encoded data to generate a data stream. The step of generating a data stream includes generating, for each frame, position information indicating its storage position in the data stream, and describing the position information in the control packet as the control information.

In a preferred embodiment, the step of generating the data stream further includes a step of generating time information indicating a display timing of each frame and describing the time information as the control information in the control packet.

In a preferred embodiment, the step of generating the data stream further includes a step of generating time information indicating the timing of decoding each frame, and describing the time information as the control information in the control packet.

In a preferred embodiment, the step of generating the data stream further includes a step of generating size information indicating a data size of each frame and describing the size information as the control information in the control packet.

In a preferred embodiment, the step of generating the data stream describes the control information in a field in the control packet in which arbitrary information can be described.

In a preferred embodiment, the step of generating the data stream includes generating a control pack as the control packet, adding the control pack, and generating a program stream as the data stream.

The data processing method according to the present invention is used when converting a first data stream into a second data stream. Here, the first data stream has one or more packets having a first data size, each containing encoded data obtained by compression-encoding at least one of video and audio signals including a plurality of frames, and a control packet in which control information on the reproduction of the encoded data is stored. The second data stream has one or more packets having a second data size different from the first data size. The data processing method includes: a step of extracting, from the control packet, the control information including position information indicating the storage position of each frame in the data stream and time information indicating reproduction timing; a step of determining, based on the control information, whether or not the encoded data in each packet includes the head portion of a frame; and a step of generating the second data stream by, if the encoded data includes the head portion of a frame, adding time information indicating the playback timing of the frame before the head of the frame and dividing the encoded data and storing it in the packets of the second data stream, and, if the head portion of a frame is not included, dividing the encoded data and storing it in the packets.

Another data processing apparatus according to the present invention includes: a signal input unit to which at least one of a video signal and an audio signal including a plurality of frames is input; a compression unit that compression-encodes the input signal to generate encoded data; and a stream assembling unit that divides the encoded data to generate packets having a fixed data size, and arranges the packets to generate a first data stream. The stream assembling unit generates first time information indicating the decoding timing for all packets, further generates, for some of the packets at a predetermined timing, second time information indicating a time delayed from the time of the first time information, and describes one of the first time information and the second time information in each packet.

In a preferred embodiment, the stream assembling unit generates the second time information based on the data size of a packet of a second data stream different from the first data stream.

In a preferred embodiment, the stream assembling unit generates the second time information at a timing within 100 milliseconds.

In a preferred embodiment, the stream assembling unit associates a plurality of packets as one unit based on the video playback time of the data stream, and generates the second time information at the timing at which the video packet at the head of the unit appears.

In a preferred embodiment, the data processing device includes a recording unit that records the first data stream on a recording medium.

In a preferred embodiment, the other data processing device converts the first data stream recorded on the recording medium into the second data stream. The data processing device has a conversion unit that converts each packet of the first data stream into one or more packets of the second data stream. The conversion unit inserts a control packet for specifying the packets that include the encoded data before the packet obtained by converting the packet in which the second time information is described.

Still another data processing device according to the present invention includes: a signal input unit to which at least one of a video signal and an audio signal including a plurality of frames is input; a compression unit that compression-encodes the input signal to generate encoded data; and a stream assembling unit that divides the encoded data to generate packets having a first data size, and arranges the packets to generate a first data stream. The stream assembling unit provides a padding packet in some of the packets, and determines the data size of the padding packet based on the data size of a packet of a second data stream different from the first data stream.

In a preferred embodiment, the stream assembling unit provides a packet having the padding packet once within 100 milliseconds.

In a preferred embodiment, the other data processing device converts the first data stream recorded on the recording medium into the second data stream. The data processing device has a conversion unit that converts each packet of the first data stream into one or more packets of the second data stream. When converting a packet provided with the padding packet, the conversion unit converts the padding packet into a control packet for specifying the packets that include the encoded data.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a block diagram of a conventional data processing apparatus 90.

FIG. 2 is a diagram showing an example of the data structure of the program stream 20.

FIG. 3 is a diagram showing the relationship between the program stream 20 and the recording area of the optical disk 131.

FIG. 4 is a diagram showing a state in which recorded data is managed in the file system of the optical disk 131.

FIG. 5 is a diagram showing an example of the data structure of the program stream 25 conforming to the DVD-VR standard.

FIG. 6 is a diagram showing an example of the data structure of the transport stream 28.

FIG. 7 (a) shows the data structure of V-TSP 30, FIG. 7 (b) shows the data structure of A-TSP 31, FIG. 7 (c) shows the data structure of PAT-TSP 32, FIG. 7 (d) shows the data structure of PMT-TSP 33, and FIG. 7 (e) shows the data structure of PCR-TSP 34.

FIG. 8 (a) is a diagram showing a video TS packet (V-TSPT) 37 and an audio TS packet (A-TSPT) 38 stored in an object unit (TTOBU) 36, and FIGS. 8 (b) and 8 (c) are diagrams illustrating the data structures of the video TS packet 37 and the audio TS packet 38, respectively.

FIGS. 9 (a) to 9 (c) are diagrams showing the stream correspondences when PS 39 conforming to the DVD-VR standard is converted, via PES 40, into a general TS 41 in which one frame of data is stored in one PES packet.

FIG. 10 is a block diagram of the data processing device 10 according to the first embodiment.

FIG. 11A is a block diagram of an MPEG2PS decoder 171 for decoding PS, and FIG. 11B is a block diagram of a TS decoder model for decoding TS.

FIG. 12 is a diagram showing the correspondence between PS 50 generated by the PS assembling unit 104 of the data processing device 10 and TS 55 converted based on PS 50.

FIG. 13 is a diagram showing a data structure of a pack header included in each pack of the PS.

FIG. 14 is a diagram schematically showing the position of the SCR gap.

FIG. 15 is a diagram schematically showing an SCR gap repeatedly provided within 100 ms in the middle of a VOBU.

FIG. 16 is a diagram schematically showing an SCR gap set in a PS compliant with the DVD-VR standard.

FIG. 17 is a diagram showing a processing procedure when the PS assembling unit 104 sets the SCR of the program stream.

FIG. 18 is a diagram showing the procedure of the conversion process from PS to TS executed by the conversion unit 142.

FIG. 19 is a diagram showing the end SCR for each pack.

FIG. 20 is a diagram showing the correspondence between PS 59-1 generated by the PS assembling unit 104 of the data processing device 10 according to Embodiment 2 and TS 59-2 converted based on PS 59-1.

FIG. 21 is a diagram showing the data structure of the padding packet 61 a included in the video pack 61.

FIG. 22 is a diagram schematically showing video packs 65 and 66 including padding packets repeatedly provided within 100 milliseconds.

FIG. 23 (a) is a diagram showing the data structure of a program stream generated by the PS assembling unit 104 according to the present embodiment, and FIG. 23 (b) is a diagram showing the detailed data structure of the video auxiliary information 67d.

FIG. 24 is a diagram schematically showing start addresses of the I frame, the B1 frame, and the B2 frame when the start of VOBU # i is calculated as 0.

FIG. 25 is a diagram illustrating a procedure of a process in which the conversion unit 142 converts PS into TS.

FIG. 26 (a) is a diagram showing the size information specified in the auxiliary information field 70, and FIG. 26 (b) is a diagram showing the data structure of the auxiliary information field 70.

BEST MODE FOR CARRYING OUT THE INVENTION

Hereinafter, embodiments of the data processing device of the present invention will be described with reference to the accompanying drawings.

(Embodiment 1)

FIG. 10 shows the functional block configuration of the data processing device 10 according to the first embodiment of the present invention. The data processing device 10 can record a data stream in real time on a phase-change optical disk 131, such as a DVD-RAM disk or a Blu-ray disk (BD), and can reproduce the recorded data stream.

The data processing device 10 can further convert the generated or recorded program stream (PS) into an MPEG2 transport stream (TS) and output it via the IEEE 1394 interface unit 140.

Hereinafter, the configuration related to the recording function of the data processing device 10 will be described. The data processing device 10 includes a video signal input unit 100, an audio signal input unit 102, an MPEG2-PS encoder 170, a recording unit 120, a continuous data area detection unit 160, a recording control unit 161, and a logical block management unit 163.

The video signal input unit 100 is a video signal input terminal, and receives a video signal representing video data. The audio signal input unit 102 is an audio signal input terminal, and receives an audio signal representing audio data. For example, when the data processing device 10 is a portable video recorder, the video signal input unit 100 and the audio signal input unit 102 are connected to a video output unit and an audio output unit of a tuner unit (not shown), respectively, and receive the video and audio signals from them. When the data processing device 10 is a movie recorder, camcorder, or the like, the video signal input unit 100 and the audio signal input unit 102 receive the video and audio signals output from a camera CCD (not shown) and a microphone, respectively.

The MPEG2-PS encoder 170 (hereinafter referred to as "encoder 170") receives the video signal and the audio signal, performs the processing of the present invention described later, and generates an MPEG2 program stream (PS), or a PS conforming to the DVD-VR standard as shown in FIG. 5. The encoder 170 has a video compression unit 101, an audio compression unit 103, and a PS assembling unit 104. The video compression unit 101 and the audio compression unit 103 compress and encode the video signal and the audio signal, respectively, based on the MPEG2 standard to generate video data and audio data. The PS assembling unit 104 divides the video data and the audio data into packs of 2 kbytes each (V-PCK and A-PCK), arranges these two types of packs in order so that they constitute one VOBU, and adds the RDI pack 27.

The recording unit 120 controls the pickup 130 based on instructions from the recording control unit 161, and records the video object units (VOBUs) 26 of the PS starting from the position of the logical block number instructed by the recording control unit 161. At this time, the recording unit 120 divides each VOBU into 32-kbyte units, adds an error correction code to each unit, and records each unit as one logical block on the optical disk 131. If the recording of one VOBU ends in the middle of a logical block, the recording of the next VOBU continues without leaving a gap. The PS is stored on the optical disk 131, for example, in a form as shown in FIG.

The continuous data area detection section 160 checks the use status of the sectors of the optical disk 131 managed by the logical block management section 163, and detects continuous free logical block areas.

The recording control unit 161 controls the operation of the recording unit 120. The recording control unit 161 issues an instruction to the continuous data area detection unit 160 in advance to detect a continuous free logical block area. The recording control unit 161 notifies the recording unit 120 of the logical block number each time a logical block unit is written, and notifies the logical block management unit 163 when a logical block has been used. Note that the recording control unit 161 may cause the continuous data area detection unit 160 to dynamically detect the size of a continuous free logical block area.

The logical block management unit 163 manages the usage status of each logical block number based on the used logical block numbers notified from the recording control unit 161. That is, the usage status of each sector constituting a logical block is recorded and managed as used or unused by means of the space bitmap descriptor area specified in the file structure of UDF or ISO/IEC 13346. Then, in the final stage of the recording process, the FID and the file entry are written to the file management area on the disk.

Hereinafter, the relationship between the data amount and the reproduction time when PS is recorded will be described. The continuous data area detector 160 re-detects the next continuous data area when the remainder of one continuous data area falls below 3 seconds in terms of the maximum recording / reproducing rate. When one continuous data area becomes full, writing is performed to the next continuous data area.

When the data processing device 10 reproduces the recorded PS, it reads the data from the optical disk 131 and decodes (reproduces) the read data in parallel. At this time, control is performed so that the data read rate is faster than the maximum data reproduction rate, so that there is never a shortage of data to be reproduced. As a result, as reproduction of the PS continues, surplus data to be reproduced accumulates per unit time at the rate difference between the data read rate and the maximum reproduction rate. The data processing device 10 reproduces this surplus PS data during periods in which the pickup 130 cannot read data (for example, during a seek operation), and can thereby realize seamless PS reproduction.

For example, suppose the data readout rate of the playback unit 121 is 11.08 Mbps, the maximum data playback rate of the PS decomposing unit 114 is 10.08 Mbps, and the maximum travel time of the pickup is 1.5 seconds. To play the PS without interruption, 15.12 Mbits of extra data is required while the pickup 130 is moving. To secure this much data, data must be read continuously for 15.12 seconds, that is, for the time obtained by dividing 15.12 Mbits by the difference between the data readout rate of 11.08 Mbps and the maximum data recording/reproducing rate of 10.08 Mbps. During 15.12 seconds of continuous reading, up to 167.53 Mbits of data (that is, 16.62 seconds of playback data) is read. Therefore, by securing a continuous data area of 16.62 seconds or more (approximately 17 seconds), continuous data reproduction can be guaranteed. Note that there may be several defective logical blocks in the middle of the continuous data area. In that case, however, a continuous data area slightly longer than 16.62 seconds of playback time must be secured, in anticipation of the read time required for the defective logical blocks during playback.
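The sizing of the continuous data area follows from three inputs: the read rate, the maximum playback rate, and the maximum seek time. The sketch below works through that arithmetic step by step. The function and variable names are hypothetical, and rates are expressed in Mbit/s.

```python
# Sketch of the continuous-data-area sizing arithmetic described above.
# Names are illustrative; rates are in Mbit/s and times in seconds.
def continuous_area_budget(read_rate: float, play_rate: float, seek_time: float):
    """Return (extra_mbit, read_seconds, data_read_mbit, playback_seconds)."""
    # Data consumed by the decoder while the pickup is seeking.
    extra_mbit = play_rate * seek_time
    # Surplus accumulates at (read_rate - play_rate) during continuous reading.
    read_seconds = extra_mbit / (read_rate - play_rate)
    # Total data read during that continuous-read period.
    data_read_mbit = read_seconds * read_rate
    # The same data expressed as playback time at the maximum rate.
    playback_seconds = data_read_mbit / play_rate
    return extra_mbit, read_seconds, data_read_mbit, playback_seconds


if __name__ == "__main__":
    extra, read_s, data, play_s = continuous_area_budget(11.08, 10.08, 1.5)
    print(f"extra data needed : {extra:.2f} Mbit")
    print(f"continuous reading: {read_s:.2f} s")
    print(f"data read         : {data:.2f} Mbit")
    print(f"playback time     : {play_s:.2f} s")
```

With the rates quoted in the text, the resulting playback time is a little under 17 seconds, which is where the required length of the continuous data area comes from.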

The recording function of the data processing device 10 is realized by the above components. Note that the configuration and operation of the data processing device 10 relating to the playback function are the same as those of the conventional data processing device 90, and a description thereof will be omitted.

Next, the stream conversion function from PS to TS of the data processing device 10 and the configuration for it will be described. The data processing device 10 includes a 1394 I/F unit 140, an output timing adjustment unit 141, and a PS-TS conversion unit 142.

Since the data format that can be transmitted according to the IEEE 1394 interface standard is limited to TS, the data processor 10 has a stream conversion function from PS to TS. The stream conversion is performed, for example, in the following situation. When the data processing device 10 is a camcorder, video and audio are recorded as a PS on the optical disk 131 loaded in the camcorder. When the user connects the camcorder to a PC via an IEEE 1394 cable and instructs that the recorded content be transmitted to the PC, the camcorder converts the PS to a TS. As a result, the TS is transmitted from the camcorder to the PC.

The 1394 I/F unit 140 is a terminal that outputs data based on the IEEE 1394 interface standard. The output timing adjustment unit 141 adjusts the output timing of the TS and passes the TS to the 1394 I/F unit 140.

The PS→TS conversion unit 142 (hereinafter referred to as "conversion unit 142") performs format conversion from PS to TS. The conversion unit 142 receives the PS reproduced through the pickup 130 and the reproduction unit 121, or the PS generated by the encoder 170, and generates a TS. The data structure of the generated TS is, for example, as shown in FIG. The conversion unit 142 inserts the PAT-TSP 32, PMT-TSP 33, PCR-TSP 34, etc. shown in FIG. 6 to generate the TS. The data structures and functions of these TS packets are as described with reference to FIG.

Next, the stream characteristics of the PS and the TS will be described with reference to FIG. The data conversion process from PS to TS will be described.

The encoder 170 that generates the PS must produce a PS that can be decoded by a general MPEG2-PS decoder. For this purpose, predetermined conditions called system encoding conditions are imposed. By satisfying the system encoding conditions, the PS has stream characteristics that allow it to be decoded by a general MPEG2-PS decoder 171. The same applies to the TS.

FIG. 11(a) is a block diagram of an MPEG2-PS decoder 171 for decoding the PS. The decoder 171 has a PS decomposition unit 114, a video decompression unit 111, and an audio decompression unit 113. This configuration is almost the same as that of the decoder model called the Program Stream System Target Decoder (P-STD).

The PS is input to the demultiplexer 114a of the PS decomposition unit 114 at an arbitrary rate described in the pack header (for example, a relatively high rate of 10.08 Mbps). The demultiplexer 114a separates the PS into video data and audio data; at that same rate (peak rate 10.08 Mbps), the video data is input to the buffer B_V 114b and the audio data to the buffer B_A 114c. At this time, the pack header, the system header, the PES header, and the like that constitute the PS are removed, and an elementary stream composed of video data or audio data is input to each buffer. The video and audio elementary streams are decoded by the video decompression unit 111 and the audio decompression unit 113, respectively, and output as an uncompressed video stream and audio stream.

On the other hand, FIG. 11 (b) shows functional blocks of a TS decoder model for decoding TS. This TS decoder model is called the so-called transport stream system target decoder (T-STD). The conversion unit 142 of the data processing device 10 is configured based on T-STD.

In FIG. 11(b), the TS is input to the PID filter 42 of the decoder at a fixed transmission rate according to the characteristics of the transmission medium and the like. The PID filter identifies the packet ID (PID) assigned to each TS packet and separates the TS into video data, audio data, system data, etc. according to the PID. The video data is input to the transport buffer TB_V 43a, and the audio data to the transport buffer TB_A 45a. The video data then passes through the multiplex buffer MB_V 43b and the elementary stream decoder buffer EB_V 43c, and is decoded into a video stream by the video decompression unit 44. Meanwhile, the audio data passes through the elementary stream decoder buffer EB_A 45b and is decoded into an audio stream by the audio decompression unit 46.

The data separated by the PID filter 42 is input to each buffer, and transferred between the buffers, at different rates. For example, video data is transferred from the PID filter 42 to the buffer 43a at 10.08 Mbps, from the buffer 43a to the buffer 43b at 18 Mbps, and from the buffer 43b to the buffer 43c at 15 Mbps or less. Audio data, on the other hand, is transferred from the buffer 45a to the buffer 45b at 2 Mbps or less.

Paying attention to the transmission rate in the decoder for each of PS and TS, PS audio data is transmitted faster than TS audio data. In the PS, the time information when the P-STD demultiplexer 114a receives each pack of the PS is defined. This time information is defined as time information (system clock reference; SCR) in the PS pack headers 22a (FIG. 2) and 27a (FIG. 5). On the other hand, the time information at the time when the PID filter 42 of the T-STD receives each transport packet of the TS is defined in the TS. This time information is recorded at a predetermined frequency in the transport packet header. For example, the time information is set in PCR-TSP at intervals of 100 ms or less.

The PS assembling unit 104 according to the present embodiment generates the PS on the assumption that it will be converted into a TS, and adds time information in consideration of the difference in the transmission rate of each stream relative to the P-STD. That is, PS system encoding is performed so that the PS can be converted to a stream conforming to the T-STD without changing the packet arrival time information. Specifically, system encoding of the PS is performed on the assumption that the audio stream is transferred from the demultiplexer 114a to the audio buffer B_A 114c at a maximum of 2 Mbps, so that, for example, the buffers B_V 114b and B_A 114c do not underflow. In this way, when the PS time information (SCR) is converted to the TS time information (PCR), the value can be used as it is. This eliminates the need to recalculate the time information (PCR) by system encoding.

Also, when generating a TS, it is necessary to insert TS packets that do not exist in the PS, such as a TS packet carrying the PAT and a TS packet carrying the PMT. Therefore, the PS assembling unit 104 according to the present embodiment sets the time information in the PS in consideration of the timing at which such TS packets will be inserted, so that the time information of the TS packets does not have to be calculated again at the time of TS conversion.

By providing the "SCR gap" described below in the time information (SCR) of the PS, the PS assembling unit 104 of the present embodiment makes it easy to insert a TS packet including a PAT, a TS packet including a PMT, and the like at the time of conversion from PS to TS.

Hereinafter, the processing of the PS assembling unit 104 of the present embodiment will be specifically described with reference to FIGS. FIG. 12 shows the correspondence between the PS 50 generated by the PS assembling unit 104 and the TS 55 converted from the PS 50. The PS 50 includes video packs (V-PCK) 51 and 54 and an audio pack (A-PCK) 52, which constitute a VOBU. The PS 50 is assumed here to be recorded on the optical disk 131, but it does not matter whether or not it is actually recorded.

The PS assembling unit 104 sets the SCR gap 53 between appropriate packs (V-PCK 51 and A-PCK 52 in the figure). The SCR gap 53 can be provided by setting SCR#(n+1) of the A-PCK 52 to a value later than the SCR value that would conventionally be assigned. Specifically, where T1 is the SCR value assigned by the conventional PS assembling unit 244 (FIG. 1) and T2 (T2 > T1) is the SCR value assigned by the PS assembling unit 104, the SCR gap 53 is the time represented by (T2 - T1).

FIG. 13 shows the structure of the pack header included in each pack of the PS. Among the data specified in the pack header, the value of the SCR is specified by 33 bits in a field 56 that is divided into three parts and has the field name system_clock_reference_base. According to the present embodiment, the SCR value T2 described above is set in the field 56 of the A-PCK 52.
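As an illustration of the three-part field, the sketch below splits a 27 MHz SCR value into the 3-bit, 15-bit, and 15-bit pieces of the 33-bit system_clock_reference_base. The 90 kHz base with a separate 9-bit extension (SCR = base × 300 + extension) is taken from the MPEG-2 Systems pack header layout rather than from this specification, and the function names `split_scr` and `join_scr` are hypothetical. Marker bits and the surrounding header bytes are omitted.

```python
def split_scr(scr_27mhz):
    """Split a 27 MHz SCR into (base[32..30], base[29..15], base[14..0], ext)."""
    base = (scr_27mhz // 300) & ((1 << 33) - 1)  # 90 kHz part, 33 bits
    ext = scr_27mhz % 300                        # 27 MHz remainder, 0..299
    return (base >> 30) & 0x7, (base >> 15) & 0x7FFF, base & 0x7FFF, ext


def join_scr(hi3, mid15, lo15, ext):
    """Rebuild the 27 MHz SCR from the three base parts and the extension."""
    base = (hi3 << 30) | (mid15 << 15) | lo15
    return base * 300 + ext


parts = split_scr(2_700_000)   # 100 ms expressed in 27 MHz ticks
print(parts)                   # prints: (0, 0, 9000, 0)
print(join_scr(*parts))        # prints: 2700000
```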

When converting from PS to TS, TS packets such as the TS packet 32 of the PAT, the TS packet 33 of the PMT, the TS packet 34 of the PCR, and the SIT are inserted in the period of the SCR gap 53. This eliminates the need to recalculate the SCR value of the A-PCK 52 and simplifies the stream conversion process. The conversion process from PS to TS will be described later with reference to FIG.

The PS assembling unit 104 encodes each pack so that it can be decoded by the program stream system target decoder (P-STD) (FIG. 11(a)), and generates the PS 50 so that the converted TS 55 can be decoded by the transport stream system target decoder (T-STD) (FIG. 11(b)) without recalculating most of the time information. The MPEG standard specifies system encoding conditions for encoding so as to ensure that a general decoder can decode the data. The system encoding conditions are specifically as follows. First, even if the SCR gap 53 is provided by delaying SCR#(n+1) of the A-PCK 52, the A-PCK 52 must still be fully transferred at a transmission rate of 2 Mbps by the timing indicated by SCR#(n+2). Therefore, the time interval defined by SCR#(n+1) of the A-PCK 52 and SCR#(n+2) of the V-PCK 54 must be at least as long as the time required for the A-PCK 52 to be transmitted at a transmission rate of 2 Mbps.

Further, the time interval between adjacent SCRs other than the SCR of the A-PCK 52 and the SCR of the V-PCK 54 must also be determined. For both V-PCK and A-PCK, this time interval must be at least as long as the time required to transmit one pack at the peak rate of the PS (10.08 Mbps).
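The two spacing conditions can be checked mechanically. The sketch below assumes 2048-byte packs and a 27 MHz SCR counter, and, as a simplification not stated in the text, applies the 2 Mbps rule to the interval following every audio pack and the 10.08 Mbps rule to the interval following every video pack. The names `min_interval_ticks` and `check_scr_spacing` are hypothetical.

```python
CLOCK_HZ = 27_000_000      # SCR counter frequency
PACK_BITS = 2048 * 8       # assumed pack size: 2048 bytes


def min_interval_ticks(rate_bps):
    """Time to transmit one pack at rate_bps, in 27 MHz ticks (rounded up)."""
    return -(-PACK_BITS * CLOCK_HZ // rate_bps)


def check_scr_spacing(packs, audio_rate_bps=2_000_000, peak_rate_bps=10_080_000):
    """packs: list of (kind, scr) in stream order, kind being 'V' or 'A'.

    Returns the indices of packs whose distance to the next SCR is too short.
    """
    violations = []
    for i in range(len(packs) - 1):
        kind, scr = packs[i]
        next_scr = packs[i + 1][1]
        rate = audio_rate_bps if kind == "A" else peak_rate_bps
        if next_scr - scr < min_interval_ticks(rate):
            violations.append(i)
    return violations


print(min_interval_ticks(2_000_000))    # prints: 221184 (8.192 ms)
print(min_interval_ticks(10_080_000))   # prints: 43886 (about 1.625 ms)
```

With these figures, an audio pack needs roughly five times as much SCR spacing as a video pack, which is why delaying an audio SCR to form a gap must be checked against the next pack's SCR.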

In the PS assembling unit 104 of the present embodiment, the SCR gap is provided immediately before the V-PCK located at the head of the VOBU, and is provided again within every 100 ms thereafter. As a result, at the time of conversion into a TS, the TS packets 32 to 34 including the PAT and the like can be inserted after the TS packetization of one VOBU is completed. However, an increase in the bit rate of the TS must be assumed such that one or more (e.g., three) TS packets can be inserted into the SCR gap 53 every 100 milliseconds. FIG. 14 schematically shows the positions of the SCR gaps provided at the beginning of the VOBU and 100 ms after it. FIG. 15 schematically shows SCR gaps repeatedly provided within 100 ms. The SCR value at the beginning of the SCR gap is set to an integral multiple of 2,700,000. The reason for using the value 2,700,000 is that it corresponds to 100 milliseconds at a clock frequency of 27 MHz. Note that it is sufficient for the SCR gap to be provided repeatedly within 100 milliseconds; it need not necessarily be provided immediately before the V-PCK located at the head of the VOBU. Further, as long as the interval is within 100 ms, it may be, for example, an integral multiple of 50 ms, or may be of variable length. FIG. 16 schematically shows the SCR gaps set for a PS conforming to the DVD-VR standard. The SCR gap is provided immediately before the RDI-PCK 58 located at the beginning of VOBU #n. Furthermore, the next SCR gap is set within 100 ms from that SCR gap.

FIG. 17 shows the processing procedure by which the PS assembling unit 104 sets the SCRs of the program stream. First, in step S100, the PS assembling unit 104 determines a provisional SCR value for each pack. This provisional SCR value is the same as the value determined by the conventional PS assembling unit 244. Next, in step S101, it is determined whether the pack to be processed is the first pack of the VOBU. If the pack is not the first pack, the process proceeds to step S102. If the pack is the first pack, the process proceeds to step S104. In step S102, the PS assembling unit 104 determines whether the SCR value is near an integral multiple of 100 ms. For example, when a 27 MHz counter is used, the PS assembling unit 104 determines whether the SCR value is near a multiple of 2,700,000. If the value is not near an integral multiple, the process proceeds to step S103, in which the provisionally determined SCR value is written into the pack header without being changed. On the other hand, if the value is near an integral multiple, the process proceeds to step S104, in which the SCR value is shifted by a predetermined SCR gap.
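The decision made in steps S101 to S104 can be sketched as a single function. The text does not say how close to a multiple of 100 ms counts as "near", nor how long the gap is, so both are passed in as parameters (`tolerance_ticks` and `gap_ticks`) that are purely illustrative; the function name `final_scr` is likewise hypothetical. How a shift propagates to the SCRs of later packs is left out.

```python
TICKS_PER_100MS = 2_700_000   # 100 ms at a 27 MHz counter


def final_scr(provisional_scr, is_first_pack_of_vobu, gap_ticks, tolerance_ticks):
    """Return the SCR to write into the pack header for one pack."""
    if not is_first_pack_of_vobu:                         # S101
        remainder = provisional_scr % TICKS_PER_100MS
        distance = min(remainder, TICKS_PER_100MS - remainder)
        if distance > tolerance_ticks:                    # S102: not near a multiple
            return provisional_scr                        # S103: keep as is
    return provisional_scr + gap_ticks                    # S104: shift by the gap


# Hypothetical values: a 500-tick gap and a 100-tick "nearness" window.
print(final_scr(1_000, True, 500, 100))        # prints: 1500
print(final_scr(1_000_000, False, 500, 100))   # prints: 1000000
print(final_scr(2_700_050, False, 500, 100))   # prints: 2700550
```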

Next, the procedure for generating a TS from a PS provided with SCR gaps will be described. FIG. 18 shows the procedure of the conversion process from PS to TS, which is executed in the conversion unit 142. First, in step S200, the insertion timing of the first PAT, PMT, and PCR is determined based on the SCR value of the V-PCK located at the beginning of the VOBU. For example, when a 27 MHz counter is used, the insertion timing is obtained by (SCR value at the head of the VOBU) div 2,700,000. Here, "div" means that the fractional part of the quotient is discarded. The value 2,700,000 corresponds to 100 milliseconds, that is, 1/10 of the count of 27 MHz.

In step S201, the conversion unit 142 calculates the tail SCR value of the immediately preceding pack (except for the first pack, which has no preceding pack). FIG. 19 shows the tail SCR for each pack. As shown in the figure, the "tail SCR" indicates the earliest timing (time information) at which the one byte immediately following a given pack can be transmitted. Returning to FIG. 18, the remaining steps will be described. In step S202, the conversion unit 142 determines whether the pack to be processed is the first pack of the VOBU. If the pack is not the first pack, the process proceeds to step S203, and if the pack is the first pack, the process proceeds to step S209. In step S203, the conversion unit 142 determines whether or not the obtained insertion timing falls between the tail SCR value and the SCR value of the pack. If it does, the process proceeds to step S209; if not, the process proceeds to step S207.

In step S209, the PAT, PMT, and PCR TS packets are inserted at the obtained insertion timing. Next, in step S204, 2,700,000 is added to the insertion timing value to obtain the next insertion timing value. Thereafter, the process proceeds to step S207. In step S207, the conversion unit 142 divides one pack into 188-byte units to generate TS packets. For example, as shown in FIG. 12, one video pack is divided into 11 video TS packets 30, and one audio pack is divided into 11 audio TS packets 31. In step S208, a time stamp is added to each TS packet.

In step S206, the conversion unit 142 determines whether all packs in the PS have been processed. If all the packs have been processed, the processing is terminated. If not, the next pack is extracted in step S205, and the processing from step S201 onward is performed again. By the above processing, the conversion unit 142 can easily convert the program stream into a transport stream. In particular, there is no need to recalculate and set the PCR, or the PTS (Presentation Time Stamp) and DTS (Decoding Time Stamp) in the V-PCK or A-PCK, and the time information of each TS packet in the transport stream can be determined based on the SCR values of the program stream. Consequently, the processing time required in the conventional case is not needed, and the processing load during conversion can be significantly reduced. The time information of a TS packet is called the PCR, and defines the time at which the TS packet should reach the virtual MPEG2 decoder. The PCR value can be calculated based on the SCR value. For example, the value of the PCR 34 in FIG. 12 may be set to a value earlier in time than the value of SCR#(n+1) of the PS. The amount by which it is earlier is about the transmission time of one TS packet 34.
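The insertion decisions of FIG. 18 can be sketched as follows. This is only an illustration under several assumptions that the text does not state: packs are 2048 bytes, the tail SCR is the pack's SCR plus its transmission time at 10.08 Mbps, the result of the "div" operation is multiplied back by 2,700,000 so that the insertion timing is expressed in 27 MHz ticks, and the insertion timing is tested against the interval from the preceding pack's tail SCR to the current pack's SCR. The function names are hypothetical, and the actual 188-byte packetization and time stamping (S207, S208) are omitted.

```python
TICKS_PER_100MS = 2_700_000   # 100 ms at 27 MHz
CLOCK_HZ = 27_000_000


def tail_scr(scr, pack_bytes=2048, mux_rate_bps=10_080_000):
    """Earliest time at which the byte right after this pack can be sent."""
    return scr + (pack_bytes * 8 * CLOCK_HZ) // mux_rate_bps


def plan_psi_insertions(packs):
    """packs: list of (scr_ticks, is_first_pack_of_vobu) in stream order.

    Returns (pack_index, insertion_time) pairs: PAT/PMT/PCR packets are
    inserted before the pack with that index, at that time.
    """
    insertions = []
    timing = None
    prev_tail = None
    for index, (scr, is_first) in enumerate(packs):
        if timing is None:                                    # S200
            timing = (scr // TICKS_PER_100MS) * TICKS_PER_100MS
        in_gap = prev_tail is not None and prev_tail <= timing <= scr  # S203
        if is_first or in_gap:                                # S202
            insertions.append((index, timing))                # S209
            timing += TICKS_PER_100MS                         # S204
        prev_tail = tail_scr(scr)                             # S201 for next pack
    return insertions


packs = [(2_700_000, True), (2_750_000, False), (5_400_100, False)]
print(plan_psi_insertions(packs))
# prints: [(0, 2700000), (2, 5400000)]
```

In the example, the third pack's SCR was delayed just past the 200 ms boundary, so the next insertion timing (5,400,000) falls inside the gap before it.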

It has been described that the recorded program stream is converted into a transport stream in order to be output from the IEEE 1394 interface unit 140. However, the same processing can be performed when the recorded program stream is converted into a transport stream and then recorded on a recording medium such as an optical disc 131. To include a timestamp in the transport stream, the timestamp can be selected based on the SCR of the program stream.

In the program stream, position information indicating the position where the SCR gap is provided can be defined. The position information is, for example, the head address of the padding packet, with the beginning of the program stream taken as 0. Also, the SCR value at the head of the SCR gap may be set in the manufacturer extension fields 27e and 67 of the RDI-PCK (FIGS. 5 and 23). Further, usable SCR values for TS packets including the PAT, PMT, PCR, or SIT may be recorded as candidates. Such information may also be recorded in a file different from the file containing the video stream.

(Embodiment 2)

Hereinafter, a data processing device according to a second embodiment of the present invention will be described. The components of the data processing device of the present embodiment are the same as those of the data processing device 10 shown in FIG. 10. Therefore, the description of each function is omitted, and the processing of the data processing device will be described below.

The data processing device 10 according to the present embodiment differs from the data processing device according to the first embodiment in that a padding packet is provided instead of the SCR gap 53 in FIG.

FIG. 20 shows the correspondence between the PS 59-1 generated by the PS assembling unit 104 of the data processing device 10 according to the present embodiment and the TS 59-2 converted from the PS 59-1. The PS assembling unit 104 inserts the padding packet 61a into an appropriate pack (the video pack (V-PCK) 61 in the figure) constituting the VOBU. When the program stream is converted to a transport stream, TS packets including the PAT 32, PMT 33, PCR, SIT, etc. are inserted at the position corresponding to the padding packet 61a. One video pack is divided into 11 video TS packets 30, and one audio pack is divided into 11 audio TS packets 31. FIG. 21 shows the data structure of the padding packet 61a included in the video pack 61. The padding packet 61a has a packet header 63 and a field 64 in which padding bytes of a fixed value (0xFF) are stored. In the packet header 63, the data length (PES packet length) of the padding byte field 64 is indicated by 2 bytes, and thereby the length of the padding packet 61a can be specified. The data length of the padding byte field 64 is related to the packet length of the TS packet, and is, for example, an integral multiple of 188 bytes (for example, 2 to 4 times). Here, 188 bytes corresponds to the data size of a transport packet. When the data length of the padding byte field 64 is twice 188 bytes, two TS packets can be inserted; when it is four times 188 bytes, four TS packets can be inserted.
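A padding packet sized for a given number of TS packets could be built as in the sketch below. The start code prefix 0x000001 and the padding stream ID 0xBE come from the MPEG-2 Systems PES syntax rather than from this specification, and the function name `make_padding_packet` is hypothetical. Whether the result fits in the remaining space of a 2048-byte pack is not checked here.

```python
TS_PACKET_SIZE = 188   # bytes per transport packet


def make_padding_packet(ts_packet_count):
    """Build a padding packet whose padding field spans ts_packet_count x 188 bytes."""
    payload_len = ts_packet_count * TS_PACKET_SIZE
    if not 0 < payload_len <= 0xFFFF:
        raise ValueError("padding length must fit the 2-byte length field")
    # Packet header: start code prefix, padding stream ID, 2-byte length.
    header = b"\x00\x00\x01\xBE" + payload_len.to_bytes(2, "big")
    # Padding byte field: fixed value 0xFF.
    return header + b"\xFF" * payload_len


packet = make_padding_packet(2)
print(len(packet))                          # prints: 382
print(int.from_bytes(packet[4:6], "big"))   # prints: 376
```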

The padding packet 61a is the same as the padding packet described in the MPEG2 system standard. The arrangement of the padding packet is the same as in the DVD-VR standard. The video pack 61 may contain as much normal compressed video data as possible. Although the padding packet 61a of the present embodiment is provided in the video pack 61, it may instead be provided in the audio pack 62.

FIG. 22 schematically shows video packs 65 and 66 that include padding packets repeatedly provided within 100 milliseconds. The video packs 65 and 66 are provided in the middle of VOBU #n. Here too, it is necessary to assume that the bit rate of the TS increases so that one or more extra TS packets can be inserted in place of the padding packet every 100 milliseconds. It is sufficient for the padding packet to be provided repeatedly within 100 ms; it need not be provided immediately before the V-PCK located at the head of the VOBU.

As in the description related to FIG. 12, the PS assembling unit 104 generates the program stream 59-1 so that each pack satisfies the system encoding conditions of the program stream system target decoder (P-STD) and the converted transport stream also satisfies the system encoding conditions of the transport stream system target decoder (T-STD).

The system encoding conditions are specifically as follows. The time interval defined by SCR#(n+1) of the audio pack (A-PCK) 62 and SCR#(n+2) of the following V-PCK (not shown) must be at least as long as the time required for the A-PCK 62 to be transmitted at a transmission rate of 2 Mbps.

As another system encoding condition, the time interval between adjacent SCRs other than SCR#(n+1) of the A-PCK 62 and SCR#(n+2) of the subsequent V-PCK (not shown) is also specified (e.g., the time interval between the SCRs of the video packs 60 and 61). That is, each SCR value must be set so that this time interval is at least as long as the time required to transmit one pack at the peak rate of the PS (10.08 Mbps).

In the program stream, position information indicating the position where the padding packet is provided can be defined. The position information is, for example, the head address of the padding packet, with the head of the program stream taken as 0. Also, the SCR value at the head of the SCR gap may be set in the manufacturer extension fields 27e and 67 of the RDI-PCK (FIGS. 5 and 23). Further, usable SCR values for TS packets including the PAT, PMT, PCR, or SIT may be recorded as candidates. Such information may also be recorded in a file different from the file containing the video stream.

(Embodiment 3)

Hereinafter, a data processing device according to a third embodiment of the present invention will be described. The components of the data processing device of the present embodiment are the same as those of the data processing device 10 shown in FIG. 10. Therefore, the description of each function is omitted, and the processing of the data processing device will be described below. The following description assumes that the program stream according to the first or second embodiment is generated. However, this assumption is not mandatory. When the program stream according to the first or second embodiment is not generated, the time information in the stream must be calculated separately when converting from PS to TS.

The data processing device 10 according to the present embodiment converts a program stream conforming to the DVD-VR standard into a transport stream. In addition, it generates a program stream that allows the processing load to be reduced and the conversion processing time to be shortened. Here, it is assumed that the program stream and the transport stream have the relationship shown in FIG. 9. That is, as shown in (c) of FIG. 9, the PES packet of (b) is divided and stored in the data portions 41b of the TS packets. One frame of data (video data or audio data) is stored in the payload 40b of the PES packet. The data for one frame is obtained from the data in the data portions 39a and 39b of the packs shown in (a) and a part of the data portion 39c. One frame of video refers to, for example, a picture of two fields when displayed in an interlaced manner. One frame of audio refers to, for example, a total of 1536 samples in the case of AC-3 audio with a sampling frequency of 48 kHz and a bit rate of 256 kbps.

Since the PS assembling unit 104 of the data processing device 10 determines the data to be stored in the data portion of each pack when generating the PS, it can identify not only the data type, such as video or audio, but also, for example, which data constitutes which part of which frame. Therefore, the PS assembling unit 104 of the present embodiment generates, for each data type constituting the PS, auxiliary information indicating the position of each frame, the PTS (playback timing information), the DTS (decode timing information), etc. Since the RDI pack contains a data area that can be used freely by the manufacturer, the auxiliary information can be recorded in this area. FIG. 23(a) shows the data structure of the program stream generated by the PS assembling unit 104 according to the present embodiment. This program stream conforms to the DVD-VR standard, and has a 2-kilobyte RDI pack (RDI-PCK) with a manufacturer extension field 67.

The configuration of the manufacturer extension field 67 will be described below. The manufacturer extension field 67 has manufacturer identification information 67a, pixel number information 67b, compression mode information 67c, video auxiliary information 67d, and audio auxiliary information 67e. The manufacturer identification information 67a indicates the manufacturer of the data processing device 10 that generated the PS, the pixel number information 67b indicates the number of pixels in the vertical and horizontal directions of the recorded image, and the compression mode information 67c indicates whether or not the PS conforms to the DVD-VR standard. In addition, the manufacturer extension field 67 can record aspect information of the PS (4:3, 16:9, letterbox, etc.) and audio channel attribute information (monaural/stereo, etc.).

The video auxiliary information 67d and the audio auxiliary information 67e of the manufacturer extension field 67 specify information on the data structure for each data type constituting the PS. That is, the video auxiliary information 67d specifies the data structure of the video packs in the VOBU that has the RDI-PCK at its beginning, and includes video frame position information and PTS/DTS information. The audio auxiliary information 67e specifies the data structure of the audio packs, and includes audio frame position information and PTS information.

FIG. 23(b) shows the detailed data structure of the video auxiliary information 67d. The video auxiliary information 67d specifies frame position information 68a to 68d and PTS/DTS information for each of the I, P, and B frames specified in the MPEG standard.

Each item of frame position information 68a to 68d is the data size from the beginning of the program stream, or from the beginning of the VOBU, to the beginning of the corresponding frame data, and is expressed in bytes. FIG. 24 schematically shows the start addresses (storage positions) of the I frame, the B1 frame, and the B2 frame when the start of VOBU #i is taken as 0. The start address of a video frame includes the data length of any A-PCK located between the beginning of the VOBU and that frame. When generating the PS, the PS assembling unit 104 holds the start position of each frame, the PTS for that position, and the like, and generates the video auxiliary information 67d.

The PTS/DTS information shown in FIG. 23(b) indicates the PTS and DTS in frame units. The DTS may be provided as needed. The PTS/DTS information is provided in correspondence with each item of frame position information 68a to 68d: the PTS/DTS information of the I frame is provided after the I-frame position information 68a, and the PTS/DTS information of the P1 frame is provided after the P1-frame position information 68b. The same applies to the B1 and B2 frames and the like. Although the audio auxiliary information 67e is not shown in detail, its data structure is the same as that of the video auxiliary information 67d except that DTS information is not included.

Next, the procedure by which the conversion unit 142 converts a PS having the above data structure into a TS will be described. FIG. 25 shows the procedure of the conversion process from PS to TS. First, in step S300, a pack in units of 2 kbytes conforming to the DVD-VR standard is input to the conversion unit 142. In step S301, the conversion unit 142 determines whether or not the pack is an RDI pack. If the pack is an RDI pack, the process proceeds to step S302. If the pack is not an RDI pack, the process proceeds to step S303. Proceeding to step S303 means that the pack is necessarily a video pack (V-PCK) or an audio pack (A-PCK).

In step S302, the conversion unit 142 extracts the frame position information and PTS/DTS information for each of video and audio from the manufacturer extension field 67 of the RDI pack.

In step S303, the conversion unit 142 removes the pack header and the PES header of the pack to be processed. Note that, because the RDI pack is always located at the beginning of the VOBU, by the time step S303 is executed the conversion unit 142 has already obtained the frame position information and PTS/DTS information from the preceding RDI pack.

Therefore, the conversion unit 142 determines whether or not there is a frame start position in this pack by referring to the frame position information in step S304. If the frame start position is included, the process proceeds to step S305, and if not, the process proceeds to step S306.

In step S305, the conversion unit 142 refers to the PTS/DTS information, extracts the corresponding PTS/DTS, and generates a PES header including that PTS/DTS, which it prepends to the data. In the next step S306, the conversion unit 142 divides the processed data to generate and output 11 or 12 TS packets. Thereafter, the flow advances to step S307 to process the next pack.

Through the above processing, the conversion unit 142 can easily convert the PS into a TS composed of PES packets each containing one frame of data, without analyzing the elementary streams in the PS.
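The way the frame position information stands in for elementary-stream parsing can be sketched as follows. The sketch assumes that frame positions are byte offsets from the start of the VOBU, that every pack is 2048 bytes, and that the RDI pack is the first pack; `FrameInfo` and `plan_vobu` are hypothetical names. Header removal, PES header generation, and the split into TS packets (S303, S305, S306) are only indicated in comments.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

PACK_SIZE = 2048   # bytes per pack (assumed)


@dataclass
class FrameInfo:
    kind: str                  # 'V' for video, 'A' for audio
    start: int                 # byte offset of the frame from the VOBU start
    pts: int
    dts: Optional[int] = None  # audio entries carry no DTS


def plan_vobu(pack_kinds: List[str],
              frames: List[FrameInfo]) -> List[Tuple[int, str, List[FrameInfo]]]:
    """For each V/A pack, list the frames that start inside it.

    pack_kinds: pack types in stream order, e.g. ['RDI', 'V', 'V', 'A'].
    frames: auxiliary information read from the RDI pack (S302).
    """
    plan = []
    for index, kind in enumerate(pack_kinds):
        if kind == "RDI":                      # S301: nothing to packetize
            continue
        lo = index * PACK_SIZE
        hi = lo + PACK_SIZE
        starts = [f for f in frames
                  if f.kind == kind and lo <= f.start < hi]   # S304
        # S305 would build one PES header (PTS/DTS) per entry in `starts`;
        # S306 would then cut the data into 188-byte TS packets.
        plan.append((index, kind, starts))
    return plan


frames = [FrameInfo("V", 2048 + 40, pts=1000, dts=900),
          FrameInfo("V", 3 * 2048 + 100, pts=4000)]
for index, kind, starts in plan_vobu(["RDI", "V", "V", "V"], frames):
    print(index, kind, [f.pts for f in starts])
# prints:
# 1 V [1000]
# 2 V []
# 3 V [4000]
```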

The auxiliary information 67d and 67e shown in FIGS. 23(a) and (b) can be further extended. FIG. 26(a) shows a manufacturer extension field 69 in which size information is specified in the auxiliary information field 70. FIG. 26(b) shows the data structure of the auxiliary information field 70. The auxiliary information field 70 differs from FIG. 23(b) in that information on the data size of the frame is specified for each frame. For example, for the I frame, the PTS/DTS is described after the start address 68a, followed by the frame size 71. For the P1 frame, the PTS/DTS is described after the start address 68b, followed by the frame size 72. Similarly, for the B1 frame, the start address 68c and the PTS are described, followed by the B1 frame size 73.

The above processing does not refer to the insertion of TS packets including the PAT/PMT. However, these TS packets can be inserted by directly applying the processing of the data processing device 10 according to the first or second embodiment. This eliminates the need for the data processing device 10 to perform the system encoding process again.

Although the position information of the video frame has been described as the data size from the start of the program stream or VOBU to the start position of each frame, the difference in start address between adjacent frames can also be defined as the position information. The same applies to the position information of the audio frame. Furthermore, the number of audio streams may be one or more. Even when there are two or more audio streams, auxiliary information (for example, FIG. 23) can be specified in the RDI pack.
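The two representations of frame position carry the same information, as the short sketch below illustrates. It assumes that the first difference is measured from the start of the VOBU (offset 0), which the text does not state; `to_deltas` and `from_deltas` are hypothetical names and the sample offsets are made up.

```python
from itertools import accumulate


def to_deltas(starts):
    """Absolute start addresses -> differences between adjacent frames."""
    return [b - a for a, b in zip([0] + starts[:-1], starts)]


def from_deltas(deltas):
    """Differences between adjacent frames -> absolute start addresses."""
    return list(accumulate(deltas))


starts = [2088, 20000, 26000]          # made-up byte offsets in a VOBU
print(to_deltas(starts))               # prints: [2088, 17912, 6000]
print(from_deltas(to_deltas(starts)))  # prints: [2088, 20000, 26000]
```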

In addition, flag information may be recorded in the manufacturer extension fields 67 and 69 of the RDI pack to indicate whether auxiliary information is recorded, and whether one or more items of auxiliary information are recorded.

The video frame information and audio frame information in a VOBU are stored in the RDI_PCK at the beginning of that VOBU. However, auxiliary information such as video frame information and audio frame information may be stored at another position in the stream. In addition, auxiliary information on the audio frames synchronized with the video frames in a VOBU may be stored in the RDI_PCK at the beginning of the VOBU.

The program stream according to the present embodiment stores auxiliary information for conversion into a transport stream. Conversely, in consideration of the so-called reverse conversion from a transport stream to a program stream, auxiliary information for conversion into a program stream may be stored in the transport stream. The auxiliary information may be stored, for example, in a dedicated transport packet for storing the auxiliary information. Also, auxiliary information for conversion into a program stream or a transport stream may be stored in a PES stream.

The data processing device according to the embodiment of the present invention has been described above.

The operation of each component of the data processing device described above is performed based on instructions from a central control unit (not shown) provided in the data processing device. The central control unit issues instructions based on a program stored in a memory (not shown) of the data processing device, and controls the overall operation of the device. The program performs the procedures described in FIGS. 17, 18, 25, etc., and either parses a data stream according to a predetermined data structure or generates a data stream that conforms to such a data structure.

In the present specification, the MPEG2 program stream has been described as an example, but an MPEG1 system stream may be used instead. Although the recording medium has been described as a phase-change optical disk, other disk-shaped recording media can also be used, for example optical disks such as DVD-RAM, DVD-R, DVD-RW, DVD+RW, MO, CD-R, and CD-RW, as well as hard disks. The recording medium may also be a semiconductor memory. Correspondingly, the read/write head is a pickup in the case of an optical disk; for example, when the recording medium is an MO, a pickup and a magnetic head are used, and in the case of a hard disk, a magnetic head is used.

In each embodiment of the present invention, the transport stream may be in a format compliant with a digital broadcasting standard using MPEG, or in a format compliant with digital data broadcasting using MPEG. As a result, compatibility with a digital broadcast set-top box (STB) can be improved, and STB functions such as a data broadcast receiving function can be utilized.

In this specification, an example has been described in which the transfer rate slightly increases after conversion to the transport stream. Alternatively, one pack of the PS may be assumed to correspond to 13 TS packets, giving a transfer rate of 13/11 times, and TS packets including the PAT, PMT, and PCR may be inserted at the vacant timings after the conversion. In addition, although an example has been described in which one pack is divided into 11 TS packets, the data size of the elementary stream portion in one pack may be limited to 2024 bytes or less so that the number of TS packets per pack stays at 11. The limit is 2024 bytes because 2024 bytes equals 11 times the payload size of a transport packet (184 bytes).
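The arithmetic behind the payload limit can be checked with a short Python sketch. It assumes the standard 188-byte TS packet with a 4-byte header and no adaptation field, so that each packet carries 184 bytes of payload, and it ignores PES header overhead; the function and constant names are hypothetical.

```python
from fractions import Fraction

TS_PACKET_SIZE = 188                                 # bytes per TS packet
TS_HEADER_SIZE = 4                                   # minimum header, no adaptation field
TS_PAYLOAD_SIZE = TS_PACKET_SIZE - TS_HEADER_SIZE    # 184 bytes


def ts_packets_needed(data_bytes: int) -> int:
    """TS packets required to carry `data_bytes`, with the last packet padded if needed."""
    return -(-data_bytes // TS_PAYLOAD_SIZE)         # ceiling division


# Largest amount of data that still fits in 11 TS packets.
max_bytes_for_11_packets = 11 * TS_PAYLOAD_SIZE      # 2024

# Rate ratio when 13 packet slots are reserved for every 11 data packets.
rate_ratio = Fraction(13, 11)
```

Under these assumptions, one byte more than 2024 would spill into a twelfth TS packet, and the two extra slots out of every 13 are what leave room for packets such as the PAT, PMT, and PCR.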

The data processing device 10 may be, in addition to a camcorder, for example a stationary video recording device or a device that performs data conversion from a program stream to a transport stream. Furthermore, in the data processing device according to the first embodiment, the SCR gap is provided for the purpose of inserting a PAT or the like at the time of conversion into transport packets. However, it may instead be provided in order to insert another pack into the program stream. For example, when only one audio stream is included in the program stream, an SCR gap may be provided so that packs of a second audio stream can be inserted without the PS assembling unit 104 performing the system encoding process again and recalculating the time information.

In the PS obtained by the processing according to the third embodiment, various types of auxiliary information are specified in the manufacturer extended information, but the auxiliary information may instead be collected and stored as a data file separate from the MPEG program stream.

In the above description, the transmission rate of the PS is assumed to be 10.08 Mbps or less in accordance with the DVD-VR standard, but it may exceed 10.08 Mbps. This is because, with the PS generated by the processing of the present embodiment, raising the bit rate affects neither the conversion to the transport stream nor the conversion efficiency.

As described above, according to the data processing device of the present invention, recording in a program stream and conversion to a transport stream are both efficient and easy. Recording in the program stream gives high compatibility with editing application software for DVD devices and PCs. Also, because conversion to a transport stream is easy, the device has high affinity with the IEEE 1394 interface.

Industrial applicability

According to the present invention, when a program stream of video data conforming to the DVD-VR standard is transmitted, for example, via an IEEE 1394 digital interface, the program stream can be easily converted to a transport stream.

Claims

1. A data processing device, comprising:

a signal input unit to which at least one of a video signal and an audio signal including a plurality of frames is input;

a compression unit that compression-encodes the input signal to generate encoded data; and

a stream assembling unit that divides the encoded data to generate one or more packets having a fixed data size, and generates a data stream by adding a control packet storing control information for controlling the reproduction of the encoded data,

wherein the stream assembling unit generates, for each frame, position information indicating a storage position in the data stream, and describes the position information in the control packet as the control information.

2. The data processing device according to claim 1, wherein the stream assembling unit further generates time information indicating a timing of display of each frame and describes the time information in the control packet as the control information.

3. The data processing device according to claim 2, wherein the stream assembling unit further generates time information indicating a timing of decoding of each frame, and describes the time information as the control information in the control packet.

4. The data processing device according to claim 1, wherein the stream assembling unit further generates size information indicating the data size of each frame, and describes the size information in the control packet as the control information.

5. The data processing device according to claim 1, wherein the stream assembling unit describes the control information in a field in the control packet in which arbitrary information can be described.

6. The data processing device according to claim 1, wherein the stream assembling unit generates a control pack as the control packet, adds the control pack, and generates a program stream as the data stream.

7. A data processing method, comprising the steps of:

inputting at least one of a video signal and an audio signal including a plurality of frames;

compression-encoding the input signal to generate encoded data;

dividing the encoded data to generate one or more packets having a fixed data size; and

adding a control packet storing control information for controlling the reproduction of the encoded data to the one or more packets to generate a data stream,

wherein the step of generating the data stream includes generating, for each frame, position information indicating a storage position in the data stream, and describing the position information in the control packet as the control information.

8. The data processing method according to claim 7, wherein the step of generating the data stream further includes a step of generating time information indicating a display timing of each frame and describing the time information in the control packet as the control information.

9. The data processing method according to claim 8, wherein the step of generating the data stream further includes a step of generating time information indicating a decoding timing of each frame and describing the time information in the control packet as the control information.

10. The data processing method according to claim 7, wherein the step of generating the data stream further includes a step of generating size information indicating a data size of each frame and describing the size information in the control packet as the control information.

11. The data processing method according to any one of claims 6 to 8, wherein the step of generating the data stream describes the control information in a field in the control packet in which arbitrary information can be described.

12. The data processing method according to claim 7, wherein the step of generating the data stream includes generating a control pack as the control packet, adding the control pack, and generating a program stream as the data stream.