JP3976045B2

JP3976045B2 - Authoring system and method

Info

Publication number: JP3976045B2
Application number: JP2004318455A
Authority: JP
Inventors: 俊明瀬戸川
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2004-11-01
Filing date: 2004-11-01
Publication date: 2007-09-12
Anticipated expiration: 2015-12-12
Also published as: JP2005151551A

Description

本発明は、例えばディジタル・ビデオ・ディスク等のようなメディアを制作するための、ディジタル映像、音声等のデータのオーサリングシステム等に適用して好適な、オーサリングシステム及びその方法に関する。 The present invention relates to an authoring system and method suitable for application to an authoring system for data such as digital video and audio for producing a medium such as a digital video disc.

ディスク再生機のランダムアクセスには、真の意味でのランダムにアクセスする場合と、より一般的に応用されているところの、ＣＤ−ＤＡ（ＣｏｎｐａｃｔＤｉｓｋ−ＤｉｄｉｔａｌＡｕｄｉｏ）以来馴染みの深い「トラック」と呼ばれる単位でのアクセス、あるいは、レーザーディスクでの「チャプター」と呼ばれる単位でのアクセスがある。ＣＤにおいてはトラックとは通常１曲１曲に相当し、また、レーザーディスクにおける「チャプター」は、それぞれのシーンに相当する。また、最近のＶＩＤＥＯＣＤ（ビデオ・ＣＤ）では、トラックの概念をより発展させ、ストリーム全体をＰＬＡＹＩＴＥＭ（プレー・アイテム）と呼ばれる再生単位の部分に分割し、ＰＬＡＹＬＩＳＴ（プレー・リスト）と呼ばれるリストに記述してあるＰＬＡＹＩＴＥＭの順番に従って（ディスク上の記録順序とは異なる順番で）再生するＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬと言う新しいランダムアクセスの機構も存在する。 Random access of disc players is called “track”, which has been familiar since CD-DA (Compact Disk-Digital Audio), which is more commonly applied in the case of true random access. There is access in units, or access in units called “chapter” on a laser disc. In a CD, a track usually corresponds to one piece of music, and a “chapter” in a laser disc corresponds to each scene. Further, in recent VIDEO CD (video CD), the concept of the track is further developed, and the entire stream is divided into playback unit parts called PLAY ITEM (play item), which is called PLAY LIST (play list). There is also a new random access mechanism called PLAYBACK CONTROL that reproduces in the order of PLAY ITEM described in the list (in an order different from the recording order on the disk).

例えば、トラックによるアクセスは概ね次のような仕掛けで実現されている。
１．エンコード時（オーサリングあるいはマスタリングと呼ばれることもある）に、それぞれのトラックの開始点をタイムコードで指定しておく。トラックの番号は、例えば、１番から順番に増えていくとする。
２．エンコードを行なうときに、あるトラック番号で指定されたタイムコードに相当する映像または音声データがディスク上のどこに記録されたかを、そのディスクフォーマットで使用されている絶対番地方式によりディスクアドレスを覚えておく。｛ＣＤの場合は、絶対タイムコードが、ＤＶＤ（ＤｉｇｉｔａｌＶｉｄｅｏＤｉｓｋ）の場合にはセクタ・アドレスの使用が予定されている。｝
３．全てのトラック開始点について、トラック番号とそのディスクアドレスのペアの情報が出来上がったら、それをＴＯＣ（ＴａｂｌｅＯｆＣｏｎｔｅｎｔｓ）としてディスク上の物理的に最初にアクセスされる場所の近傍（通常は、ディスクの最内周）におく。
４．再生機は、このＴＯＣを読み、メモリーなどの記憶しておけば、ユーザーがあるトラックを指定したとき、即座にディスク上の絶対番地が判明するので、直接その番地へピックアップを進めて、再生すべきトラックの最初のデータを読むことができる。 For example, access by truck is generally realized by the following mechanism.
1. At the time of encoding (sometimes called authoring or mastering), the start point of each track is designated by a time code. For example, it is assumed that the track number increases sequentially from the first.
2. When encoding, remember where the video or audio data corresponding to the time code specified by a certain track number was recorded on the disc by the absolute address method used in the disc format. . {In the case of CD, the absolute time code is planned to be used, and in the case of DVD (Digital Video Disk), the use of a sector address is scheduled. }
3. When the track number / disk address pair information is completed for all track starting points, it is used as the TOC (Table Of Contents) in the vicinity of the first physical location on the disk (usually the disk Place it on the innermost circumference.
4). If the player reads this TOC and stores it in the memory, etc., when the user designates a track, the absolute address on the disc is immediately identified, so the pickup is directly advanced to that address for playback. You can read the first data of the track that should be.

ＤＶＤ再生機においても、これらの機能を継承しトラック（またはチャプター）番号によるランダムアクセスやＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬによるランダムアクセス機能が搭載される予定である。以下の説明での対象は、このように特定された点に対するランダムアクセス機能である。また、説明を簡単にするため、これらの機能を「頭出し機能」、及びランダムアクセスされるディスク上の特定の箇所を「頭出し点」と呼ぶことにする。 The DVD player will also inherit these functions and be equipped with a random access function based on track (or chapter) numbers and a random access function based on PLAYBACK CONTROL. The object in the following description is a random access function for the points specified in this way. For the sake of simplicity, these functions will be referred to as a “cue function” and a specific location on a disk that is randomly accessed will be referred to as a “cue point”.

ランダムアクセスはＧＯＰ（ＧｒｏｕｐＯｆＰｉｃｔｕｒｅ）で制限されるＭＰＥＧ２（ＭｏｖｉｎｇＰｉｃｔｕｒｅＥｎｇｉｎｅｅｒｉｎｇＧｒｏｕｐ−２）の映像信号が記録されたディスクなどを再生する際に、ランダムアクセスなどを行ない記録されたストリームの途中から再生する場合でも、通常のビデオを信号とは異なり任意のピクチャから再生が開始できるわけではない。ＭＰＥＧの画像圧縮の性格上、常にリファレンスピクチャと呼ばれるイントラ・ピクチャ（Ｉピクチャ）を先に読む必要があるため、ＧＯＰ単位での頭出しとなる。 Random access is performed from the middle of the recorded stream by performing random access when playing a disc on which a video signal of MPEG2 (Moving Picture Engineering Group-2) restricted by GOP (Group Of Pictures) is recorded. Even in this case, unlike normal signals, playback of normal video cannot be started from an arbitrary picture. Due to the nature of MPEG image compression, it is always necessary to read an intra picture (I picture) called a reference picture first, which leads to a head search in GOP units.

一方、通常のＭＰＥＧ２のエンコーディングにおいては、１ＧＯＰは１２〜１５枚のピクチャ（フレーム）で構成されるのが効率が良いことが分かっている。従って、もし、この１ＧＯＰ＝１２〜１５ピクチャのルールのみでエンコードしてしまうと、上で述べた再生時の頭出しの可能な点も１２〜１５ピクチャおきにしか得られないことになる。しかし、ビデオや映画作品のシーンの切れ目などの頭出しの候補となる点は、当たり前ながら、１２〜１５枚おきに現われることなど有りえなく、全く任意のところに現われるのは自明であろう。 On the other hand, in normal MPEG2 encoding, it is known that it is efficient that 1 GOP is composed of 12 to 15 pictures (frames). Therefore, if encoding is performed only with the rule of 1 GOP = 12 to 15 pictures, the above-described points that can be cued during reproduction can be obtained only every 12 to 15 pictures. However, it is obvious that points that are candidates for cueing such as breaks in scenes of videos and movie works may appear every 12 to 15 frames, and appear at any place.

もし、最も手軽に頭出し点を決めるとすれば、作品上の意味の有る切れ目を選ぶのではなく、最も近い場所のＧＯＰの切れ目を選んで頭出し点とする方法がある。（現状のＶＩＤＥＯＣＤなどはこの方法である。）しかし、圧縮エンコードの複雑さを厭わなければ、通常１２〜１５枚のピクチャおきに構成するＧＯＰのルールを、例えば、頭出し点の近傍で崩し、頭出し点をＧＯＰの切れ目に持ってくることは可能である。 If the cue point is to be determined most easily, there is a method of selecting the GOP cut at the nearest location and setting it as the cue point instead of selecting a cut with meaning in the work. (The current VIDEO CD is this method.) However, if the complexity of compression encoding is not considered, the rule of GOP usually composed of every 12 to 15 pictures is broken, for example, near the cue point. It is possible to bring the cue point to the end of the GOP.

ところが、問題はこれだけでは解決していないのである。頭出し点をＧＯＰの区切りにしただけでは、ランダムアクセス後、作品が意図したピクチャから再生が開始される保証はないのである。それは、ＧＯＰの中のピクチャの並び方に原因がある。 However, this alone does not solve the problem. Just by setting the cue point as a GOP break, there is no guarantee that playback will start from the picture intended by the work after random access. This is due to the way the pictures are arranged in the GOP.

以上のことについて、図８を参照して詳細に説明する。図８には、一般的なＭＰＥＧの画像列の様子について示されている。この図８には、連続した長い画像のうちの途中が切り出されて示されたもので、原画像がどのような種類の画像に圧縮エンコードされて、どのような順番に並べられるのかが、代表的な例が用いられて表されたものである。図８Ａには原画像が、図８Ｂにはエンコーダ出力が、図８Ｃにはデコーダ入力が、図８Ｄには再生画像が夫々示されている。 The above will be described in detail with reference to FIG. FIG. 8 shows a general MPEG image sequence. FIG. 8 shows a cut out part of a continuous long image, and the type of image that is compressed and encoded into the original image and the order in which the original image is arranged. A typical example is used. 8A shows the original image, FIG. 8B shows the encoder output, FIG. 8C shows the decoder input, and FIG. 8D shows the reproduced image.

〔ＧＯＰは完全には独立していない〕
図８における凡例において、Ｉピクチャとは、それ自身独立して再生が可能な画像のことであり、Ｐピクチャとはそれよりも前方（過去）にあるＩまたはＰピクチャからの予測を使った画像であり、Ｂピクチャとは、それ自身よりも前方（過去）にあるＩまたはＰピクチャからの予測と後方（未来）にあるＩまたはＰピクチャからの予測の両方を使用した画像のことである。なお、Ｉ、Ｂ、Ｐに添えられた数字「ｘｘ」は原画像における画面の順番を表し、小さい数字が過去を大きい数字が未来を表している。また、実線の矢印は、どの画像からどの画像が予測されているかの関係を示す。 [GOP is not completely independent]
In the legend in FIG. 8, an I picture is an image that can be reproduced independently, and a P picture is an image using prediction from an I or P picture ahead (in the past). A B picture is an image that uses both prediction from an I or P picture that is ahead (past) than itself and prediction from an I or P picture that is behind (future). Note that the numbers “xx” attached to I, B, and P indicate the order of the screens in the original image, and the smaller numbers indicate the past and the larger numbers indicate the future. A solid line arrow indicates a relationship of which image is predicted from which image.

さて、エンコーダに入力された原画像１は、図８Ａに示されるように「・・・ＢＢＰＢＢＩＢＢＰＢＢＰＢＢＩＢＢ・・・」の順番で順次圧縮エンコードされて行く。前方または後方からの予測は、実線の矢印で示されるように、例えば、Ｉ２ピクチャからの前方予測はＢ３ピクチャ、Ｂ４ピクチャとＰ５ピクチャの３画像に使われ、同時にＢ３ピクチャ、Ｂ４ピクチャには、Ｐ５ピクチャからの後方予測も使われる。 As shown in FIG. 8A, the original image 1 input to the encoder is sequentially compressed and encoded in the order of “... BBPBBIBBPBBPBBIBB. As indicated by solid arrows, forward prediction from forward or backward, for example, forward prediction from I2 picture is used for 3 images of B3 picture, B4 picture and P5 picture, and at the same time, for B3 picture and B4 picture, Backward prediction from P5 pictures is also used.

圧縮エンコードされた画像データは、図８Ｂに示されるエンコーダ出力に示されるように、デコード時に便利なように画像の並べ変えが行なわれる。例えば、原画上のＩ２ピクチャは、図８Ｂに示される位置に置かれるが、Ｂ３ピクチャ、Ｂ４ピクチャは、夫々後ろにずれてＰ５ピクチャの後ろの位置に夫々置かれる。こうすることにより、図８Ｃのデコーダ入力に示すように、Ｂ３ピクチャとＢ４ピクチャを再生するのに必要なＩ２ピクチャとＰ５ピクチャを先にデコードすることができるようになる。なお、図８Ｂに示すエンコーダ出力において、Ｉピクチャから次のＩピクチャの手前までをＧＯＰと呼んでいる。 As shown in the encoder output shown in FIG. 8B, the image data subjected to the compression encoding is rearranged for convenience in decoding. For example, the I2 picture on the original picture is placed at the position shown in FIG. 8B, but the B3 picture and the B4 picture are respectively shifted to the back and placed at the positions behind the P5 picture. By doing so, as shown in the decoder input of FIG. 8C, the I2 picture and the P5 picture necessary for reproducing the B3 picture and the B4 picture can be decoded first. Note that, in the encoder output shown in FIG. 8B, a portion from the I picture to the front of the next I picture is called a GOP.

さて、ここで注目しなければならないのは、図８Ｄの再生画像に示されたＢ０ピクチャとＢ１ピクチャの再生である。ＧＯＰの観点から見れば、このＢ０とＢ１ピクチャは、Ｉ２と同一のＧＯＰＮに含まれるのだが、その再生のためには、図８Ｃにおいて実線の矢印で示すように、一つ手前のＧＯＰに含まれるＰ１ピクチャからの予測が必要となる点である。このことをもって「ＧＯＰは完全には独立していない」と言うのである。 Attention should be paid here to the reproduction of the B0 picture and the B1 picture shown in the reproduction image of FIG. 8D. From the point of view of GOP, these B0 and B1 pictures are included in the same GOP N as I2, but for reproduction, the GOP is moved to the previous GOP as shown by the solid arrow in FIG. 8C. This is a point that requires prediction from the included P1 picture. With this, it is said that “GOP is not completely independent”.

さらに、重要なポイントは、原画像の順番で見ると、通常のＧＯＰにおける先頭の画像はＢ０ピクチャであることである。 Furthermore, an important point is that the top image in the normal GOP is a B0 picture when viewed in the order of the original images.

〔頭出し再生における問題点〕
次に、ＧＯＰは完全に独立でないために起こる問題について図９を参照して説明する。図９は、あるＧＯＰＮの先頭を頭出し点とした場合の問題の様子を示す説明図である。図９Ａにはデコーダ入力が、図９Ｂには再生画像が示されている。今、連続的に圧縮エンコードされたＧＯＰの列があり、デコーダに入力されるものとする。さて、このようなＧＯＰ列に対して、再生機が頭出し動作を行なったとする。このときの再生機のピックアップが位置する頭出し点は、ＧＯＰＮのデコーダ入力で見たときの先頭であるＩ２ピクチャであるとする。 [Problems in cue playback]
Next, a problem that occurs because the GOP is not completely independent will be described with reference to FIG. FIG. 9 is an explanatory diagram showing a problem when the head of a certain GOP N is used as a cue point. FIG. 9A shows a decoder input, and FIG. 9B shows a reproduced image. Now, it is assumed that there is a series of GOPs that are continuously compression-encoded and input to the decoder. Now, it is assumed that the player performs a cueing operation on such a GOP row. It is assumed that the cue point at which the pickup of the player at this time is located is the I2 picture that is the head when viewed from the GOPN decoder input.

さて、この時、Ｂ０ピクチャとＢ１ピクチャを再生するためには、Ｉ２ピクチャからの後方予測だけではなくＰ１ピクチャからの前方予測が必要になるが、頭出し点より以前のデータは読みだしていないので、図９Ａにおいて破線の矢印で示す、Ｐ１ピクチャからの予測は使えなくなってしまう。つまり、連続的に逐次再生をしているときは何の問題もないのだが、ある場所に頭出しをした場合には、「一つ前のＧＯＰの最後のＰピクチャからの予測」が使えなくなるため、結果、図９Ｂに示す再生画像においては、ＧＯＰＮ内の最初にあるいくつかの（この場合、Ｂ０とＢ１の２枚である）Ｂピクチャは再生することが出来なくなり、再生開始可能点はＩ２ピクチャからになってしまう。 At this time, in order to reproduce the B0 picture and the B1 picture, not only the backward prediction from the I2 picture but also the forward prediction from the P1 picture is required, but the data before the cue point is not read out. Therefore, the prediction from the P1 picture indicated by the dashed arrow in FIG. 9A cannot be used. In other words, there is no problem when continuous sequential playback is performed, but when the head is found at a certain place, “prediction from the last P picture of the previous GOP” cannot be used. Therefore, as a result, in the reproduced image shown in FIG. 9B, some of the first B pictures (in this case, B0 and B1) in GOP N cannot be reproduced, and the reproduction can be started. Becomes from the I2 picture.

元々、エンコードの時点では、ＧＯＰＮの先頭ピクチャはＢ０ピクチャであるから、この例の場合は、指定されたＧＯＰデータにピックアップを持って行けたとしても、頭出し点は結果的に２フレーム後ろにずれることになる。 Originally, at the time of encoding, the first picture of GOP N is a B0 picture. In this example, even if the specified GOP data can be picked up, the cue point is consequently two frames behind. It will shift to.

以上説明したように、頭出し再生においてはいくつかの問題がある。この問題点の解決方法としては、次のようなものが考えられる。 As described above, there are some problems in cue playback. The following can be considered as a solution to this problem.

〔一見正しそうな解決方法１〕
＊頭出し点をＧＯＰの先頭からオフセットさせる。
例えば、トラックの先頭として頭出しをさせたいピクチャをＧＯＰＮの先頭ではなく、最初から後ろにずらしてＩピクチャを頭出し点として指定しておき再生開始可能点と一致させておけば良いのではないか、とも思えるが、それでは次の問題が解決していない。 [Solution 1 seems to be correct at first glance]
* Offset the cue point from the beginning of the GOP.
For example, the picture that you want to cue at the beginning of the track is not the beginning of GOP N, but rather it is shifted from the beginning to specify the I picture as the cue point and match the playback start point. It seems that it is not, but that does not solve the following problem.

１．ＧＯＰの先頭からどのくらいずらしたらよいか一意には決まらないという問題。
ＭＰＥＧ２のルールでは、Ｂピクチャの枚数（Ｉピクチャどうしまたは、ＩピクチャとＰピクチャの間隔、ＭＰＥＧ用語を用いればＭの数）に制約はない。画質と符号効率から見てＢピクチャの枚数は任意に選べることになっている。従って、頭出し点をＧＯＰの先頭からどのくらいずらしたら良いのかエンコードしてみないことには何とも言えない。
元々、頭出し点を作品上の特定の位置に指定したいのでＧＯＰの区切りをずらそうとしたのに、エンコードしてみて実際にＧＯＰを構築してみないと頭出し点が求まらないと言う、いわば「鶏と卵」の関係に陥ってしまう。 1. The problem that it is not uniquely determined how much to shift from the top of the GOP.
According to the rules of MPEG2, there is no restriction on the number of B pictures (I pictures, or the interval between I pictures and P pictures, or the number of M if MPEG terms are used). In view of image quality and code efficiency, the number of B pictures can be arbitrarily selected. Therefore, it cannot be said that it is impossible to encode how much the cue point should be shifted from the head of the GOP.
Originally I wanted to specify the cue point at a specific position on the work, so I tried to shift the GOP delimiter, but if I did not actually build the GOP after encoding, I could not find the cue point In other words, it falls into the “chicken and egg” relationship.

２．頭出し点をタイムコードで確認できないという問題
さらに、タイムコードの問題がある。ＭＰＥＧ２では、タイムコードは各ピクチャごとに持つことは出来ない。オプションとしてＧＯＰヘッダをＧＯＰの先頭に配置することが出来、このＧＯＰヘッダにタイムコードを打つことができる。「打つ」とは、ＧＯＰヘッドに相当する記録媒体上にタイムコードデータを記録することである。予想されるＤＶＤのフォーマットでも、形は異なるが、やはりＧＯＰごとにタイムコードを打つことが可能である。この場合、タイムコードの値はどうするべきであろうか？
通常の、逐次再生を考えた場合、ＧＯＰの構造（どのピクチャが何枚使われているかということ）とは無関係に一意にタイムコードの値を決めるとすれば、極自然に考えて、ＧＯＰに含まれるピクチャの中で最初に再生表示されるべきピクチャに対応するタイムコードとなるはずである。図９の例で言えば、データ列の最初から逐次再生されてきたとすれば、Ｂ０ピクチャがＧＯＰＮのなかでは一番最初に再生されるべき画像であるから、ＧＯＰＮに付随させるべきタイムコードは、Ｂ０ピクチャの時間「ｔ＝０」の値となる。
そうなると、頭出し可能点がＧＯＰＮにおいて表示順で見ると先頭ではないイントラピクチャＩ２になってしまうと、それに対応するタイムコードは「ｔ＝２」の値であるが、それそのものの値はＧＯＰＮに付随するデータとしてはデータストリーム中には存在しなくなる。元々、タイムコードで指定された頭出し点（この例で言えば、Ｉ２ピクチャの「ｔ＝２」）であるが、そのタイムコードが最終的にデータとしてストリーム中に存在しないと、頭出し点が正しくエンコードされたかをデータそのものから確認をすることが困難となってしまう。さらに、頭出し点をタイムコードと頭出し点が記録されたディスク上の記録アドレスの対で表し、それをＴＯＣに登録するという従来の「頭出し点検索システム」から見ても、検索される側のデータのアドレスに位置する情報を読み出してみても、それに付随すべきタイムコードが直接的に確認できないのは問題である。
また、ＧＯＰに付随しているタイムコードから常に固定値のフレームだけ進んだところを頭出し点とするということは、結果的に問題点１に帰着してしまう。 2. The problem that the cue point cannot be confirmed with the time code. Furthermore, there is a problem with the time code. In MPEG2, a time code cannot be held for each picture. As an option, a GOP header can be arranged at the head of the GOP, and a time code can be put on this GOP header. “Strike” is to record time code data on a recording medium corresponding to the GOP head. Even in the expected DVD format, the shape is different, but it is also possible to enter a time code for each GOP. What should the timecode value be in this case?
When normal sequential playback is considered, if the time code value is uniquely determined regardless of the GOP structure (how many pictures are used), the GOP is considered very naturally. The time code corresponding to the picture to be reproduced and displayed first among the included pictures should be obtained. In the example of FIG. 9, if the B0 picture is the first image to be reproduced in GOP N if it is sequentially reproduced from the beginning of the data string, the time code to be attached to GOP N Is a value of time “t = 0” of the B0 picture.
In that case, when the cueable point becomes the intra picture I2 which is not the top when viewed in the display order in GOP N, the time code corresponding to it is a value of “t = 2”, but the value itself is GOP. Data associated with N does not exist in the data stream. Originally, the cue point specified by the time code (in this example, “t = 2” of the I2 picture), but if the time code does not finally exist in the stream as data, It becomes difficult to confirm from the data itself whether or not is correctly encoded. Furthermore, the cue point is expressed as a pair of a time code and a recording address on the disc where the cue point is recorded, and the search is performed even when viewed from a conventional “cue point search system” in which it is registered in the TOC. Even if the information located at the address of the data on the side is read out, the time code that should accompany it cannot be confirmed directly.
In addition, the fact that a point that is always advanced by a fixed value frame from the time code attached to the GOP is used as a cue point results in a problem 1.

ここで、以上説明した問題についてまとめる。トラックなどの頭出し点をＧＯＰの中に存在する最初のイントラピクチャにするという方法は、「Ｍ」の値を例えば３に固定する、と言う特別な場合には（ＴＯＣの問題は「ズレ」があるということで妥協して）実用になるかもしれないが、一般的に見れば「Ｍ」の値は任意なので、頭出し点をエンコード時に指定しても、実際に頭出しされる画像を一意に決定するのは困難である、ということになる。 Here, the problems described above are summarized. The method of setting the cue point of a track or the like as the first intra picture existing in the GOP is a special case in which the value of “M” is fixed to 3 (for example, the problem of the TOC is “deviation”). In general, the value of “M” is arbitrary, so even if you specify the cue point at the time of encoding, the image that is actually cued It is difficult to determine uniquely.

〔一見正しそうな解決方法２〕
＊ＣＬＯＳＥＤＧＯＰを適用する方法
図１０は、ＭＰＥＧ２における「ＣＬＯＳＥＤＧＯＰ（クローズド・ＧＯＰ）」の概念を示す説明図である。図１０Ａには原画像が、図１０Ｂにはエンコーダ出力が、図１０Ｃにはデコーダ入力が、図１０Ｄには再生画像が夫々示されている。 [Solution 2 seems to be correct at first glance]
* Method for Applying CLOSED GOP FIG. 10 is an explanatory diagram showing the concept of “CLOSED GOP” in MPEG2. 10A shows an original image, FIG. 10B shows an encoder output, FIG. 10C shows a decoder input, and FIG. 10D shows a reproduced image.

ＣＬＯＳＥＤＧＯＰは、ＧＯＰ内の最初にあるいくつかのＢピクチャなどをエンコードする際に、「その一つ前のＧＯＰの最後のＩ、Ｐピクチャからの予測」を使用せずに、原画面順で直後のＩピクチャのみから予測するようにコーディングする（ＯｎｌｙＢａｃｋｗａｒｄＰｒｅｄｉｃｔｉｏｎ）ことによりＧＯＰの独立性を確保する考え方である。図１０Ｂのエンコーダ出力で見て、このようなＧＯＰＮは、ＣＬＯＳＥＤＧＯＰと呼ばれる。今、図１０Ｃに示すように、プレーヤの頭出し点がＣＬＯＳＥＤＧＯＰになっていたとすると、デコーダ入力においては、図中、破線の矢印で示す、元々前のＧＯＰからの前方予測を必要としていないので、図１０Ｄに示す再生画像においてもＩ２ピクチャの直後にある何枚かのＢピクチャ（この例では、図１０Ｄに示すＢ０及びＢ１である）は、Ｉ２ピクチャからの後方予測のみを使用して、理論上は、再生が可能となる。 CLOSED GOP does not use “prediction from the last I or P picture of the previous GOP”, but encodes the first several B pictures in the GOP in the original screen order. The idea is to ensure the independence of the GOP by coding so as to predict only from the immediately following I picture (Only Backward Prediction). In view of the encoder output in FIG. 10B, such a GOP N is called a CLOSED GOP. As shown in FIG. 10C, if the player's cue point is a CLOSED GOP, the decoder input does not require forward prediction from the previous GOP as indicated by the dashed arrow in the figure. In the reproduced image shown in FIG. 10D, some B pictures immediately after the I2 picture (in this example, B0 and B1 shown in FIG. 10D) use only backward prediction from the I2 picture, In theory, reproduction is possible.

これは、一見、良さそうな解答であるが、完全な解決にはなっていない。ＭＰＥＧ２においては、ＣＬＯＳＥＤＧＯＰは、再生時の表示順で見たときに先頭にあるＢ０ピクチャから再生できる、とは言っているが、再生しなければならないとは言っていないのである。 This is a good solution at first glance, but it is not a complete solution. In MPEG2, CLOSED GOP is said to be reproducible from the B0 picture at the head when viewed in the display order at the time of reproduction, but it is not said that it must be reproduced.

通常の再生機は、図１０の例で言えば、頭出しの直後に、まずＩ２ピクチャを読みそれをデコードしなければならない。そしてこのＩ２ピクチャがリファレンスとなり次のデコードが可能になる。この時、次のことが問題となる。 In the example of FIG. 10, a normal player must first read an I2 picture and decode it immediately after cueing. This I2 picture becomes a reference, and the next decoding is possible. At this time, the following matters.

デコードされたＩ２ピクチャをすぐに表示せずに、Ｂ０ピクチャのデコードを待ってからこのＧＯＰにおける表示順に見て最初のピクチャ（Ｂ０ピクチャ）から表示するか、それともすぐにＩ２ピクチャを表示してしまい、時間的に手前のＢ１ピクチャは飛ばしてしまい、次のＢ３ピクチャに続けてしまうかは、全くの任意である。（実際に、頭出し点の再生開始点においては、デコードバッファが空であるために、まず、Ｉピクチャを再生してしまう再生機が多いようである。理由は、制御が簡単になるということと、少なくとも何枚かのＢピクチャをデコードする時間は再生画像の表示が出せないため、見た目ではアクセス速度が遅くなったように見えるのを避けるためである。） Do not display the decoded I2 picture immediately, wait for decoding of the B0 picture, and then display from the first picture (B0 picture) in the order of display in this GOP, or immediately display the I2 picture It is completely arbitrary whether the previous B1 picture is skipped in time and continued to the next B3 picture. (Actually, at the playback start point of the cue point, since the decoding buffer is empty, it seems that there are many players that first play the I picture. The reason is that the control becomes simple. (In order to avoid the appearance that the access speed appears to be slow because the playback image cannot be displayed during the time for decoding at least some of the B pictures.)

従って、エンコード時にトラックなどの頭出し点をＣＬＯＳＥＤＧＯＰとしても、再生時には、必ずしもそのＧＯＰにおける表示順に見て先頭のピクチャから再生されるとは限らないのである。但し、そのような再生方法はＭＰＥＧで規定されてルールを違反しているわけではない。 Therefore, even if the cue point of a track or the like is a CLOSED GOP at the time of encoding, it is not always reproduced from the first picture in the display order in the GOP at the time of reproduction. However, such a reproduction method is not violated by rules defined by MPEG.

このような状況により、現状では、確実に頭出し点を指定するエンコード方法がなく、どのピクチャから頭出し再生をするかは個々の再生機で自由に行われているのである。 Under such circumstances, at present, there is no encoding method for reliably specifying the cue point, and from which picture cue reproduction is performed is freely performed by each player.

本発明はこのような点を考慮してなされたもので、頭出し点の表示画像が一意に決定できるようにすることにより、再生時に安定した読み出しを行うことのできるオーサリングシステム及びその方法を提案しようとするものである。 The present invention has been made in consideration of such points, and proposes an authoring system and method capable of performing stable reading at the time of reproduction by enabling the display image of the cue point to be uniquely determined. It is something to try.

主要な発明の内の１つは、映像信号を符号化して符号化データを生成するオーサリングシステムにおいて、上記映像信号を再生する際の頭出し点が指定された場合、上記頭出し点に相当するピクチャが表示順にみて新たなＧＯＰの先頭になるように、当該新たなＧＯＰの区切りを決定する区切り決定手段と、上記区切り決定手段により決定された上記新たなＧＯＰの間で符号化に伴う参照予測を必要としないように、上記新たなＧＯＰ内におけるピクチャのピクチャタイプを決定して、上記新たなＧＯＰを構成するＧＯＰ構成手段と、上記ＧＯＰ構成手段により上記新たなＧＯＰが構成された上記映像信号を符号化することにより、符号化に伴う発生ビット量を算出する発生ビット量算出手段と、上記発生ビット量算出手段により算出された上記発生ビット量に基づいて量子化レベルを調整する量子化レベル制御手段と、上記量子化レベル制御手段により調整された量子化レベルに従って、上記新たなＧＯＰが構成された上記映像信号を符号化することにより、上記符号化データを生成する符号化手段と、を有するオーサリングシステムである。 One of the main inventions is an authoring system that encodes a video signal to generate encoded data, and corresponds to the cue point when a cue point for reproducing the video signal is designated. Reference prediction associated with encoding between a delimiter determining unit for determining a delimiter of the new GOP and the new GOP determined by the delimiter determining unit so that the picture becomes the head of the new GOP in the display order. The GOP composing means for determining the picture type of the picture in the new GOP and configuring the new GOP, and the video signal in which the new GOP is composed by the GOP composing means The generated bit amount calculating means for calculating the generated bit amount accompanying encoding, and the generated bit amount calculating means A quantization level control means for adjusting the quantization levels based on the serial number of bits generated in accordance with the adjusted quantization level by the quantization level control means, for encoding the video signal in which the new GOP is constituted Thus, an authoring system having encoding means for generating the encoded data.

上記発明によれば、映像信号を符号化して生成した符号化データが記録された記録媒体を再生するとき、頭出し点の情報を高速、且つ、正しく再生する。 According to the above invention, when reproducing the recording medium on which the encoded data generated by encoding the video signal is recorded, the information on the cue point is reproduced at high speed and correctly .

上述せる本発明によれば、映像信号を符号化して生成した符号化データが記録された記録媒体を再生するとき、頭出し点が一意に決定できるようにし、安定した頭出し機能の提供を保証することができるという効果がある。 According to the present invention described above, when playing back a recording medium on which encoded data generated by encoding a video signal is recorded, it is possible to uniquely determine a cue point, and to provide a stable cue function. There is an effect that can be done .

以下に、図面を参照して本発明の実施の形態について詳細に説明する。
〔本形態の概要〕
例えばＤＶＤのように、デジタル映像及び音声データ等が、相変化型の光ディスク、光磁気ディスク、読み出し専用の光ディスク等のディスク状記録媒体に記録されたメディアにおいては、ＣＤで行なわれているようなトラック・ジャンプによる頭出し、あるいは、ＶＩＤＥＯＣＤのＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬ（ＰＢＣ）機能と同じくユーザーが指定した特定の場所から再生を開始するランダムアクセス機能がある。本形態では、例えばＤＶＤに記録されるＭＰＥＧ２の映像信号の圧縮エンコードにおいて、トラック・ジャンプやＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬなどに代表される特定の場所から再生を開始する機能（頭出し再生と呼ぶことにする）を有する再生機のために、エンコードを行なう際に、
１．トラック（もしくはＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬの１再生単位であるＰＬＡＹＩＴＥＭ）の切れ目などの頭出し点では、必ずＧＯＰ（ＧＲＯＵＰＯＦＰＩＣＴＵＲＥ）の区切りを入れる。
２．かつ、「カットのあるシーケンス」をトラック（もしくはＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬの１再生単位であるＰＬＡＹＩＴＥＭ）の切れ目などの頭出し点に該当するＧＯＰに適用し、その点の前後のピクチャにおいてお互いに前方または後方予測を必要としないコーディングをする。そして、これによって、頭出し点の画像が必ずイントラピクチャになるようにすることで、どのような再生機で解釈したとしても、必ず頭出し点のピクチャが一意に決まるようにし、その結果、安定した頭出し再生を提供できる。 Embodiments of the present invention will be described below in detail with reference to the drawings.
[Outline of this embodiment]
For example, in a medium in which digital video and audio data are recorded on a disk-shaped recording medium such as a phase change type optical disk, a magneto-optical disk, and a read-only optical disk such as a DVD, it is performed on a CD. There is a random access function for starting playback from a specific place designated by the user, like the PLAYBACK CONTROL (PBC) function of VIDEO CD, or cuing by track jump. In this embodiment, for example, in compression encoding of an MPEG2 video signal recorded on a DVD, a function of starting playback from a specific place represented by track jump, PLAYBACKCONTROL, etc. (hereinafter referred to as cue playback) is provided. When encoding for a player,
1. A GOP (GROUPOF PICTURE) delimiter is always inserted at the cue point of a break of a track (or PLAY ITEM, which is one playback unit of PLAYBACK CONTROL).
2. In addition, the “sequence with cut” is applied to a GOP corresponding to a cue point such as a break of a track (or PLAY ITEM, which is one playback unit of PLAYBACKCONTROL), and predictions are made forward or backward with respect to each other in pictures before and after that point. Do not need coding. This ensures that the image at the cue point is always an intra picture, so that the picture at the cue point is always uniquely determined regardless of the playback device. Cue playback can be provided.

ここで、「区切り」となり得るのは、論理的なフォーマット上における単位間、即ち、論理セクタ、論理トラック、論理ブロック等の境界や、物理セクタの境界等である。論理的な単位の境界としては、例えばＣＤにおけるトラックやビデオディスクにおけるチャプタ等のような、制作者の意図した単位間の境界がある。また、物理的な単位の境界としては、例えば光ディスク上における物理セクタ等のように、記録媒体そのものの構造であるとことの、光学ピックアップのアクセス単位間の境界がある。 Here, “separation” can be a unit between logical units, that is, a boundary between logical sectors, logical tracks, logical blocks, or a physical sector. Examples of logical unit boundaries include boundaries between units intended by the producer, such as tracks on a CD and chapters on a video disc. In addition, as a boundary between physical units, there is a boundary between access units of an optical pickup, such as a physical sector on an optical disk, which is the structure of a recording medium itself.

即ち、本形態においては、「カットのあるシーケンス」を応用し、どのような再生機においても、頭出し点の表示画像が一意に決定できるようなエンコードを行なうことにより、安定した頭出し機能の提供を保証するものである。以下、本形態について詳細に説明する。 In other words, in this embodiment, by applying “a sequence with a cut” and performing encoding so that the display image of the cue point can be uniquely determined in any player, a stable cue function can be achieved. It guarantees the provision. Hereinafter, this embodiment will be described in detail.

まず、本形態のベースとなる圧縮エンコードの方法について図１を参照して説明する。図１Ａには通常シーケンスが示され、図１Ｂにはカットシーケンスが示され、図１Ｃにはエンコーダ出力が示され、図１Ｄにはデコーダ入力が示され、図１Ｅには再生された画像が示されている。 First, a compression encoding method as a base of this embodiment will be described with reference to FIG. 1A shows the normal sequence, FIG. 1B shows the cut sequence, FIG. 1C shows the encoder output, FIG. 1D shows the decoder input, and FIG. 1E shows the reproduced image. Has been.

図１Ａに示されるように、通常のシーケンスにおいては、例えば、Ｐ１ピクチャの後ろにはＢ０ピクチャ、Ｂ１ピクチャ、Ｉ２ピクチャと続く。しかしながら、本形態においては、それとは別に特別なシーケンスが用いられる。特別なシーケンスでは、図１Ｂに示されるように、Ｐ１ピクチャの後ろがＩ０ピクチャとされ、そこからＢ１ピクチャ、Ｂ２ピクチャという具合に通常のシーケンスと同じように続けられていく。尚、本形態では、このような形のシーケンスを「カットシーケンス」、または、「カットのあるシーケンス」と呼ぶことにする。 As shown in FIG. 1A, in a normal sequence, for example, a P0 picture is followed by a B0 picture, a B1 picture, and an I2 picture. However, in this embodiment, a special sequence is used separately. In the special sequence, as shown in FIG. 1B, the P1 picture is followed by the I0 picture, and from there, the B1 picture and the B2 picture are continued in the same manner as the normal sequence. In this embodiment, such a sequence is called a “cut sequence” or a “cut sequence”.

このように、通常では、図１Ａに示されるように、「・・・ＢＢＰＢＢＩＢＢＰ・・・」となるシーケンスは、図１Ｂに示されるように、カットのあるシーケンスでは「・・・ＢＢＰＩＢＢＰＢＢ・・・」となり、丁度、図１Ｂにおいて実線の矢印で示されるカットのところでは、Ｂピクチャが抜けたように見える。このカットを置くことにより、図１Ｂから明らかなように、カットの前後のＧＯＰＮ−１とＧＯＰＮの間には、元々予測が存在しておらず、従ってＧＯＰＮ−１とＧＯＰＮは夫々完全に独立している。 Thus, normally, as shown in FIG. 1A, the sequence of “... BBPBBIBBP...” Is “... BBPIBBPBB. Just as in the cut indicated by the solid line arrow in FIG. 1B, it appears that the B picture is missing. By placing this cut, as is apparent from FIG. 1B, there is no original prediction between GOPN-1 and GOP N before and after the cut, so GOP N-1 and GOP N are completely Independent.

＊頭出し点はカットのあるシーケンスにする
さて、カットのあるシーケンスで構成されたＧＯＰの先頭が、トラックの先頭などの頭出し点であった場合についてみてみる。図１Ｄに示されるデコーダ入力で見てみると、カットのあるＧＯＰの先頭は、Ｉ０ピクチャ（ＧＯＰ内で、最も最初に表示されるべき画像がイントラピクチャ）であるから、デコードの順番で見ても、図１Ｅに示される再生された画像における順番で見ても、どちらも先頭の画像となることは明らかである。 * Make the cue point a sequence with a cut. Now, let's look at the case where the beginning of a GOP composed of a sequence with a cut is a cue point such as the beginning of a track. As seen from the decoder input shown in FIG. 1D, the head of the GOP with the cut is the I0 picture (the first image to be displayed in the GOP is an intra picture). However, it is clear that both images are the first images when viewed in the order of the reproduced images shown in FIG. 1E.

ここが、本形態におけるポイントの１つである。トラックやＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬ機能など、再生時に特定の場所を頭出し点として指定したい場合には、１．頭出し点に相当する画像が、表示順に見てＧＯＰの先頭となるようにＧＯＰの区切りを決める。
２．かつ、そのＧＯＰは、カットのあるシーケンスにより構成する。
このルールで圧縮エンコードされた画像データにおいては、図１Ｄにおいて実線の矢印で示される頭出し点から再生を開始しようとする場合、「最初にデコードする画像＝最初に表示する画像」の関係があるので、どのような再生機であっても、必ずＩ０ピクチャからの再生とならざるを得ない。よって、頭出し点は、一意に決定できるのである。さらに、頭出し点の画像に付随するタイムコードは、そのままＧＯＰに付随するタイムコードとして適用できるので、正しい位置に頭出し点が来るようにエンコード出来たかは、エンコード後のタイムコードを見ればよいので、簡単に判定が可能である。 This is one of the points in this embodiment. If you want to specify a specific location as the cue point during playback, such as a track or PLAYBACKCONTROL function, The GOP delimiter is determined so that the image corresponding to the cue point becomes the head of the GOP in the display order.
2. The GOP is composed of a cut sequence.
In the image data compression-encoded according to this rule, when reproduction is started from the cue point indicated by the solid line arrow in FIG. 1D, there is a relationship of “first decoded image = first displayed image”. Therefore, any playback device must be played back from the I0 picture. Therefore, the cue point can be uniquely determined. Furthermore, since the time code associated with the cue point image can be applied as it is as the time code associated with the GOP, it can be determined whether the cue point can be encoded at the correct position by looking at the encoded time code. Therefore, it can be easily determined.

即ち、頭出し点をタイムコードと頭出し点が記録されたディスク上の記録アドレスの対で表し、それをＴＯＣに登録するという従来の「頭出し点検索システム」から見ても、検索される側のデータのアドレスに位置する情報を読み出せば、それに付随すべきタイムコードが直接的に確認できるのである。 In other words, the cue point is represented by a pair of the time code and the recording address on the disc where the cue point is recorded, and the search is performed even when viewed from the conventional “cue point search system” in which it is registered in the TOC. If the information located at the address of the data on the side is read, the time code to be attached to it can be directly confirmed.

〔構成及びその動作の概要〕
図２に、一実施の形態としてのオーサリングシステムの構成例を示す。まず、圧縮エンコードされる対象のマスターテープ（マスターとして用いられるビデオ・テープ・カセット）１は、デジタルＶＴＲ２で再生される。マスターテープ１に付随しているところの、各頭出し点がタイムコードで指定されたデータが書き込まれたフレキシブル・ディスク６は、頭出し点読み取り部７にセットされる。読み取られた頭出し点の情報は、さらに、頭出し点修正装置８により、頭出し点のタイムコード値の微調整や追加または削除が可能となっている。以上の手段で決定されたマスターテープ１に対する頭出し点のタイムコードの情報は、カット判定回路９に送られる。また、デジタルＶＴＲ２で再生されるマスターテープ１には、あらかじめタイムコードが記録され、そのデジタルＶＴＲ２からのタイムコードデータは、カット判定回路９と画像タイプ制御部５に夫々供給されている。 [Overview of configuration and operation]
FIG. 2 shows a configuration example of an authoring system as one embodiment. First, a master tape (video tape cassette used as a master) 1 to be compression-encoded is reproduced by a digital VTR 2. The flexible disk 6 on which data with each cue point designated by a time code written to the master tape 1 is set in a cue point reading unit 7. The read cue point information can be further finely adjusted, added or deleted by the cue point correcting device 8. Information on the time code of the cue point for the master tape 1 determined by the above means is sent to the cut determination circuit 9. A time code is recorded in advance on the master tape 1 reproduced by the digital VTR 2, and the time code data from the digital VTR 2 is supplied to the cut determination circuit 9 and the image type control unit 5, respectively.

ここで、上記フレキシブル・ディスク６には、図３Ａに示すように、カット番号データと、このカット番号データに対応するタイムコードデータとからなる、ＥＤＬデータＥＤＬｄ（エディット・リスト）が記録されている。 Here, as shown in FIG. 3A, EDL data EDLd (edit list) composed of cut number data and time code data corresponding to the cut number data is recorded on the flexible disk 6. .

カット判定回路９では、入力された頭出し点のタイムコード情報と、ＶＴＲ２からのタイムコードデータとを比較しつつ、どのような圧縮シーケンス（Ｉピクチャ、Ｐピクチャ、Ｂピクチャの並び方）を組めばよいか判定を行ない、入力されるタイムコードの一つ一つのフレーム番号データとピクチャのシーケンスの対応を準備しておく。例えば、頭出し点のない付近でのシーケンスでは、通常の「・・・ＢＢＰＢＢＩＢＢＰ・・・」とし、頭出し点の付近では「カットのあるシーケンス」、即ち、「・・・ＢＢＰＩＢＢＰＢＢ・・・」と言う具合にあらかじめピクチャのタイプのエンコードされる順番を決定するわけである。 The cut determination circuit 9 compares the input time code information of the cue point with the time code data from the VTR 2 and what compression sequence (how to arrange the I picture, P picture, and B picture) is assembled. It is determined whether it is okay, and a correspondence between each frame number data of the input time code and the picture sequence is prepared. For example, in a sequence in the vicinity where there is no cue point, the normal "... BBPBBIBBP ..." is used, and in the vicinity of the cue point, a "sequence with a cut", that is, "... BBPIBBPBB ..." In other words, the order in which picture types are encoded is determined in advance.

マスターテープ１の再生が開始されると、カット判定回路９は、入力されてくるタイムコードデータと、あらかじめ求めておいたシーケンスとの対応をとりながら、そのタイムコードに相当する画像をどのピクチャのタイプで圧縮すればよいかをエンコードシーケンス指示データとして画像タイプ制御部５に送る。一方、デジタルＶＴＲ２で再生された映像信号は、画像遅延装置３に送られる。遅延量は操作部４により設定され、それと同じ情報が画像タイプ制御部５にも供給される（遅延量については、後述する）。遅延された映像信号は、画像タイプ制御部５に供給される。 When the reproduction of the master tape 1 is started, the cut determination circuit 9 takes an image corresponding to the time code as to which picture the image corresponding to the time code is matched with the input time code data and the sequence obtained in advance. Whether the image should be compressed by type is sent to the image type control unit 5 as encoded sequence instruction data. On the other hand, the video signal reproduced by the digital VTR 2 is sent to the image delay device 3. The delay amount is set by the operation unit 4, and the same information is supplied to the image type control unit 5 (the delay amount will be described later). The delayed video signal is supplied to the image type controller 5.

画像タイプ制御部５は、夫々のピクチャタイプに応じて生成された画像の２次元画像情報と、その処理時間に対応して遅延されたタイムコードを、ＤＣＴ（ＤｉｓｃｒｅｔｅＣｏｓｉｇｎＴｒａｎｓｆｏｒｍ）１４に送り出す。ＤＣＴ１０では、画像情報を２次元の画素から周波数成分の情報に変換し、各帯域に分かれた画像周波数成分情報を出力する。また、画像タイプ制御部５は、どの時間の画像がどの程度の情報量を持っていたのかを保存するために、同時に、その処理遅延時間に対応して遅延させたタイムコードデータも出力する。当該タイムコードデータは、量子化器１１に供給される。尚、量子化器１１の出力は、逆量子化器１２と逆ＤＣＴ１３を通じて、量子化の誤差を含んだ２次元画素情報に逆変換され、画像タイプ制御部５にフィードバックされる。これにより、画像タイプ制御部５は、ＢないしＰピクチャのための差分信号を生成する処理、いわゆる予測処理を行うことができる。尚、画像タイプ制御部５では、動き予測やその補償をも行っているが、量子化誤差を含めたフィードバック情報は、これらの処理を適応化するのに重要な情報となる。 The image type control unit 5 sends two-dimensional image information of an image generated according to each picture type and a time code delayed in accordance with the processing time to a DCT (Discrete Coordinate Transform) 14. The DCT 10 converts image information from two-dimensional pixels into frequency component information, and outputs image frequency component information divided into each band. The image type control unit 5 also outputs time code data delayed in accordance with the processing delay time at the same time in order to save what amount of information the image of which time has. The time code data is supplied to the quantizer 11. The output of the quantizer 11 is inversely converted into two-dimensional pixel information including a quantization error through the inverse quantizer 12 and the inverse DCT 13 and fed back to the image type control unit 5. Thereby, the image type control part 5 can perform the process which produces | generates the difference signal for B thru | or P picture, what is called a prediction process. The image type controller 5 also performs motion prediction and compensation, but feedback information including quantization error is important information for adapting these processes.

量子化器１１では、特に重み付け等は行われずに、各帯域毎に均等に量子化するような仮量子化レベルで画像の周波数帯域情報が量子化される。量子化されたビットは、量子化に要した時間に対応するだけ遅延されたタイムコードデータと共に出力される。この結果、時間毎の画像情報量の変化が得られる。以下、この時間毎の画像情報量の変化を示すデータを、ディフィカルティ・データと称する。 The quantizer 11 quantizes the frequency band information of the image at a provisional quantization level such that the quantization is performed evenly for each band without particular weighting. The quantized bits are output together with time code data delayed by a time corresponding to the time required for quantization. As a result, a change in the amount of image information for each time can be obtained. Hereinafter, the data indicating the change in the amount of image information for each time is referred to as “difficulty data”.

さて、量子化レベル制御回路１５には、例えば手入力等により、ディフィカルティ・データが供給されており、画像情報量の時間変化を示す情報が与えられている。量子化レベル制御回路１５は、入力されているタイムコードデータと、画像情報量の時間変化を示す情報から、どのような重み付け制御が必要かを決定し、量子化レベル重み付け制御信号を、量子化器１１に供給する。通常、とくに画質に問題がない場合は、量子化レベル修正装置１６は働かない。 Now, to the quantization level control circuit 15, for example, manual data is supplied, and for example, information indicating a temporal change in the amount of image information is given. The quantization level control circuit 15 determines what kind of weighting control is necessary from the input time code data and information indicating the time change of the image information amount, and quantizes the quantization level weighting control signal. To the vessel 11. Usually, when there is no problem in image quality, the quantization level correcting device 16 does not work.

量子化器１１では、上記量子化レベル重み付け制御信号に従って、各帯域毎に量子化を行い、情報量の削減を行う。量子化されたビットは、量子化に要した時間に対応するだけ遅延されたタイムコードデータと共に、エントロピー符号化器１７に供給される。エントロピー符号化器１７では、更に、ビット量の圧縮を行うために可逆圧縮手法であるところのエントロピー符号化が行われる。そして、この前段まで固定長ビット列であったものを可変長ビット列に変換する。エントロピー符号化器１７からは、可変長符号に変換された最終的な画像符号と、その画像を表示すべき時刻を示した時間情報が、出力レート決定装置１８に供給される。 The quantizer 11 performs quantization for each band in accordance with the quantization level weighting control signal, thereby reducing the amount of information. The quantized bits are supplied to the entropy encoder 17 together with time code data delayed by a time corresponding to the time required for quantization. The entropy encoder 17 further performs entropy encoding, which is a lossless compression method, in order to compress the bit amount. Then, the fixed-length bit string up to the previous stage is converted into a variable-length bit string. From the entropy encoder 17, the final image code converted into the variable length code and the time information indicating the time at which the image is to be displayed are supplied to the output rate determination device 18.

出力レート決定装置１８は、上記時間情報に基づき、ＰＥＳ（ＰａｃｋｅｔｉｚｅｄＥｌｅｍｅｎｔａｒｙＳｔｒｅａｍ）若しくはＰＡＣＫ等のヘッダ情報を求め、入力された画像符号をパケット化し、これを最終符号出力として、ストリーム出力装置１９に供給する。さて、一方では、出力レート決定装置１８は、エンコードを始める際に、エンコーダ制御部２３から、通常エンコードの指示が与えられている。出力レート決定装置１８は、後に画質改善のための編集を可能とするために、最終符号出力の元になった情報であるところの、上記画像符号と時間情報を、ストレージ装置２２に、通常エンコードの結果によるというマーキング情報と共に保存する。 Based on the time information, the output rate determination device 18 obtains header information such as PES (Packetized Elementary Stream) or PACK, packetizes the input image code, and supplies this to the stream output device 19 as the final code output. To do. On the other hand, the output rate determination device 18 is given a normal encoding instruction from the encoder control unit 23 when starting the encoding. The output rate determination device 18 normally encodes the image code and the time information, which are the information from which the final code is output, into the storage device 22 in order to enable editing for improving the image quality later. It is saved together with the marking information that depends on the result of.

さて、最終符号出力は、モニタ用デコーダ２１にも送られている。オペレータは、このモニタによって画質を監視することができる。もしも画質に問題があるならば、その部分に相当する画像を表示すべき時刻を示した時間情報を、量子化レベル修正装置１６に記録する必要がある。通常エンコードが一通り終了した時点で、画質に問題のある場所がどの「カット」に相当するかは、量子化レベル修正装置１６に記録されていた時間情報とカット判定回路９に保存されているタイムコードデータとカットの関係とから、容易に求めることができる。 The final code output is also sent to the monitor decoder 21. The operator can monitor the image quality using this monitor. If there is a problem with the image quality, it is necessary to record in the quantization level correction device 16 time information indicating the time at which the image corresponding to that portion should be displayed. When the normal encoding is completed, the “cut” corresponding to the place having the image quality problem is stored in the time information recorded in the quantization level correcting device 16 and in the cut determination circuit 9. It can be easily obtained from the relationship between the time code data and the cut.

ここで、画質が問題となる部分に相当する「カット」をエンコーダ制御部２３から再度再生し、モニタ用デコーダ２１を監視しながら、量子化レベル修正装置１６において量子化レベルの重み付けを調整し、画質に問題がない量子化レベルを求める。求めた量子化レベルの重み付け制御値は、その「カット」に相当するタイムコードデータと共に量子化レベル制御回路１５に与えておく必要がある。勿論、画質に問題のある部分が複数存在するときには、上記処理を繰り返す必要がある。 Here, the “cut” corresponding to the part where the image quality is a problem is reproduced again from the encoder control unit 23, and the weight of the quantization level is adjusted in the quantization level correction device 16 while monitoring the monitor decoder 21. A quantization level that does not cause a problem in image quality is obtained. The obtained quantization level weighting control value needs to be given to the quantization level control circuit 15 together with the time code data corresponding to the “cut”. Of course, when there are a plurality of portions having image quality problems, it is necessary to repeat the above processing.

量子化レベル制御回路１５においては、もともとディフィカルティ・データによって総発生ビット量が既知である。従って、量子化レベル修正装置１６により、量子化レベルが修正された結果、新しい総発生ビット量を認識することも可能である。もし、総発生ビット量が予め定められてメディアの総容量を越えてしまうようなことがあれば、今度は、逆に、ビット量を減らしても問題になりにくそうな「カット」を指定し、その部分を上述と同様に再生しながら、量子化レベル修正装置１６によって情報量が抑制されるように量子化レベルを調整する。 In the quantization level control circuit 15, the total amount of generated bits is originally known from the difference data. Accordingly, it is possible to recognize a new total generated bit amount as a result of the quantization level being corrected by the quantization level correction device 16. If the total generated bit amount exceeds the total capacity of the media in advance, specify a “cut” that is unlikely to cause a problem even if the bit amount is reduced. Then, while reproducing the portion in the same manner as described above, the quantization level is adjusted by the quantization level correction device 16 so that the information amount is suppressed.

このようにして、必要な部分の量子化レベルを調整し直すと、エンコーダ制御部２３から、対象となる「カット」のみをエンコードさせるように、デジタルＶＴＲ２を順次再生させる。エンコードは、通常エンコードと同様に進行する。このとき、エンコーダ制御部２３は、エンコードの結果が再エンコードによるものであることを示すため、再エンコードの指定を、出力レート決定装置１８とストレージ装置２２に対し、夫々与える。これにより、ストレージ装置２２には、改善された画質を持つ画像符号及び時間情報が、再エンコードのマーキング情報と共に記録される。 When the quantization level of the necessary part is adjusted again in this way, the digital VTR 2 is sequentially reproduced from the encoder control unit 23 so as to encode only the “cut” to be processed. Encoding proceeds in the same way as normal encoding. At this time, the encoder control unit 23 gives the designation of re-encoding to the output rate determination device 18 and the storage device 22 in order to indicate that the encoding result is due to re-encoding. As a result, the image code and time information having improved image quality are recorded in the storage device 22 together with the re-encoding marking information.

全ての再エンコードが終了すると、ストレージ装置２２には、通常若しくは再エンコードによる画像符号及び時間情報の全てが記録されている。出力レート決定装置１８は、上記画像符号及び時間情報に基いて、プログラムの先頭から順次パケット化し、最終符号出力としてストリーム出力装置１９に供給する。ストリーム出力装置１９は、最終符号出力を、出力端子２０を介して出力する。 When all re-encoding is completed, the storage device 22 stores all of the image code and time information by normal or re-encoding. Based on the image code and time information, the output rate determination device 18 sequentially packetizes the program from the beginning and supplies it to the stream output device 19 as the final code output. The stream output device 19 outputs the final code output via the output terminal 20.

次に、上記オーサリングシステムが決定するエンコードシーケンスについて、図４を参照して説明する。 Next, an encoding sequence determined by the authoring system will be described with reference to FIG.

ＭＰＥＧにおいては、ＧＯＰ内のピクチャの枚数をＮと呼び、ＩまたはＰピクチャの周期（Ｉピクチャから次のＰピクチャまでのピクチャの枚数）をＭと呼んでいる。ＭＰＥＧにおいては、このＮとＭに制約はないのであるが、ここでは説明のため、通常シーケンスをＮ＝９、Ｍ＝３とする。図４においては、四角囲みされたＩピクチャは、そこがカットのあるシーケンスの先頭であることを示し、丸囲みされたＩピクチャでその他の一般のＧＯＰの先頭を示している。なお、図４は、あくまでも原画像におけるピクチャのエンコード・シーケンスの順番を示しており（図中、Ｉ、Ｂ、Ｐの添え数字が原画上で「何番」目のピクチャであるか表している）、ＧＯＰの概念をもつエンコーダ出力でのピクチャの順番ではないことに注意されたい。エンコーダ出力においては、既に説明したように、Ｂピクチャの順番が入れ替わり、ＧＯＰ単位で見た場合はあくまでもＩピクチャが先頭となる。 In MPEG, the number of pictures in a GOP is called N, and the period of I or P picture (number of pictures from I picture to the next P picture) is called M. In MPEG, there are no restrictions on N and M, but for the sake of explanation, the normal sequence is N = 9 and M = 3. In FIG. 4, an I picture surrounded by a square indicates that it is the head of a cut sequence, and a circled I picture indicates the head of another general GOP. Note that FIG. 4 only shows the order of the encoding sequence of the picture in the original image (in the figure, the subscripts I, B, and P represent the “number” picture in the original picture). Note that this is not the picture order at the encoder output with the concept of GOP. In the encoder output, as described above, the order of the B pictures is switched, and when viewed in GOP units, the I picture is the first.

Ｎ＝９の場合、カットシーケンスは、そのカットの位置により、図４Ａ〜Ｊに夫々示すように、カットシーケンス０からカットシーケンス８迄、合計９種類の場合が発生する。しかし、いずれ場合であっても、カットのあるシーケンスの先頭の（四角囲みで示した）Ｉピクチャの直前のピクチャタイプは必ずＰピクチャであることが大きな特長になっている。 In the case of N = 9, there are a total of nine types of cut sequences from cut sequence 0 to cut sequence 8, as shown in FIGS. However, in any case, the main feature is that the picture type immediately before the I picture (indicated by a square box) in the cut sequence is always a P picture.

〔各シーケンスについて（図４中の波線については後述）〕
カットシーケンス０は、図４Ｂに示されるように、Ｉピクチャの位置が、通常シーケンスに比べて前に２枚ずれて「０番」のピクチャがＩピクチャとなる。しかし、直前のピクチャはＰピクチャであるので、ＧＯＰはＩ０ピクチャから始まり、シーケンスの都合上Ｐ９までのＮ＝１０となる。その後は通常のシーケンスが続く。
カットシーケンス１は、図４Ｃに示されるように、Ｉピクチャの位置が、通常シーケンスに比べて前に１枚ずれて「１番」のピクチャがＩピクチャとなり、かつ、直前の「０番」のピクチャは、通常ではＢピクチャであるが、Ｐピクチャとされる。ＧＯＰはＩ１ピクチャから始まり、シーケンスの都合上Ｐ１０までのＮ＝１０となる。その後は通常のシーケンスが続く。なお、直前のＧＯＰは、Ｐ０ピクチャまでを含むので、それに限りＮ＝１０となる。 [Each sequence (the wavy line in FIG. 4 will be described later)]
In the cut sequence 0, as shown in FIG. 4B, the position of the I picture is shifted two sheets ahead of the normal sequence, and the “number 0” picture becomes the I picture. However, since the immediately preceding picture is a P picture, the GOP starts from the I0 picture and N = 10 up to P9 for convenience of sequence. After that, the normal sequence continues.
In the cut sequence 1, as shown in FIG. 4C, the position of the I picture is shifted by one sheet compared to the normal sequence, and the “No. 1” picture becomes the I picture, and the “No. The picture is normally a B picture, but is a P picture. The GOP starts from the I1 picture, and N = 10 up to P10 for convenience of sequence. After that, the normal sequence continues. Since the immediately preceding GOP includes up to the P0 picture, N = 10 only in that case.

カットシーケンス２は、図４Ｄに示されるように、Ｉピクチャの位置が、通常シーケンスと同じ位置で「２番」のピクチャがＩピクチャとなるが、直前の「０番」と「１番」の２枚のピクチャは、通常ではＢピクチャであるが、２枚ともＰピクチャとされる。ＧＯＰはＩ２ピクチャから始まり、シーケンスの都合上Ｐ１１までのＮ＝１０となる。その後は通常のシーケンスが続く。なお、直前のＧＯＰは、Ｐ０とＰ１ピクチャまでを含むので、それに限りＮ＝１１となる。
カットシーケンス３は、図４Ｅに示されるように、Ｉピクチャの位置が、通常シーケンスに比べて後ろに１枚ずれて「３番」のピクチャがＩピクチャとされる。ただし、「２番」のピクチャまでは通常のシーケンスが延ばされる。ＧＯＰはＩ３ピクチャから始まり、シーケンスの都合上Ｐ１２までのＮ＝１０となる。その後は通常のシーケンスが続く。なお、直前のＧＯＰは、Ｂ０とＢ１とＰ２ピクチャまでを含むので、それに限りＮ＝１２となる。 In the cut sequence 2, as shown in FIG. 4D, the position of the I picture is the same position as the normal sequence, and the “No. 2” picture becomes the I picture, but the immediately preceding “No. 0” and “No. 1” The two pictures are usually B pictures, but both pictures are P pictures. The GOP starts from the I2 picture, and N = 10 up to P11 for convenience of sequence. After that, the normal sequence continues. Since the immediately preceding GOP includes up to P0 and P1 pictures, N = 11 only in that case.
In the cut sequence 3, as shown in FIG. 4E, the position of the I picture is shifted backward by one as compared with the normal sequence, and the “No. 3” picture is the I picture. However, the normal sequence is extended to the “No. 2” picture. The GOP starts from the I3 picture, and N = 10 up to P12 for convenience of sequence. After that, the normal sequence continues. Note that the immediately preceding GOP includes up to B0, B1, and P2 pictures, so that N = 12.

カットシーケンス４は、図４Ｆに示されるように、Ｉピクチャの位置が、通常シーケンスに比べて後ろに２枚ずれて「４番」のピクチャがＩピクチャとされる。ただし、「２番」のピクチャまでは通常のシーケンスが延ばされ、かつ、Ｉ４ピクチャの直前の「３番」のピクチャはＰピクチャとされる。ＧＯＰはＩ４ピクチャから始まり、シーケンスの都合上Ｐ１３までのＮ＝１０となる。その後は通常のシーケンスが続く。なお、直前のＧＯＰは、Ｂ０、Ｂ１、Ｐ２、Ｐ３ピクチャまでを含むので、それに限りＮ＝１３となる。
カットシーケンス５は、図４Ｇに示されるように、Ｉピクチャの位置が、通常シーケンスに比べて後ろに３枚ずれて「５番」のピクチャがＩピクチャとされる。ただし、「２番」のピクチャまでは通常のシーケンスが延ばされ、かつ、Ｉ５ピクチャの直前の２枚のうち「３番」はＢピクチャ、「４番」のピクチャはＰピクチャとされる。ＧＯＰはＩ５ピクチャから始まり、シーケンスの都合上Ｐ１４までのＮ＝１０となる。その後は通常のシーケンスが続く。なお、直前のＧＯＰは、Ｂ０、Ｂ１、Ｐ２、Ｂ３、Ｐ４ピクチャまでを含むので、それに限りＮ＝１３となる。 In the cut sequence 4, as shown in FIG. 4F, the position of the I picture is shifted backward by two compared to the normal sequence, and the “4th” picture is the I picture. However, the normal sequence is extended to the “No. 2” picture, and the “No. 3” picture immediately before the I4 picture is a P picture. GOP starts from an I4 picture, and N = 10 up to P13 for convenience of sequence. After that, the normal sequence continues. Note that since the immediately preceding GOP includes up to B0, B1, P2, and P3 pictures, N = 13 only in that case.
In the cut sequence 5, as shown in FIG. 4G, the position of the I picture is shifted backward by three compared to the normal sequence, and the “No. 5” picture is the I picture. However, the normal sequence is extended up to the “No. 2” picture, and “No. 3” is the B picture and the “No. 4” picture is the P picture among the two pictures immediately before the I5 picture. GOP starts from an I5 picture, and N = 10 up to P14 for convenience of sequence. After that, the normal sequence continues. Since the immediately preceding GOP includes up to B0, B1, P2, B3, and P4 pictures, N = 13 only in that case.

カットシーケンス６は、図４Ｈに示されるように、今までのようにＩピクチャの位置を後ろにずらすと直前のＧＯＰの長さが長くなり過ぎるので、別のルールとされ、短めの通常シーケンスが置かれ、その後に改めてカットのあるシーケンスが置かれる。Ｂ０ピクチャから始まる通常のシーケンスは、Ｐ５ピクチャまでのＮ＝６で打ち切られ、その後の「６番」のピクチャがカットのあるシーケンスのＩピクチャとされる。ＧＯＰはＩ６ピクチャから始まり、シーケンスの都合上Ｐ１５までのＮ＝１０となる。その後は通常のシーケンスが続く。
カットシーケンス７は、図４Ｉに示されるように、カットシーケンス６と同様に、単純にＩピクチャの位置を後ろにずらすと直前のＧＯＰの長さが長くなり過ぎるので、別のルールとされ、短めの通常シーケンスが置かれ、その後に改めてカットのあるシーケンスが置かれる。Ｂ０ピクチャから始まる通常のシーケンスがＰ５ピクチャまで続けられ、かつ、その後の「６番」にＰピクチャが続けられ、ここでＮ＝７でＧＯＰが打ち切られる。その直後の「７番」のピクチャが、カットのあるシーケンスのＩピクチャとされる。ＧＯＰはＩ７ピクチャから始まり、シーケンスの都合上Ｐ１６までのＮ＝１０となる。その後は通常のシーケンスが続く。 As shown in FIG. 4H, the cut sequence 6 is a different rule because the previous GOP length becomes too long if the position of the I picture is shifted backward as in the past. Placed, followed by a sequence with a new cut. The normal sequence starting from the B0 picture is terminated at N = 6 up to the P5 picture, and the subsequent “No. 6” picture is the I picture of the sequence with the cut. GOP starts from an I6 picture, and N = 10 up to P15 due to the sequence. After that, the normal sequence continues.
As shown in FIG. 4I, the cut sequence 7 is a different rule because the length of the previous GOP becomes too long if the position of the I picture is simply shifted backward, just like the cut sequence 6. The normal sequence is placed, followed by a sequence with a new cut. The normal sequence starting from the B0 picture is continued until the P5 picture, and the P picture is continued to “6th” thereafter, where the GOP is aborted at N = 7. The “No. 7” picture immediately after that is the I picture of the cut sequence. The GOP starts from the I7 picture, and N = 10 up to P16 for convenience of sequence. After that, the normal sequence continues.

カットシーケンス８は、図４Ｊに示されるように、カットシーケンス７と同様に、単純にＩピクチャの位置を後ろにずらすと直前のＧＯＰの長さが長くなり過ぎるので、別のルールとされ、短めの通常シーケンスが置かれ、その後に改めてカットのあるシーケンスが置かれる。Ｂ０ピクチャから始まる通常のシーケンスがＰ５ピクチャまで続けられ、かつ、その後の「６番」がＢピクチャとされ、「７番」にＰピクチャが続けられ、ここでＮ＝８でＧＯＰが打ち切られる。その直後の「８番」のピクチャが、カットのあるシーケンスのＩピクチャとされる。ＧＯＰはＩ８ピクチャから始まり、シーケンスの都合上Ｐ１７までのＮ＝１０となる。その後は通常のシーケンスが続く。 As shown in FIG. 4J, the cut sequence 8 is a different rule because the length of the previous GOP becomes too long if the position of the I picture is simply shifted backward, just like the cut sequence 7. The normal sequence is placed, followed by a sequence with a new cut. The normal sequence starting from the B0 picture is continued until the P5 picture, and the subsequent “No. 6” is set as the B picture, and the “No. 7” is followed by the P picture. Here, the GOP is terminated at N = 8. Immediately after that, the “No. 8” picture is the I picture of the cut sequence. GOP starts from an I8 picture, and N = 10 up to P17 for convenience of sequence. After that, the normal sequence continues.

以上説明した通常シーケンスデータＳｄ並びにカットシーケンス０〜８までの合計１０パターンのシーケンスデータＳｄは、図２に示したカット判定回路９の内部ＲＯＭ９ａに記憶されている。そして、カット判定回路９は、処理の開始時点においては、通常のシーケンスデータＳｄを、上記内部ＲＯＭ９ａから内部ＲＡＭ９ｂにロードし、以降、処理中のＧＯＰの次のＧＯＰの構成画像中に、頭出し点が有るか否かを判断する。 The above-described normal sequence data Sd and a total of 10 patterns of sequence data Sd from the cut sequence 0 to 8 are stored in the internal ROM 9a of the cut determination circuit 9 shown in FIG. Then, the cut determination circuit 9 loads normal sequence data Sd from the internal ROM 9a to the internal RAM 9b at the start of processing, and thereafter cues in the constituent image of the GOP next to the GOP being processed. Determine if there is a dot.

そして、カット判定回路９は、次のＧＯＰ中に頭出し点が無いと判断した場合には通常のシーケンスデータＳｄを、内部ＲＯＭ９ａから内部ＲＡＭ９ｂにロードし、当該シーケンスデータＳｄを、シーケンスリストデータＳＬｄとして次のＧＯＰの処理で用いる。そして、カット判定回路９は、次のＧＯＰ中に頭出し点が有ると判断した場合には、処理中のＧＯＰの最後の構成画像から数えて何枚目に頭出し点が有るかを検出し、その値が“１”枚目なら上記シーケンス０のデータを、“２”枚目なら上記シーケンス１のデータを、・・・・９枚目なら上記シーケンス８のデータを、内部ＲＯＭ９ａから読み出し、当該シーケンスデータＳｄを、シーケンスリストデータＳＬｄとして内部ＲＡＭ９ｂにロードし、次のＧＯＰの処理で用いる。 If the cut determination circuit 9 determines that there is no cue point in the next GOP, the cut sequence circuit 9 loads the normal sequence data Sd from the internal ROM 9a to the internal RAM 9b, and the sequence data Sd is loaded into the sequence list data SLd. Used in the next GOP processing. If the cut determination circuit 9 determines that there is a cue point in the next GOP, it detects the number of the cue point from the last component image of the GOP being processed. If the value is “1”, the sequence 0 data is read from the internal ROM 9a, the sequence 2 data is read from the internal ROM 9a. The sequence data Sd is loaded into the internal RAM 9b as sequence list data SLd and used in the next GOP processing.

図３Ｂに上記シーケンスリストデータＳＬｄの一例が示されている。この図３Ｂに示されるように、シーケンスリストデータＳＬｄは、アドレスとしての番号に対し、夫々、タイムコードデータ、ピクチャタイプデータが登録されたものである。上記シーケンスデータＳｄは、このシーケンスリストデータＳＬｄ中のピクチャタイプデータに相当する。タイムコードデータは、最も小さい値がフレームであるから、単純にこのフレームの桁のインクリメントを行うことにより、連続的にタイムコードデータを得ることができることはいうまでもない。 FIG. 3B shows an example of the sequence list data SLd. As shown in FIG. 3B, the sequence list data SLd is obtained by registering time code data and picture type data for numbers as addresses. The sequence data Sd corresponds to the picture type data in the sequence list data SLd. Since the smallest value of the time code data is a frame, it goes without saying that the time code data can be obtained continuously by simply incrementing the digit of this frame.

つまり、カット判定回路９は、タイムコードデータを例えば２ＧＯＰ分生成し、生成した２ＧＯＰ分のタイムコードデータを、シーケンスリストデータＳＬｄに登録する。そして、最初は、通常のシーケンスデータＳｄを内部ＲＯＭ９ａから読み出し、上記タイムコードデータに対応させて、シーケンスリストデータＳＬｄに登録する。そして、以降は、カット判定回路９は、ＥＤＬデータＥＤＬｄに登録されている頭出し点のタイムコードデータと同じ値のタイムコードデータが、次のＧＯＰに対応するタイムコードデータ中に含まれているか否かを検出する。含まれていない場合には、カット判定回路９は、次のＧＯＰに対応するタイムコードデータに対応させて、内部ＲＯＭ９ａから読み出した通常のシーケンスデータＳｄを登録する。含まれている場合には、カット判定回路９が、処理中のＧＯＰの最後の画像から、次のＧＯＰの頭出し点の画像までの枚数を、シーケンスリストデータＳＬｄ上で数え、その値に応じて、上述したように、シーケンス０〜８までのデータを選択し、選択したシーケンスデータＳｄを、内部ＲＯＭ９ａから内部ＲＡＭ９ｂにロードし、当該ＧＯＰに対する処理において、当該ロードデータを用いるのである。ここで「ロードデータを用いる」とは、カット判定回路９が、画像タイプ制御部５に対し、デジタルＶＴＲ２からの画像を、当該画像のタイムコードデータについてシーケンスリストデータＳＬｄに登録されているピクチャタイプデータが示すピクチャタイプで処理することを指示することを意味する。 That is, the cut determination circuit 9 generates time code data for 2 GOP, for example, and registers the generated time code data for 2 GOP in the sequence list data SLd. First, normal sequence data Sd is read from the internal ROM 9a and registered in the sequence list data SLd in correspondence with the time code data. Thereafter, the cut determination circuit 9 includes time code data having the same value as the time code data of the cue point registered in the EDL data EDLd in the time code data corresponding to the next GOP. Detect whether or not. If not included, the cut determination circuit 9 registers the normal sequence data Sd read from the internal ROM 9a in association with the time code data corresponding to the next GOP. If it is included, the cut determination circuit 9 counts the number of images from the last image of the GOP being processed to the image at the cue point of the next GOP on the sequence list data SLd, and according to the value As described above, the data from the sequence 0 to 8 is selected, the selected sequence data Sd is loaded from the internal ROM 9a to the internal RAM 9b, and the load data is used in the processing for the GOP. Here, “use load data” means that the cut determination circuit 9 sends an image from the digital VTR 2 to the image type control unit 5 in the picture type registered in the sequence list data SLd for the time code data of the image. This means that processing is instructed with the picture type indicated by the data.

ここで、再び図２について説明する。カット判定回路９においては、頭出し点読み取り部７に入力される情報と頭出し点修正装置８により追加変更された情報に基づき、どのタイムコードフレームが頭出し点であるか、すなわち、どこにカットを置けばよいかが判明している。従って、カットの近傍では、上述したカットシーケンスのどれを選べばよいかは簡単に求めることが出来る。 Here, FIG. 2 will be described again. In the cut determination circuit 9, based on the information input to the cue point reading unit 7 and the information added and changed by the cue point correction device 8, which time code frame is the cue point, that is, where to cut It has been determined whether to put. Therefore, in the vicinity of the cut, it can be easily determined which of the above-described cut sequences should be selected.

カット判定回路９は、画像タイプ制御部５に対し、通常のカットのない部分では、通常シーケンスの順番に、エンコードシーケンス指示データを単純に繰り返し与えておき、カットの近傍では、カットシーケンス０から８のどれかのうち適合するパターンデータを与える。 The cut determination circuit 9 simply and repeatedly gives the encoding sequence instruction data to the image type control unit 5 in the order of the normal sequence when there is no normal cut, and in the vicinity of the cut, the cut sequences 0 to 8 are provided. The pattern data that matches one of the above is given.

さて、画像遅延装置３の遅延量について説明する。画像タイプ制御部５では、特別な指示をしない限り、ＩまたはＰピクチャが指示されたら必ずＢピクチャが２枚続き、また、Ｂピクチャが２枚続いたらその次は必ずＩまたはＰピクチャを置く、と言うルール（すなわち、ＩまたはＰピクチャの周期Ｍが３ということ）でしかエンコードがされないものとする。但し、あらかじめシーケンスを指定すればそのとおりにエンコードするものとする。 Now, the delay amount of the image delay device 3 will be described. Unless otherwise specified, the image type control unit 5 always places two B pictures when an I or P picture is instructed, and always places an I or P picture after two B pictures. (Ie, the period M of the I or P picture is 3). However, if a sequence is specified in advance, it is encoded as it is.

つまり、この基本的な通常シーケンス以外のエンコードをさせたいときには、あらかじめ、通常とは違うことは分かっていればよいわけであるので、画像タイプ制御部５は、遅延のないタイムコードデータとエンコード指示データを必要とする。さて、必要な遅延量であるが、ここで、図４に戻って「波線」部分に注目してみる。この波線は、通常シーケンスの単純ルール、すなわち、「ＩまたはＰピクチャが指示されたら必ずＢピクチャが２枚続き、また、Ｂピクチャが２枚続いたらその次は必ずＩまたはＰピクチャを置く」というルールが適用されない部分を示している。 That is, when it is desired to perform encoding other than this basic normal sequence, it is only necessary to know in advance that it is different from the normal sequence. Therefore, the image type control unit 5 determines the time code data and the encoding instruction without delay. I need data. Now, although it is a necessary delay amount, let us return to FIG. 4 and pay attention to the “dashed line” portion. This wavy line is a simple rule of a normal sequence, that is, “If an I or P picture is instructed, two B pictures will always follow, and if two B pictures continue, an I or P picture will always be placed next”. The part where a rule is not applied is shown.

この図４から明らかなように、このカットを置くことによる影響は、ＩまたはＰピクチャの周期Ｍが３の場合は、最大でも３フレームであることが分かるので、画像遅延装置３に対する遅延量の設定値は３フレームとすればよいことが分かる。また、この設定遅延量情報は、同時に画像タイプ制御部５にも入力されている。従って、結果的に、画像タイプ制御部５は、遅延量が既知の遅延映像信号と、遅延のないタイムコードデータとエンコードシーケンス指示データから、通常とは異なるシーケンス（Ｍ＝３ではないケース）でエンコードしなくてはならないときには、具体的にどの画面から特別なエンコードをすればよいのかを、その１つ前のフレームで知ることが出来ることになる。 As can be seen from FIG. 4, it can be seen that the effect of placing this cut is 3 frames at the maximum when the period M of the I or P picture is 3, so that the amount of delay for the image delay device 3 is reduced. It can be seen that the set value may be 3 frames. The set delay amount information is also input to the image type control unit 5 at the same time. Therefore, as a result, the image type control unit 5 uses a delay video signal with a known delay amount, time code data without delay, and encode sequence instruction data in a sequence different from normal (in the case where M = 3 is not satisfied). When encoding must be performed, it is possible to know from which frame the specific encoding should be performed from the previous frame.

〔図２に示したオーサリングシステムにおける主要な動作説明〕
次に、図２に示したオーサリングシステムにおける主要な動作について、図５〜図７を参照して説明する。 [Description of main operations in the authoring system shown in FIG. 2]
Next, main operations in the authoring system shown in FIG. 2 will be described with reference to FIGS.

ステップＳ１では、図２に示した頭出し点読み取り部７が、フレキシブル・ディスク６に記録されているＥＤＬデータＥＤＬｄを読み込む。頭出し点読み取り部７の読み込んだＥＤＬデータＥＤＬｄは、カット判定回路９に供給され、このカット判定回路９のＲＡＭに保持される。
ステップＳ２では、図２に示したカット判定回路９が、ＲＡＭに保持しているＥＤＬデータＥＤＬｄ中から、最初のカット番号データについて登録されているタイムコードデータを読み取り、以降のＮフレーム分のシーケンスリストデータＳＬｄを生成する。例えば２ＧＯＰ分のタイムコードデータを生成して内部ＲＡＭ９ｂに保持すると共に、内部ＲＯＭ９ａから通常のシーケンスデータＳｄ、即ち、ピクチャタイプデータを読み出し、当該ピクチャタイプデータを、内部ＲＡＭ９ｂに保持されている先頭のＧＯＰ用のタイムコードデータに対応させて登録する。尚、次のＧＯＰ用としてシーケンスリストデータＳＬｄに登録されているタイムコードデータに対するピクチャタイプデータの登録は、先頭のＧＯＰの最後の画像について処理を行った後、上述したように、頭出し点の有る無しに応じてシーケンスデータＳｄを選択してから行われる。 In step S1, the cue point reading unit 7 shown in FIG. 2 reads the EDL data EDLd recorded on the flexible disk 6. The EDL data EDLd read by the cue point reading unit 7 is supplied to the cut determination circuit 9 and held in the RAM of the cut determination circuit 9.
In step S2, the cut determination circuit 9 shown in FIG. 2 reads the time code data registered for the first cut number data from the EDL data EDLd held in the RAM, and the sequence for the subsequent N frames. List data SLd is generated. For example, time code data for 2 GOPs is generated and held in the internal RAM 9b, and normal sequence data Sd, that is, picture type data is read from the internal ROM 9a, and the picture type data is read from the head stored in the internal RAM 9b. Registration is made corresponding to the time code data for GOP. It should be noted that the registration of picture type data for the time code data registered in the sequence list data SLd for the next GOP is performed after processing the last image of the first GOP, as described above. This is performed after the sequence data Sd is selected according to the presence or absence.

ステップＳ３では、図２に示したカット判定回路９がエンコードを開始する。
ステップＳ４では、図２に示したカット判定回路９が、デジタルＶＴＲ２からのタイムコードデータを読み取る。
ステップＳ５では、図２に示したカット判定回路９が、読み取ったタイムコードデータの値と、内部ＲＡＭ９ｂに保持されているシーケンスリストデータＳＬｄに登録されているタイムコードデータの値を比較する。
ステップＳ６では、図２に示したカット判定回路９が、読み取ったタイムコードデータの値が、現在のＧＯＰの最後に相当するタイムコードデータの値に等しいか否かを判断し、「ＹＥＳ」であればステップＳ７に移行し、「ＮＯ」であればステップＳ９に移行する。 In step S3, the cut determination circuit 9 shown in FIG. 2 starts encoding.
In step S4, the cut determination circuit 9 shown in FIG. 2 reads the time code data from the digital VTR2.
In step S5, the cut determination circuit 9 shown in FIG. 2 compares the read time code data value with the time code data value registered in the sequence list data SLd held in the internal RAM 9b.
In step S6, the cut determination circuit 9 shown in FIG. 2 determines whether or not the value of the read time code data is equal to the value of the time code data corresponding to the end of the current GOP, and “YES”. If there is, the process proceeds to step S7, and if “NO”, the process proceeds to step S9.

ステップＳ７では、図２に示したカット判定回路９が、内部ＲＡＭ９ｂに保持されているシーケンスリストデータＳＬｄ中から、処理中のＧＯＰの最後のタイムコードデータ及びピクチャタイプデータを読み、これをエンコードシーケンス指示データとして、図２に示した画像タイプ制御部５に供給する。画像タイプ制御部５は、エンコードシーケンス指示データとして供給されるタイムコードデータと、ピクチャタイプデータにより、上記タイムコードデータと同じ値のタイムコードデータの示す時間的位置のフレームの映像信号を、上記ピクチャタイプデータが示すピクチャタイプでエンコード処理する。
ステップＳ８では、図２に示したカット判定回路９が、読み取ったタイムコードデータの値から例えば９フレームまで順次値をインクリメントすると共に、各値が、内部ＲＡＭ９ｂに保持されているＥＤＬに登録されているタイムコードデータの値と一致するか否かを検出する。このとき、カット判定回路９は、一致を検出した場合には、次のＧＯＰに頭出し点が含まれることを示すデータと、読み取ったタイムコードから何フレーム目が一致したのかを示すデータを、一致しない場合にはその旨を示すデータを、内部ＲＡＭ９ｂに記憶する。
ステップＳ９では、図２に示したカット判定回路９が、内部ＲＡＭ９ｂに保持されているシーケンスリストデータＳＬｄから、現在のタイムコードデータと、ピクチャタイプデータを読み、これらのデータを、エンコードシーケンス指示データとして、図２に示した画像タイプ制御部５に供給する。ここで「現在のタイムコードデータ」とは、シーケンスリストデータＳＬｄ上における現在の処理対象画像のタイムコードデータである。 In step S7, the cut determination circuit 9 shown in FIG. 2 reads the last time code data and picture type data of the GOP being processed from the sequence list data SLd held in the internal RAM 9b, and uses this as an encoding sequence. The instruction data is supplied to the image type control unit 5 shown in FIG. The image type control unit 5 uses the time code data supplied as the encoding sequence instruction data and the picture type data to convert the picture signal of the frame at the temporal position indicated by the time code data having the same value as the time code data to the picture Encode with the picture type indicated by the type data.
In step S8, the cut determination circuit 9 shown in FIG. 2 sequentially increments the value from the read time code data value to, for example, 9 frames, and each value is registered in the EDL held in the internal RAM 9b. It is detected whether or not it matches the value of the time code data. At this time, when the cut determination circuit 9 detects a match, the cut determination circuit 9 includes data indicating that the next GOP includes a cue point and data indicating how many frames match from the read time code. If they do not match, data indicating that is stored in the internal RAM 9b.
In step S9, the cut determination circuit 9 shown in FIG. 2 reads the current time code data and picture type data from the sequence list data SLd held in the internal RAM 9b, and uses these data as encoded sequence instruction data. Is supplied to the image type controller 5 shown in FIG. Here, the “current time code data” is time code data of the current processing target image on the sequence list data SLd.

ステップＳ１０では、図２に示したカット判定回路９が、エンコードが終了か否かを判断し、「ＹＥＳ」であれば終了し、「ＮＯ」であればステップＳ１１に移行する。ここで、エンコードが終了か否かの判断は、ＥＤＬに登録されている頭出し点のタイムコードデータについて全て処理を行ったか否かによって行われる。
ステップＳ１１では、図２に示したカット判定回路９が、内部ＲＡＭ９ｂに保持しているデータ、即ち、次のＧＯＰに頭出し点が含まれることを示すデータ、若しくは一致しないことを示すデータに基づき、次のＧＯＰに頭出し点が含まれるか否かを判断し、「ＹＥＳ」であればステップＳ１２に移行し、「ＮＯ」であればステップＳ１３に移行する。 In step S10, the cut determination circuit 9 shown in FIG. 2 determines whether or not the encoding is completed. If “YES”, the process ends. If “NO”, the process proceeds to step S11. Here, whether or not encoding is completed is determined by whether or not all the time code data at the cue point registered in the EDL has been processed.
In step S11, the cut determination circuit 9 shown in FIG. 2 is based on data held in the internal RAM 9b, that is, data indicating that the next GOP includes a cue point, or data indicating that they do not match. Then, it is determined whether or not the next GOP includes a cue point. If “YES”, the process proceeds to step S12, and if “NO”, the process proceeds to step S13.

ステップＳ１２では、図２に示したカット判定回路９が、内部ＲＡＭ９ｂに保持している次のＧＯＰに含まれる頭出し点までのフレーム数データ、即ち、読み取ったタイムコードから何フレーム目が一致したのかを示すデータにより、現時点から数えて何枚目が頭出し点かを検出する。
ステップＳ１３では、図２に示したカット判定回路９が、次のＧＯＰのために、内部ＲＯＭ９ａから通常のシーケンスデータＳｄを読み出し、当該シーケンスデータＳｄを、内部ＲＡＭ９ｂのシーケンスリストデータＳＬｄに登録する。 In step S12, the cut determination circuit 9 shown in FIG. 2 matches the frame number data up to the cue point included in the next GOP held in the internal RAM 9b, that is, the frame number from the read time code. It is detected from the current data how many sheets are counted from the current point.
In step S13, the cut determination circuit 9 shown in FIG. 2 reads normal sequence data Sd from the internal ROM 9a for the next GOP, and registers the sequence data Sd in the sequence list data SLd of the internal RAM 9b.

ステップＳ１５では、カット判定回路９が、“１”枚目か否かを判断し、「ＹＥＳ」であればステップＳ１６に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス０のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス０のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。
ステップＳ１８では、カット判定回路９が、“２”枚目か否かを判断し、「ＹＥＳ」であればステップＳ１９に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス１のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス１のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。 In step S15, the cut determination circuit 9 determines whether or not the sheet is “1”. If “YES”, the process proceeds to step S16 and the sequence data Sd of the cut sequence 0 stored in the internal ROM 9a. The process proceeds to step S17, where the sequence data Sd of the cut sequence 0 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.
In step S18, the cut determination circuit 9 determines whether or not it is the “second” sheet. If “YES”, the process proceeds to step S19, and the sequence data Sd of the cut sequence 1 stored in the internal ROM 9a. The process proceeds to step S17, where the sequence data Sd of the cut sequence 1 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.

ステップＳ２０では、カット判定回路９が、“３”枚目か否かを判断し、「ＹＥＳ」であればステップＳ２１に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス２のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス２のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。
ステップＳ２２では、カット判定回路９が、“４”枚目か否かを判断し、「ＹＥＳ」であればステップＳ２３に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス３のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス３のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。 In step S20, the cut determination circuit 9 determines whether or not it is the “third” sheet. If “YES”, the process proceeds to step S21 and the sequence data Sd of the cut sequence 2 stored in the internal ROM 9a. The process proceeds to step S17, the sequence data Sd of the cut sequence 2 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.
In step S22, the cut determination circuit 9 determines whether or not it is the “fourth” sheet. If “YES”, the process proceeds to step S23, and the sequence data Sd of the cut sequence 3 stored in the internal ROM 9a. Is selected, the process proceeds to step S17, the sequence data Sd of the cut sequence 3 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.

ステップＳ２４では、カット判定回路９が、“５”枚目か否かを判断し、「ＹＥＳ」であればステップＳ２５に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス４のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス４のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。
ステップＳ２６では、カット判定回路９が、“６”枚目か否かを判断し、「ＹＥＳ」であればステップＳ２７に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス５のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス５のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。 In step S24, the cut determination circuit 9 determines whether or not it is the “5th” sheet. If “YES”, the process proceeds to step S25, and the sequence data Sd of the cut sequence 4 stored in the internal ROM 9a. Then, the process proceeds to step S17, the sequence data Sd of the cut sequence 4 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.
In step S26, the cut determination circuit 9 determines whether or not it is the “6th” sheet. If “YES”, the process proceeds to step S27, and the sequence data Sd of the cut sequence 5 stored in the internal ROM 9a. The process proceeds to step S17, where the sequence data Sd of the cut sequence 5 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.

ステップＳ２８では、カット判定回路９が、“７”枚目か否かを判断し、「ＹＥＳ」であればステップＳ２９に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス６のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス６のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。
ステップＳ３０では、カット判定回路９が、“８”枚目か否かを判断し、「ＹＥＳ」であればステップＳ３１に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス７のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス７のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。
ステップＳ３２では、カット判定回路９が、“９”枚目か否かを判断し、「ＹＥＳ」であればステップＳ３３に移行して、内部ＲＯＭ９ａに記憶されている、カットシーケンス８のシーケンスデータＳｄを選択し、ステップＳ１７に移行して、上記カットシーケンス７のシーケンスデータＳｄを読み出し、当該データを、内部ＲＡＭ９ｂにロードし、再びステップＳ４に移行する。 In step S28, the cut determination circuit 9 determines whether or not it is the “7th” sheet. If “YES”, the process proceeds to step S29 and the sequence data Sd of the cut sequence 6 stored in the internal ROM 9a. The process proceeds to step S17, the sequence data Sd of the cut sequence 6 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.
In step S30, the cut determination circuit 9 determines whether or not it is the “8th” sheet. If “YES”, the process proceeds to step S31 and the sequence data Sd of the cut sequence 7 stored in the internal ROM 9a. The process proceeds to step S17, the sequence data Sd of the cut sequence 7 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.
In step S32, the cut determination circuit 9 determines whether or not the sheet is “9”. If “YES”, the process proceeds to step S33 to store the sequence data Sd of the cut sequence 8 stored in the internal ROM 9a. The process proceeds to step S17, the sequence data Sd of the cut sequence 7 is read, the data is loaded into the internal RAM 9b, and the process proceeds to step S4 again.

〔変形例及び実施の形態から導き出される効果〕
本方式は、ＭＰＥＧビデオ（もしくは、それに類似するビデオ圧縮）を記録するものの全てに適応可能である。（ＭＭＣＤであろうがＳＤであろうが、将来あるであろうＭＰＥＧビデオ記録再生機の全て、記録媒体には無関係である。）
あらかじめ指定したタイムコードフレームにおける特定の頭出し点（それがどの理由で頭出し点として指定されたかは問わない）は、エンコードされた画像データの並び方の性格上、確実に一意的に、頭出し点としてアクセスし再生可能となる。
そして、一意に決定できるということは、ＣＤにおけるＴＯＣのように、頭出し点をその画像のタイムコードと記録された媒体上の絶対番地のペアとして記録しても、そのＴＯＣにおける情報と実際に頭出し再生される画像とは矛盾がなくなることが保証される。
従って、伝統的なトラック機構によるアクセスのみならず、ＰＬＡＹＢＡＣＫＣＯＮＴＲＯＬなどのようにより複雑なランダムアクセスの機構においても、頭出し点をその画像のタイムコードと記録された媒体上の絶対番地のペアとして取り扱うというルールのもとにおいても、全く同様に適用できる。将来、ゲーム用の動画など、インタラクティブに頭出ししなければならない場合にも応用可能である。
また、図２においては、最適なエンコードが行えるまでＮ回エンコードを行う、いわゆる２パス以上のシステムを示しているが、いわゆる１パスのシステムでも同様である。１パスのシステムの場合、量子化レベル制御回路１５、量子化レベル修正装置１６、出力レート決定装置１８、ストレージ装置２２、モニタ用デコーダ２１が不用となる。１パスのシステムの場合、量子化レベルは固定だからである。そして、エントロピー符号化器１７の出力は、ストリーム出力装置１９に入力される。 [Effects Derived from Modifications and Embodiments]
This method is applicable to all recording MPEG video (or similar video compression). (Every MPEG video recording / playback device, whether MMCD or SD, will be irrelevant to the recording medium.)
The specific cue point in the time code frame specified in advance (regardless of the reason for specifying it as the cue point) is surely unique in terms of the way the encoded image data is arranged. It can be accessed and played as a point.
The fact that it can be uniquely determined means that even if the cue point is recorded as a pair of the time code of the image and the absolute address on the recorded medium, like the TOC in the CD, the information in the TOC is actually It is guaranteed that there will be no contradiction with the image to be played back.
Therefore, not only the access by the traditional track mechanism but also a more complicated random access mechanism such as PLAYBACKCONTROL, the heading point is handled as a pair of the time code of the image and the absolute address on the recorded medium. The same applies to rules. In the future, it can also be applied to cases where it is necessary to cue interactively, such as video for games.
FIG. 2 shows a so-called two-pass or more system in which encoding is performed N times until optimum encoding can be performed, but the same applies to a so-called one-pass system. In the case of a one-pass system, the quantization level control circuit 15, the quantization level correction device 16, the output rate determination device 18, the storage device 22, and the monitor decoder 21 are unnecessary. This is because the quantization level is fixed in the case of a one-pass system. The output of the entropy encoder 17 is input to the stream output device 19.

実施の形態の説明に供する圧縮エンコードの説明図である。〔図１Ａ〕通常シーケンスを示す説明図である。〔図１Ｂ〕カットシーケンスを示す説明図である。〔図１Ｃ〕エンコーダ出力を示す説明図である。〔図１Ｄ〕デコーダ入力を示す説明図である。〔図１Ｅ〕再生された画像を示す説明図である。It is explanatory drawing of the compression encoding used for description of embodiment. FIG. 1A is an explanatory diagram showing a normal sequence. FIG. 1B is an explanatory diagram showing a cut sequence. [FIG. 1C] It is explanatory drawing which shows an encoder output. [FIG. 1D] It is explanatory drawing which shows a decoder input. FIG. 1E is an explanatory diagram showing a reproduced image. 実施の形態を示すオーサリングシステムの構成図である。It is a block diagram of the authoring system which shows embodiment. 図２に示したエディットリストデータ及びシーケンスリストデータの一例を示す説明図である。〔図３Ａ〕エディットリストデータの一例を示す説明図である。〔図３Ｂ〕シーケンスリストデータの一例を示す説明図である。It is explanatory drawing which shows an example of the edit list data and sequence list data shown in FIG. FIG. 3A is an explanatory diagram showing an example of edit list data. FIG. 3B is an explanatory diagram showing an example of sequence list data. カットシーケンスの一例を示す説明図である。〔図４Ａ〕通常シーケンスの一例を示す説明図である。〔図４Ｂ〕カットシーケンス０の一例を示す説明図である。〔図４Ｃ〕カットシーケンス１の一例を示す説明図である。〔図４Ｄ〕カットシーケンス２の一例を示す説明図である。〔図４Ｅ〕カットシーケンス３の一例を示す説明図である。〔図４Ｆ〕カットシーケンス４の一例を示す説明図である。〔図４Ｇ〕カットシーケンス５の一例を示す説明図である。〔図４Ｈ〕カットシーケンス６の一例を示す説明図である。〔図４Ｉ〕カットシーケンス７の一例を示す説明図である。〔図４Ｊ〕カットシーケンス８の一例を示す説明図である。It is explanatory drawing which shows an example of a cut sequence. FIG. 4A is an explanatory diagram showing an example of a normal sequence. FIG. 4B is an explanatory diagram showing an example of a cut sequence 0. FIG. 4C is an explanatory diagram showing an example of a cut sequence 1. FIG. 4D is an explanatory diagram showing an example of a cut sequence 2. FIG. 4E is an explanatory diagram showing an example of a cut sequence 3. FIG. 4F is an explanatory diagram showing an example of the cut sequence 4. FIG. 4G is an explanatory diagram showing an example of the cut sequence 5. FIG. 4H is an explanatory diagram showing an example of the cut sequence 6. FIG. 4I is an explanatory diagram showing an example of the cut sequence 7. FIG. 4J is an explanatory diagram showing an example of the cut sequence 8. 図２に示したオーサリングシステムの主要な動作を説明するためのフローチャートである。3 is a flowchart for explaining main operations of the authoring system shown in FIG. 2. 図２に示したオーサリングシステムの主要な動作を説明するためのフローチャートである。3 is a flowchart for explaining main operations of the authoring system shown in FIG. 2. 図２に示したオーサリングシステムの主要な動作を説明するためのフローチャートである。3 is a flowchart for explaining main operations of the authoring system shown in FIG. 2. 従来の技術の説明に供するＧＯＰが完全に独立していないことを説明するための説明図である。It is explanatory drawing for demonstrating that GOP with which description of a prior art is provided is not completely independent. 従来の技術の説明に供する頭出し再生における問題点を説明するための説明図である。It is explanatory drawing for demonstrating the problem in the cue reproduction | regeneration used for description of the prior art. ＣＬＯＳＥＤＧＯＰを適用する方法を説明するための説明図である。It is explanatory drawing for demonstrating the method of applying CLOSED GOP.

Explanation of symbols

１マスターテープ（ビデオ・テープ・カセット）
２デジタルＶＴＲ
３画像遅延装置
４操作部
５画像タイプ制御部
６フレキシブル・ディスク
７頭出し点読み取り部
８頭出し点修正装置
９カット判定回路
９ａＲＯＭ
Ｓｄシーケンスデータ
９ｂＲＡＭ
ＥＤＬｄＥＤＬデータ
ＳＬｄシーケンスリストデータ
１０ＤＣＴ
１１量子化器
１２逆量子化器
１３逆ＤＣＴ
１５量子化レベル制御回路
１６量子化レベル修正装置
１７エントロピー符号化器
１８出力レート決定装置
１９ストリーム出力装置
２０出力端子
２１モニタ用デコーダ
２２ストレージ装置
２３エンコーダ制御部 1 Master tape (video tape cassette)
2 Digital VTR
DESCRIPTION OF SYMBOLS 3 Image delay apparatus 4 Operation part 5 Image type control part 6 Flexible disk 7 Cue point reading part 8 Cue point correction apparatus 9 Cut determination circuit 9a ROM
Sd Sequence data 9b RAM
EDLd EDL data SLd Sequence list data 10 DCT
11 Quantizer 12 Inverse Quantizer 13 Inverse DCT
DESCRIPTION OF SYMBOLS 15 Quantization level control circuit 16 Quantization level correction apparatus 17 Entropy encoder 18 Output rate determination apparatus 19 Stream output apparatus 20 Output terminal 21 Monitor decoder 22 Storage apparatus 23 Encoder control part

Claims

In an authoring system that generates encoded data by encoding a video signal,
When a cue point at the time of reproducing the video signal is designated, a delimiter determination that determines a delimiter of the new GOP so that the picture corresponding to the cue point becomes the head of the new GOP in the display order Means,
The picture type of the picture in the new GOP is determined so that reference prediction accompanying encoding is not required between the new GOPs determined by the delimiter determining means,
GOP composing means constituting the new GOP;
Generated bit amount calculating means for calculating a generated bit amount accompanying encoding by encoding the video signal in which the new GOP is configured by the GOP configuring unit ;
Quantization level control means for adjusting a quantization level based on the generated bit quantity calculated by the generated bit quantity calculation means;
Encoding means for generating the encoded data by encoding the video signal in which the new GOP is configured according to the quantization level adjusted by the quantization level control means ;
Authoring system with.

2. The authoring system according to claim 1, wherein the quantization level control unit adjusts a quantization level of a designated cut in the video signal in which the new GOP is configured .

Encoding control means for controlling the encoding means to encode only the cut
The authoring system according to claim 2 , further comprising:

The GOP composing means composes the cut sequence so that the picture corresponding to the cue point is an I picture, and the last picture in the display order in the GOP immediately before the delimiter is a P picture. The authoring system according to claim 1, comprising a new GOP .

2. The authoring system according to claim 1, wherein the delimiter determining unit determines a break between reproduction units in a logical format of a recording medium or reproduction units in a physical format of a recording medium as a delimiter of the GOP.

The authoring system according to claim 1, further comprising recording means for recording the encoded data generated by the encoding means on a recording medium.

The authoring system according to claim 1, further comprising output means for outputting the encoded data generated by the encoding means.

In an authoring method for generating encoded data by encoding a video signal,
When a cue point at the time of reproducing the video signal is designated, a delimiter determination that determines a delimiter of the new GOP so that the picture corresponding to the cue point becomes the head of the new GOP in the display order Process,
To not require reference prediction associated with coding between the new GOP determined by the delimiter determining step determines a picture type of picture in the new in GOP,
A GOP configuration process for configuring the new GOP;
A generated bit amount calculating step of calculating a generated bit amount accompanying encoding by encoding the video signal in which the new GOP is configured by the GOP configuring step ;
A quantization level control step of adjusting a quantization level based on the generated bit amount calculated by the generated bit amount calculation step;
An encoding step of generating the encoded data by encoding the video signal in which the new GOP is configured according to the quantization level adjusted by the quantization level control step ;
Authoring method including.

In an authoring system that generates encoded data by encoding a video signal,
When a cut is designated at a predetermined position of the video signal, delimiter determining means for determining a delimiter of the new GOP so that a picture corresponding to the cut becomes the head of the new GOP in the display order;
The picture type of the picture in the new GOP is determined so that reference prediction accompanying encoding is not required between the new GOPs determined by the delimiter determining means,
GOP composing means constituting the new GOP;
Generated bit amount calculating means for calculating a generated bit amount accompanying encoding by encoding the video signal in which the new GOP is configured by the GOP configuring unit ;
Quantization level control means for adjusting a quantization level based on the generated bit quantity calculated by the generated bit quantity calculation means;
Encoding means for generating the encoded data by encoding the video signal in which the new GOP is configured according to the quantization level adjusted by the quantization level control means ;
Authoring system with.

The authoring system according to claim 9, wherein the quantization level control means adjusts a quantization level of a designated cut in the video signal in which the new GOP is configured .

Encoding control means for controlling the encoding means to encode only the cut
The authoring system according to claim 10 , further comprising:

The GOP composing means composes the cut sequence so that the picture corresponding to the cut becomes an I picture, and the new picture so that the last picture viewed in the display order in the GOP immediately before the delimiter becomes a P picture. The authoring system according to claim 9 constituting a GOP .

The authoring system according to claim 9, wherein the delimiter determining means determines a break between reproduction units in a logical format of a recording medium or reproduction units in a physical format of a recording medium as a delimiter of the GOP.

The authoring system according to claim 9, further comprising recording means for recording the encoded data generated by the encoding means on a recording medium.

The authoring system according to claim 9, further comprising output means for outputting the encoded data generated by the encoding means.

In an authoring method for generating encoded data by encoding a video signal,
When a cut is designated at a predetermined position of the video signal, a delimiter determining step for determining a delimiter of the new GOP so that a picture corresponding to the cut becomes a head of the new GOP in the display order;
To not require reference prediction associated with coding between the new GOP determined by the delimiter determining step determines a picture type of picture in the new in GOP,
A GOP configuration process for configuring the new GOP;
A generated bit amount calculating step of calculating a generated bit amount accompanying encoding by encoding the video signal in which the new GOP is configured by the GOP configuring step ;
A quantization level control step of adjusting a quantization level based on the generated bit amount calculated by the generated bit amount calculation step;
An encoding step of generating the encoded data by encoding the video signal in which the new GOP is configured according to the quantization level adjusted by the quantization level control step ;
Authoring method including.