JP2005064638A

JP2005064638A - Method for compressing video signal

Info

Publication number: JP2005064638A
Application number: JP2003289539A
Authority: JP
Inventors: Toshio Motegi; 敏雄茂出木
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2003-08-08
Filing date: 2003-08-08
Publication date: 2005-03-10

Abstract

<P>PROBLEM TO BE SOLVED: To provide a method for compressing a video signal capable of further applying lossless compression to the video signal subjected to lossy compression . <P>SOLUTION: Each block of compression frames configuring video signals subjected to lossy (irreversible) compression is used for a coding object block, and a recording form of the coding object block is revised in response to the number of coincident bytes of a reference block with the highest correlation by collating the coding object block with three reference blocks, an immediately preceding block, an immediately preceding line block, and an immediately preceding frame block. For example, when the coding object block is 9F (14, 2), the immediately preceding block is 9F (14, 2), the immediately preceding line block is 9F (15, 1), and the immediately preceding frame block becomes a block at the same position of the immediately preceding frame. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、業務用または民生用ビデオカメラ映像の保存と伝送、コンピュータにビデオ映像をキャプチャして映像制作を行うノンリニア映像制作におけるマスター素材の保存と伝送、電子カルテや遠隔医療における医用動画像の保存と伝送において好適なデータの圧縮技術に関する。 The present invention relates to storage and transmission of commercial or consumer video camera images, storage and transmission of master material in non-linear video production in which video images are captured and produced by a computer, medical video images in electronic medical records and telemedicine The present invention relates to a data compression technique suitable for storage and transmission.

動画像である映像は、そのデータ量が多いため、非圧縮で扱うと、例えばＮＴＳＣ方式でも１秒あたり３０Ｍ（メガ）バイト、ハイビジョン方式では１５０Ｍバイトになる。すなわち、ＤＶＤ１枚あたり、ＮＴＳＣは２．６分、ハイビジョンは３０秒しか収納できない。そのため、従来より、様々な手法で映像信号の圧縮が試みられている。 A video that is a moving image has a large amount of data. Therefore, if it is handled uncompressed, for example, the NTSC system has 30 M (mega) bytes per second, and the high vision system has 150 Mbytes. In other words, NTSC can store only 2.6 minutes and HDTV can store only 30 seconds per DVD. Therefore, conventionally, compression of video signals has been attempted by various methods.

最も好ましいのは、ロスレス（可逆）型の圧縮方式であり、復号することにより、完全に元の状態に戻すことができるものである（例えば、特許文献１参照）。しかし、ロスレス型では、データの圧縮率は１／２程度が限界である。そのため、映像信号の圧縮には、復号したときには、完全に元の状態に戻すことはできないが、圧縮率を１／５〜１／１０とすることができるロッシー（非可逆）型が用いられている。 Most preferred is a lossless (reversible) compression method, which can be restored to its original state by decoding (see, for example, Patent Document 1). However, the lossless type has a limit of about 1/2 of the data compression rate. Therefore, the compression of the video signal cannot be completely restored to the original state when it is decoded, but a lossy (irreversible) type capable of reducing the compression rate to 1/5 to 1/10 is used. Yes.

特に、現状のＶＴＲに収録されるデジタル映像データとして、業務用デジタルベータカムと、業務用ＤＶカムおよび民生用ＤＶ等のロッシー圧縮方式が用いられている。これらは、いずれもフレームごとに離散コサイン変換による圧縮を行うことにより、データ量の削減を図っている。ここで、業務用ＤＶカムおよび民生用ＤＶの圧縮方式について具体的に説明しておく。 In particular, as digital video data recorded in a current VTR, a lossy compression method such as a business digital beta cam, a business DV cam, and a consumer DV is used. In both of these, the amount of data is reduced by performing compression by discrete cosine transform for each frame. Here, the compression method of the commercial DV cam and the consumer DV will be specifically described.

まず、映像をＴＶカメラ等で撮影してデジタル化した状態の原映像信号を模式的に表現したものを図６（ａ）に示す。図６において、左右方向は時系列方向であり、右側に行く程、時間が進むことになる。図６（ａ）に示した原映像信号は１フレームが光の三原色であるＲ・Ｇ・Ｂの３プレーンで構成されている。なお、ここでは、１秒間に３０フレーム、各画素の各色（プレーン）に８ビットが割り当てられているものとしている。例えば、１０秒間に渡って記録したデジタル映像信号は、３００フレーム、９００プレーンで構成されるフレーム群となる。また、各フレームの画素数は、７２０×４８０としている。 First, FIG. 6A shows a schematic representation of an original video signal in a state where a video is photographed with a TV camera or the like and digitized. In FIG. 6, the left-right direction is a time-series direction, and the time advances as it goes to the right. The original video signal shown in FIG. 6A is composed of three planes of R, G, and B in which one frame is the three primary colors of light. Here, it is assumed that 30 frames per second and 8 bits are assigned to each color (plane) of each pixel. For example, a digital video signal recorded for 10 seconds becomes a frame group composed of 300 frames and 900 planes. The number of pixels in each frame is 720 × 480.

ＤＶフォーマットでは、上述のようなデジタル映像信号を、各フレーム単位で以下のように処理していく。まず、Ｒ、Ｇ、Ｂの各プレーンの画素信号を以下の〔数式１〕を利用して、Ｙ、Ｕ、Ｖに変換する。
〔数式１〕
Ｙ＝０．２９９Ｒ＋０．５８７Ｇ＋０．１１４Ｂ
Ｕ＝−０．１６８４Ｒ−０．３３１６Ｇ＋０．５Ｂ
Ｖ＝０．５Ｒ−０．４１８７Ｇ−０．０８１３Ｂ
上記〔数式１〕は、小数点演算を伴うため、整数値化されたＹ、Ｕ、Ｖから、元のＲ、Ｇ、Ｂの値を復元することはできず、この状態で基本的にロッシー型になる。さらに、Ｕプレーン、Ｖプレーンについては、縦方向、横方向に画素を１／２に間引きする。例えば、７２０×４８０画素の画像の場合、３６０×２４０画素の画像となり、画素数が１／４に減少することになる。このときの映像信号の状態を図６（ｂ）に示す。続いて、８×８画素単位で、Ｙプレーンの１単位に対して、Ｕプレーンの１単位、Ｖプレーンの１単位を割り当てて、計６単位の画素からなるブロックを形成する。１画素が１バイトであるため、このブロックのデータ量は、３８４バイトとなる。続いて、このブロックに対して、ＤＣＴ変換（離散コサイン変換）、量子化、可変長符号化を行うことにより、１ブロックを８０バイトのデータに圧縮する。 In the DV format, the digital video signal as described above is processed in units of frames as follows. First, the pixel signals of the R, G, and B planes are converted into Y, U, and V using the following [Equation 1].
[Formula 1]
Y = 0.299R + 0.587G + 0.114B
U = -0.1684R-0.3316G + 0.5B
V = 0.5R-0.4187G-0.0813B
Since the above [Formula 1] involves a decimal point operation, the original values of R, G, and B cannot be restored from Y, U, and V converted to integer values. In this state, the lossy type is basically used. become. Further, for the U plane and the V plane, the pixels are thinned out in half in the vertical direction and the horizontal direction. For example, in the case of an image of 720 × 480 pixels, an image of 360 × 240 pixels is obtained, and the number of pixels is reduced to ¼. The state of the video signal at this time is shown in FIG. Subsequently, in units of 8 × 8 pixels, one unit of the U plane and one unit of the V plane are allocated to one unit of the Y plane to form a block composed of a total of six units of pixels. Since one pixel is 1 byte, the data amount of this block is 384 bytes. Subsequently, one block is compressed into 80-byte data by performing DCT transform (discrete cosine transform), quantization, and variable length coding on this block.

上記のようにブロックを定義すると、元の１フレームは、４５×３０個のブロックで表現されることになる。ＤＶフォーマットでは、圧縮されたブロックを「９Ｆ」と定義し、１５ブロック毎に１つのヘッダー「７Ｆ」を付加する。さらに、この１つのヘッダー「７Ｆ」とそれに続く１５個のブロックを１セットとし、９セット毎に、「１Ｆ」１つ、「３Ｆ」２つ、「５Ｆ」３つで構成されるヘッダーを付加してラインを構成する。すると、元のフレームは１０ラインで構成されることになる。このラインは、原画像の走査線方向に沿ったものとなっている。この結果、得られる圧縮フレームの構造を図７に示す。以上のようにしてＤＶフォーマットに変換することにより、映像データは、１０３６８００バイトから１２００００バイトに圧縮されることになる。ＤＶ方式では、同一フレームにおけるプレーン間の演算処理は行われるが、フレーム間の演算は行われない。そのため、プレーンの概念はなくなるが、図６に示したようにｎ個のフレームがあった場合は、図７に示したような圧縮フレームがｎ個作成されることになる。また、フレーム番号Ｆ１〜Ｆｎもそのまま引き継がれる。
特開平１１−１６４３０３号公報 When blocks are defined as described above, one original frame is represented by 45 × 30 blocks. In the DV format, a compressed block is defined as “9F”, and one header “7F” is added every 15 blocks. Furthermore, this one header “7F” and the following 15 blocks are set as one set, and a header composed of “1F”, “3F”, and “5F” is added every 9 sets. And configure the line. Then, the original frame is composed of 10 lines. This line is along the scanning line direction of the original image. The resulting compressed frame structure is shown in FIG. By converting to the DV format as described above, the video data is compressed from 1036800 bytes to 120,000 bytes. In the DV system, calculation processing between planes in the same frame is performed, but calculation between frames is not performed. For this reason, the concept of a plane is eliminated, but when there are n frames as shown in FIG. 6, n compressed frames as shown in FIG. 7 are created. The frame numbers F1 to Fn are also taken over as they are.
JP 11-164303 A

しかし、映像信号は、ＤＶ等のロッシー型で圧縮した場合であっても依然としてデータ量が多い。そのため、データを伝送する際などに、媒体や通信回線の制約のため、さらに圧縮を行う必要に迫られる場合がある。この場合、データ量は減らしたいが、一度ロッシー圧縮されている映像信号から、それ以上データを欠落させたくないという要望がある。しかし、このようなロッシー圧縮された映像信号は、上述のように独特の構造をしているため、圧縮されていない映像信号に適応している従来のロスレス圧縮方式を適用することはできない。 However, even if the video signal is compressed by a lossy type such as DV, the amount of data is still large. For this reason, when data is transmitted, it may be necessary to perform further compression due to restrictions on a medium or a communication line. In this case, there is a desire to reduce the amount of data, but not to delete any more data from the video signal once lossy compressed. However, since the lossy compressed video signal has a unique structure as described above, the conventional lossless compression method adapted to the uncompressed video signal cannot be applied.

そこで、本発明は、ロッシー型の圧縮が行われた映像信号を、さらにロスレス圧縮することが可能な映像信号の圧縮方法を提供することを課題とする。 Therefore, an object of the present invention is to provide a video signal compression method capable of further lossless compression of a video signal subjected to lossy compression.

上記課題を解決するため、本発明では、時系列に連続したフレーム各々がロッシー（非可逆）型手法で圧縮された圧縮フレームで構成される映像信号に対して、圧縮フレームを再現できるように情報量を圧縮する方法として、圧縮フレーム内において、画素をブロック化した各画素ブロックを符号化対象ブロックとして、当該各符号化対象ブロックに対して、走査線方向に直前に位置する画素ブロックである直前ブロック、走査線方向と垂直な方向において直前に位置する画素ブロックである直前ラインブロック、直前フレームの同一箇所に位置する画素ブロックである直前フレームブロックの３つの参照ブロックと照合を行い、最も相関性の高い参照ブロックを選出する参照ブロック選出段階、選出された参照ブロックと符号化対象ブロックとの相関性として、完全一致、相関度大、相関度小の３種より判別する相関性判定段階、符号化対象ブロックの出力データとして、相関性が完全一致の場合は、選出された参照ブロックの識別ＩＤを出力し、相関性が相関度大の場合は、選出された参照ブロックの識別ＩＤと、選出された参照ブロックと符号化対象ブロック内の一致データのブロック内における位置を示すマスクデータと、参照ブロックと一致しない符号化対象ブロック内のデータを出力し、相関性が相関度小の場合は符号化対象ブロック内の一致データおよび不一致データを出力するブロック符号化段階を実行するようにしたことを特徴とする。 In order to solve the above problems, in the present invention, information is reproduced so that a compressed frame can be reproduced for a video signal composed of compressed frames in which each successive frame in time series is compressed by a lossy (irreversible) method. As a method of compressing the quantity, each pixel block obtained by blocking pixels in the compressed frame is set as an encoding target block, and immediately before the pixel block positioned immediately before in the scanning line direction with respect to each encoding target block. Compared with the three reference blocks of the block, the immediately preceding line block which is the pixel block located immediately before in the direction perpendicular to the scanning line direction, and the immediately preceding frame block which is the pixel block located at the same position of the immediately preceding frame, the most correlated The reference block selection stage for selecting a reference block with a high value, the selected reference block and the encoding target block As a correlation, a correlation determination stage that discriminates from three types of complete match, high correlation degree, and low correlation degree, and when the correlation is complete match as the output data of the encoding target block, the selected reference block If the correlation is large, the identification ID of the selected reference block, and the mask data indicating the position of the selected reference block and the matching data in the encoding target block in the block And a block encoding step that outputs data in the encoding target block that does not match the reference block, and outputs matching data and non-matching data in the encoding target block when the correlation is low. It is characterized by that.

本発明によれば、圧縮フレーム内の符号化対象ブロックについて、走査線方向、直前のブロック、走査線に直交する方向の直前ラインのブロック、直前フレームの同一のブロックの３つのブロック、すなわち、画像面における２方向および時系列方向に隣接する３つのブロックを参照ブロックとして照合を行って、最も相関の高い参照ブロックを選出し、参照ブロックとの相関度に応じて、異なる形式で符号化を行うようにしたので、圧縮フレームにおけるブロック間の関係を壊すことなく、圧縮が可能となる。すなわち、ロッシー型の圧縮が行われた映像信号を、さらにロスレス圧縮することが可能となる。 According to the present invention, with respect to the encoding target block in the compressed frame, three blocks of the scanning line direction, the previous block, the previous line block in the direction orthogonal to the scanning line, and the same block of the previous frame, that is, the image Three blocks adjacent to each other in the two directions and the time-series direction in the plane are compared as reference blocks, the reference block having the highest correlation is selected, and encoding is performed in different formats according to the degree of correlation with the reference block. Since it did in this way, compression is possible, without destroying the relationship between the blocks in a compression frame. That is, it becomes possible to further losslessly compress the video signal subjected to lossy compression.

以下、本発明の実施形態について図面を参照して詳細に説明する。
（ロッシー圧縮された映像信号の構造）
本実施形態では、ＤＶフォーマットの映像信号に対して処理を行う場合を例にとって説明する。図７のように、元の１フレーム３プレーン（Ｒ、Ｇ、Ｂ）の情報は、ＤＶフォーマットの１つの圧縮フレームに変換される。データ形式は、このようになっているが、元のフレームとの位置関係を考えると、圧縮フレームにおける１ラインは、ブロックのライン３本分に相当する。そのため、ここでは、説明の便宜上、圧縮フレームを図１に示すような、元のブロックの位置に対応した状態に移動したものを用いて説明する。図７と図１に示したものは、ブロックの位置を変えたのみであり、全く同一の構成を示している。
（本発明の圧縮方法）
続いて、本発明に係る映像信号の圧縮方法の概要について説明する。本発明の圧縮方法は、コンピュータ、およびコンピュータに搭載される専用のソフトウェアプログラムにより実行される。図２は、本発明に係る映像信号の圧縮方法の概要を示すフローチャートである。まず、図１に示したようなロッシー圧縮された映像信号を、ロスレス圧縮するための装置（専用のソフトウェアを搭載したコンピュータ）に読み込ませる。すると、ロスレス圧縮用の装置が処理を開始する。まず、ロッシー圧縮映像信号の各フレームのうち、画素ブロックを順次読込んで符号化対象ブロックとし、各符号化対象ブロックについて、直前の画素ブロックである直前ブロックとの照合を行う（ステップＳ１）。具体的には、符号化対象ブロック９Ｆ（ｉ，ｊ）について、直前ブロック９Ｆ（ｉ−１，ｊ）との照合を行う。例えば、図１に示した９Ｆ（１，１）の場合は、直前ブロックが存在しないので、照合が行われないが、９Ｆ（２，１）の場合は、直前ブロック９Ｆ（１，１）との照合が行われる。なお、ＤＶフォーマットのデータとしては、９Ｆ（１，２）の直前ブロックは、９Ｆ（４５，１）であるが、図１に示すように、ブロック９Ｆ（１，２）は元の画像の左端に位置するブロックであり、ブロック９Ｆ（４５，１）は元の画像の右端に位置するブロックであるため、相関が低いと考えられる。そのため、このような場合には、直前ブロックとの照合は行わない。すなわち、ｉ＝１以外の場合に、直前ブロックとの照合が行われることになる。照合は、バイト単位で行う。１ブロックは８０バイトであるので、８０回の照合が行われることになる。このうち、何バイトが一致したかをメモリに一時的に記録しておく。例えば、９Ｆ（１５，２）が、直前ブロック９Ｆ（１４，２）と５０バイト一致したとすると、図３（ａ）に示すような照合データが記録される。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
(Structure of lossy compressed video signal)
In the present embodiment, a case where processing is performed on a DV format video signal will be described as an example. As shown in FIG. 7, the information of the original 1 frame 3 plane (R, G, B) is converted into one compressed frame in the DV format. The data format is as described above, but considering the positional relationship with the original frame, one line in the compressed frame corresponds to three lines of the block. Therefore, here, for convenience of explanation, description will be made using a compressed frame moved to a state corresponding to the position of the original block as shown in FIG. FIG. 7 and FIG. 1 only show the same configuration except that the position of the block is changed.
(Compression method of the present invention)
Next, an outline of a video signal compression method according to the present invention will be described. The compression method of the present invention is executed by a computer and a dedicated software program installed in the computer. FIG. 2 is a flowchart showing an outline of a video signal compression method according to the present invention. First, a lossy-compressed video signal as shown in FIG. 1 is read by a device for lossless compression (computer equipped with dedicated software). Then, the device for lossless compression starts processing. First, among the frames of the lossy compressed video signal, the pixel blocks are sequentially read to be the encoding target blocks, and each encoding target block is compared with the immediately preceding block which is the immediately preceding pixel block (step S1). Specifically, the encoding target block 9F (i, j) is collated with the immediately preceding block 9F (i-1, j). For example, in the case of 9F (1, 1) shown in FIG. 1, since the immediately preceding block does not exist, collation is not performed. However, in the case of 9F (2, 1), the immediately preceding block 9F (1, 1) Are collated. As DV format data, the block immediately before 9F (1,2) is 9F (45,1), but as shown in FIG. 1, the block 9F (1,2) is the left end of the original image. Since the block 9F (45, 1) is a block located at the right end of the original image, it is considered that the correlation is low. Therefore, in such a case, collation with the immediately preceding block is not performed. In other words, when i = 1, collation with the immediately preceding block is performed. Collation is performed in units of bytes. Since one block is 80 bytes, collation is performed 80 times. Of these, how many bytes match is temporarily recorded in the memory. For example, if 9F (15, 2) matches 50 bytes with the immediately preceding block 9F (14, 2), collation data as shown in FIG. 3A is recorded.

次に、走査線方向と垂直な方向において直前に位置する画素ブロックである直前ラインブロックとの照合を行う（ステップＳ２）。具体的には、処理対象ブロック９Ｆ（ｉ，ｊ）について、直前ラインブロック９Ｆ（ｉ，ｊ−１）との照合を行う。図１に示した９Ｆ（１，１）〜９Ｆ（４５，１）の場合は、直前ラインブロックが存在しないので、照合が行われないが、例えば９Ｆ（１５，２）の場合は、直前ラインブロック９Ｆ（１５，１）との照合が行われる。すなわち、ｊ＝１以外の場合に、直前ラインブロックとの照合が行われることになる。ステップＳ２における照合の手法は、ステップＳ１と同様、バイト単位で行い、一致したバイト数を一時的に記録しておく。例えば、９Ｆ（１５，２）が、直前ラインブロック９Ｆ（１５，１）と３８バイト一致したとすると、図３（ｂ）に示すような照合データが記録される。 Next, collation with the immediately preceding line block which is a pixel block located immediately before in the direction perpendicular to the scanning line direction is performed (step S2). Specifically, the processing target block 9F (i, j) is collated with the immediately preceding line block 9F (i, j-1). In the case of 9F (1, 1) to 9F (45, 1) shown in FIG. 1, there is no immediately preceding line block, so no matching is performed, but in the case of 9F (15, 2), for example, the immediately preceding line A check with block 9F (15, 1) is performed. That is, when j is not 1, collation with the previous line block is performed. The matching method in step S2 is performed in units of bytes as in step S1, and the number of matched bytes is temporarily recorded. For example, if 9F (15, 2) matches 38 bytes with the immediately preceding line block 9F (15, 1), collation data as shown in FIG. 3B is recorded.

次に、直前フレームの同一箇所に存在する直前フレームブロックとの照合を行う（ステップＳ３）。ロッシー圧縮映像信号を構成する全てのフレームは、全て同一の構成となっており、図１に示したような構成となっている。そのため、直前フレームには、符号化対象ブロックと同一箇所に位置する画素ブロックが必ず存在する。従って、ステップＳ３においては、ステップＳ１、ステップＳ２とは異なり、例外無く全ての符号化対象ブロックに対して照合が行われることになる。例えばフレームＦｎにおける９Ｆ（１５，２）の場合は、直前のフレームＦｎ-1の９Ｆ（１５，２）との照合が行われる。なお、照合自体は、ステップＳ１、ステップＳ２と同様、バイト単位で行われ、一致したバイト数を一時的に記録しておく。例えば、９Ｆ（１５，２）が、直前のフレームＦｎ-1の９Ｆ（１５，２）と４６バイト一致したとすると、図３（ｃ）に示すような照合データが記録される。 Next, collation with the immediately preceding frame block existing at the same location of the immediately preceding frame is performed (step S3). All the frames constituting the lossy compressed video signal have the same configuration, as shown in FIG. Therefore, there is always a pixel block located at the same location as the encoding target block in the immediately preceding frame. Accordingly, in step S3, unlike in steps S1 and S2, collation is performed for all the encoding target blocks without exception. For example, in the case of 9F (15, 2) in the frame Fn, collation with 9F (15, 2) of the immediately preceding frame Fn−1 is performed. The collation itself is performed in units of bytes as in Steps S1 and S2, and the number of matched bytes is temporarily recorded. For example, if 9F (15, 2) matches 46 bytes with 9F (15, 2) of the immediately preceding frame Fn-1, collation data as shown in FIG. 3C is recorded.

ある符号化対象ブロックについて、上記ステップＳ１〜ステップＳ３の３つの照合を行ったら、次に、照合した参照ブロックのうち、相関性の最も高い参照ブロックを選定する（ステップＳ４）。具体的には、メモリに一時的に記録してある各一致バイト数が最大となる参照ブロックを選出することになる。例えば、図３（ｃ）の例では、９Ｆ（１４，２）との一致バイト数が最大であるので、９Ｆ（１４，２）が参照ブロックとして選出されることになる。ここで、選出された参照ブロックの一致バイト数により以降の処理が異なる。 After performing the above-mentioned three collations of Step S1 to Step S3 for a certain encoding target block, the reference block having the highest correlation is selected from the collated reference blocks (Step S4). Specifically, the reference block having the maximum number of matching bytes temporarily recorded in the memory is selected. For example, in the example of FIG. 3C, since the number of matching bytes with 9F (14, 2) is the maximum, 9F (14, 2) is selected as the reference block. Here, the subsequent processing differs depending on the number of matching bytes of the selected reference block.

例えば、選出された参照ブロックとの一致バイト数が８０バイトの場合、すなわち、符号化対象ブロックと選出した参照ブロックのデータが全く同一であった場合、モード「２」を出力すると共に、参照ブロックの識別ＩＤを、符号化対象ブロックの識別ＩＤと対応付けて出力する（ステップＳ５）。例えば、９Ｆ（１６，３）が直前画素ブロックと完全に一致した場合、図４（ａ）に示すような情報が記録される。なお、識別ＩＤは、ブロックが識別できるような情報であればどのようなものであっても良い。例えば、図４に示す９Ｆ（１４，２）のように、走査線方向および走査線と直交する方向における順番を識別ＩＤとして用いても良いし、直前フレームブロックを特定する場合には、フレーム内の位置は符号化対象ブロックと同一でわかっているため、直前のフレームであることを示す情報を記録しておけば、特定できることになる。また、モードとは、符号化のパターンを示すものであり、復号時に参照することにより、高速に処理することが可能となる。 For example, when the number of matching bytes with the selected reference block is 80 bytes, that is, when the data of the block to be encoded and the selected reference block are exactly the same, the mode “2” is output and the reference block Is output in association with the identification ID of the encoding target block (step S5). For example, when 9F (16, 3) completely matches the previous pixel block, information as shown in FIG. 4A is recorded. The identification ID may be any information as long as the information can identify the block. For example, as in 9F (14, 2) shown in FIG. 4, the order in the scanning line direction and the direction orthogonal to the scanning line may be used as the identification ID. Since the position of is known in the same way as the encoding target block, it can be specified by recording information indicating that it is the immediately preceding frame. The mode indicates an encoding pattern, and can be processed at a high speed by referring to it at the time of decoding.

一致バイト数が２０バイト以上８０バイト未満（７９バイト以下）の場合、モード「１」を出力すると共に、符号化対象ブロックにおける参照ブロックと一致したバイトの位置情報を出力し、一致しなかったバイトのデータそのものを出力する（ステップＳ６）。例えば、９Ｆ（１５，２）に対して、参照ブロック９Ｆ（１４，２）と５０バイト一致している場合、図４（ｂ）に示すような情報が記録される。 When the number of matching bytes is 20 bytes or more and less than 80 bytes (79 bytes or less), mode “1” is output, and position information of bytes that match the reference block in the encoding target block is output, and bytes that do not match The data itself is output (step S6). For example, when 50 bytes match with the reference block 9F (14, 2) for 9F (15, 2), information as shown in FIG. 4B is recorded.

マスクデータの記録には、１０バイトを割り当てている。また、参照ブロックの識別ＩＤの記録、および必要な情報を記録するための予備領域として、１０バイト割り当てている。そのため、一致しなかったバイトが６０バイト以上になると、総データ量が８０バイト以上となってしまう。そのため、一致バイト数が２０バイト未満（１９バイト以下）の場合は、モード「０」を出力すると共に、符号化対象ブロックのデータを何ら変更することなく、そのまま出力する（ステップＳ７）。例えば、９Ｆ（１，５）に対して、参照ブロックの一致バイト数が２０バイト未満だったとすると、図４（ｃ）に示すような情報が記録される。 10 bytes are allocated for recording the mask data. Also, 10 bytes are allocated as a reserved area for recording the identification ID of the reference block and recording necessary information. Therefore, if the number of unmatched bytes is 60 bytes or more, the total data amount is 80 bytes or more. Therefore, if the number of matching bytes is less than 20 bytes (19 bytes or less), the mode “0” is output and the data of the encoding target block is output as it is without any change (step S7). For example, if the matching byte count of the reference block is less than 20 bytes for 9F (1, 5), information as shown in FIG. 4C is recorded.

以上の処理を全フレームの全ての符号化対象ブロックに対して行い、ロッシー圧縮映像信号の全てを符号化する。全ての符号化対象ブロックは、図４の（ａ）〜（ｃ）のいずれかの形式で記録されることになる。符号化された情報は、任意の記録媒体に、その記録媒体に対応したフォーマットで記録される。また、そのままネットワークを介して伝送するようにしても良い。 The above processing is performed on all the encoding target blocks of all the frames, and all the lossy compressed video signals are encoded. All the encoding target blocks are recorded in one of the formats (a) to (c) of FIG. The encoded information is recorded on an arbitrary recording medium in a format corresponding to the recording medium. Further, it may be transmitted via a network as it is.

上記の処理では、最も圧縮効果の得られる画素ブロックのみを符号化対象ブロックとして処理した。これは、ＤＶフォーマットにおいて、画素ブロックである９Ｆがデータ量の大多数を占めているため、画素ブロックだけ圧縮すれば、かなりのデータ量が削減されるためである。しかし、画素ブロック以外の他のブロックについて圧縮して、さらにデータ量を削減するようにしても良い。この場合、制御ブロック「１Ｆ」、「３Ｆ」、「５Ｆ」については、ステップＳ１における直前ブロックとの照合は行わず、ステップＳ２における直前ラインブロックとの照合、およびステップＳ３における直前フレームブロックとの照合を行う。ステップＳ２においては、符号化対象ブロック１Ｆ（ｊ）については、直前ラインブロック１Ｆ（ｊ−１）との照合を行い、３Ｆ（ｉ，ｊ）、５Ｆ（ｉ，ｊ）については、それぞれ直前ラインブロック３Ｆ（ｉ，ｊ−１）、５Ｆ（ｉ，ｊ−１）との照合を行う。制御ブロック「７Ｆ」については、ブロック「９Ｆ」と同様にステップＳ１〜ステップＳ３の３つの照合を行う。
（復号方法）
次に、上記圧縮方法により圧縮された符号データを復号解凍する方法について説明する。復号は、復号装置（コンピュータ、およびコンピュータに搭載される専用のソフトウェアプログラムで構成）により実行される。復号方法の概要を図５のフローチャートに示す。 In the above processing, only the pixel block with the most compression effect is processed as the encoding target block. This is because, in the DV format, 9F which is a pixel block occupies the majority of the data amount, so if only the pixel block is compressed, a considerable amount of data is reduced. However, the data amount may be further reduced by compressing blocks other than the pixel block. In this case, the control blocks “1F”, “3F”, and “5F” are not collated with the immediately preceding block in step S1, but are collated with the immediately preceding line block in step S2 and with the immediately preceding frame block in step S3. Perform verification. In step S2, the encoding target block 1F (j) is collated with the immediately preceding line block 1F (j-1), and 3F (i, j) and 5F (i, j) are respectively compared with the immediately preceding line. Collation with blocks 3F (i, j-1) and 5F (i, j-1) is performed. For the control block “7F”, three collations of Step S1 to Step S3 are performed in the same manner as the block “9F”.
(Decryption method)
Next, a method for decoding and decompressing code data compressed by the compression method will be described. Decoding is performed by a decoding device (comprising a computer and a dedicated software program installed in the computer). An outline of the decoding method is shown in the flowchart of FIG.

図５のフローチャートは、各符号化ブロックに対する処理について示している。符号化ブロックは、各フレームの先頭から順次復号装置に読込まれて処理される。まず、読込まれた符号化ブロックの符号化モードを認識する（ステップＳ１１）。具体的には、符号化ブロックのモードが０、１、２のいずれであるかを認識する。モード「０」の場合、すなわち、図４（ｃ）に示したようなデータが記録されている場合は、元の符号化対象ブロックのデータがそのまま記録されているので、読込んだ符号化ブロックをそのまま出力する（ステップＳ１２）。モード「２」の場合、すなわち、図４（ａ）に示したようなデータが記録されている場合は、参照ブロックのＩＤのみが記録されているので、対応する識別ＩＤで特定される参照ブロックのデータをそのまま出力する（ステップＳ１３）。モード「０」の場合、すなわち、図４（ｂ）に示したようなデータが記録されている場合は、参照ブロックのＩＤおよびマスクデータ、不一致バイトデータが記録されているので、対応する識別ＩＤで特定される参照ブロックにおいて、マスクデータで特定される位置のバイトデータを抽出し、抽出したバイトデータと、不一致バイトデータを統合することにより、符号化対象ブロックを復元する（ステップＳ１４）。 The flowchart in FIG. 5 shows processing for each coding block. The encoded block is sequentially read from the head of each frame into the decoding device and processed. First, the encoding mode of the read encoding block is recognized (step S11). Specifically, it recognizes whether the mode of the coding block is 0, 1, or 2. In the case of mode “0”, that is, when data as shown in FIG. 4C is recorded, the data of the original encoding target block is recorded as it is, so that the encoded block read Is output as it is (step S12). In the case of mode “2”, that is, when data as shown in FIG. 4A is recorded, only the reference block ID is recorded, so the reference block specified by the corresponding identification ID Are output as they are (step S13). In the case of mode “0”, that is, when data as shown in FIG. 4B is recorded, the ID, mask data, and mismatched byte data of the reference block are recorded. In the reference block specified in (2), the byte data at the position specified by the mask data is extracted, and the extracted byte data and the mismatched byte data are integrated to restore the encoding target block (step S14).

上記ステップＳ１１〜ステップＳ１４の処理を繰り返し行うことにより、符号化対象ブロックを復元していく。この結果、図１に示したようなＤＶフォーマットのデータが復元されることになる。 The block to be encoded is restored by repeatedly performing the processes in steps S11 to S14. As a result, data in the DV format as shown in FIG. 1 is restored.

本発明で圧縮対象とする映像信号を構成する圧縮フレームの構造を示す図である。It is a figure which shows the structure of the compression frame which comprises the video signal made into compression object by this invention. 本発明に係る映像信号の圧縮方法の概要を示すフローチャートである。It is a flowchart which shows the outline | summary of the compression method of the video signal which concerns on this invention. メモリ上に一時的に記録される照合データを示す図である。It is a figure which shows the collation data temporarily recorded on a memory. 本発明により得られる符号化データを示す図である。It is a figure which shows the coding data obtained by this invention. 符号化データを復号する処理の概要を示すフローチャートである。It is a flowchart which shows the outline | summary of the process which decodes coding data. 圧縮されていない原映像信号およびロッシー圧縮の中間処理により得られる映像信号を示す図である。It is a figure which shows the uncompressed original video signal and the video signal obtained by the intermediate process of lossy compression. ＤＶ方式の圧縮フレームの構造を示す図である。It is a figure which shows the structure of the compression frame of DV system.

Claims

A method of compressing the amount of information so that the compressed frame can be reproduced with respect to a video signal composed of compressed frames obtained by compressing time-series continuous frames by a lossy method,
In the compressed frame, each pixel block obtained by blocking pixels is set as an encoding target block, and with respect to each encoding target block, the immediately preceding block which is a pixel block positioned immediately before in the scanning line direction, the scanning line direction, and The reference block with the highest correlation is selected by comparing with the three reference blocks of the previous line block, which is the pixel block located immediately before in the vertical direction, and the previous frame block, which is the pixel block located at the same position in the previous frame. A reference block selection stage to perform,
Correlation determination step for determining the correlation between the selected reference block and the encoding target block from three types of perfect match, high correlation degree, and low correlation degree;
As the output data of the block to be encoded, when the correlation is a perfect match, the identification ID of the selected reference block is output, and when the correlation is high in correlation, the selected reference block Output identification ID, mask data indicating the position of the selected reference block and the matching data in the encoding target block in the block, and data in the encoding target block that does not match the reference block, A block encoding step for outputting matched data and mismatched data in the encoding target block when the correlation is low, and
A method for compressing a video signal, comprising:

In the reference block selection step, for each control block other than the pixel block included in the video signal, the immediately preceding line control block which is a control block positioned immediately before in the direction perpendicular to the scanning line direction, and the same portion of the immediately preceding frame 2. The video signal compression according to claim 1, wherein the reference block having the highest correlation is further selected by collating with two reference blocks of the immediately preceding frame control block which is a control block located at Method.

The video signal is compressed by the DV or DVCAM system, and the pixel block is 80-byte fixed length data, which is equivalent to 6 code units of discrete cosine transform composed of 8 × 8 pixels in the original image before compression. The video signal compression method according to claim 1, wherein the video signal compression method is obtained by processing 1350 code units per NTSC specification frame.

The recording medium of the video signal which recorded the code | cord | chord data obtained by any of the method of Claim 1 to 3 with respect to the given video signal.