WO2006046334A1 - Video encoder, video decoder, video encoding method, and video decoding method - Google Patents

Video encoder, video decoder, video encoding method, and video decoding method Download PDF

Info

Publication number
WO2006046334A1
WO2006046334A1 PCT/JP2005/013406 JP2005013406W WO2006046334A1 WO 2006046334 A1 WO2006046334 A1 WO 2006046334A1 JP 2005013406 W JP2005013406 W JP 2005013406W WO 2006046334 A1 WO2006046334 A1 WO 2006046334A1
Authority
WO
WIPO (PCT)
Prior art keywords
bit
state
video
decoding
unit
Prior art date
Application number
PCT/JP2005/013406
Other languages
French (fr)
Japanese (ja)
Inventor
Daijiroh Ichimura
Yoshimasa Honda
Original Assignee
Matsushita Electric Industrial Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co., Ltd. filed Critical Matsushita Electric Industrial Co., Ltd.
Publication of WO2006046334A1 publication Critical patent/WO2006046334A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/36Scalability techniques involving formatting the layers as a function of picture distortion after decoding, e.g. signal-to-noise [SNR] scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/34Scalability techniques involving progressive bit-plane based encoding of the enhancement layer, e.g. fine granular scalability [FGS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • Video coding apparatus video decoding apparatus, video coding method and video decoding method
  • the present invention relates to a video coding apparatus and method for coding video and generating a video stream, and
  • the present invention relates to a video decoding apparatus and method for decoding a video stream to generate a decoded video.
  • Video transmitted through the transmission unit is compressed into a video stream having a smaller amount of data using video coding technology, and information is efficiently transmitted.
  • video stream transmission has become increasingly popular, in which video code data that has been received is sequentially played back, rather than being completely downloaded after all video information has been downloaded.
  • the code amount used for decoding is uniquely determined and changes the quality of the video to be reproduced. It is not possible.
  • the video is encoded twice and transmitted according to each band, or the image quality and resolution of the video according to the narrow communication band, Reduce the frame rate and set the video code.
  • MPEG-4 FGS (Fine Granularity Scalable coding) is one of the scalable video coding methods defined in ISO / IEC 1449 6-2 Amendment 2, and in particular, it is necessary to finely select the picture quality of the stream. Is standardized as a possible coding method.
  • a video stream encoded by MPEG-4 FGS is composed of a base layer stream and an enhancement layer stream.
  • the base layer stream is a low-band low-quality video stream that can be decoded alone
  • the enhancement layer stream is a video stream for improving the quality of the base layer stream.
  • the amount of code to be transmitted is 1 frame (1 screen, 1 image) by hierarchically encoded layer structure and encoding process called bit-plane encoding used for enhancement layer stream. It can be controlled on a case-by-case basis and can be very flexible in responding to the transmission rate and the required image quality.
  • bit plane coding used to generate an MPEG-4 FGS enhancement layer stream.
  • FIG. 14 is a diagram showing a video encoding device 130 of MPEG-4 FGS.
  • the original video is input from the video signal input unit 132, and the base layer coding unit 134 generates a base layer stream, and the enhancement layer coding unit 140 generates an enhancement layer stream, and the base layer output unit 136. And output from the enhancement layer output unit 138.
  • enhancement layer coding section 140 a DCT transform is performed on each differential pixel, which is the difference between a base layer decoded video obtained by decoding the original video and the base layer stream, every 8 X 8 pixels (Discrete Cosine
  • the DCT unit 144 DCT-transforms the difference image to generate DCT coefficients.
  • the scan unit 146 scans DCT coefficients of 8 ⁇ 8 pixels and rearranges the scanned DCT coefficients.
  • the DCT coefficients have a statistically large absolute value biased to horizontal and vertical low frequencies, and it becomes statistically less that the horizontal and vertical high frequency coefficients have large absolute values.
  • the scanning unit 146 is used to By performing a scan that reorders the DCT coefficients towards the flat and vertical high frequency coefficients, there will be more "0" s in the second half of the reordered scanned DCT coefficients.
  • FIG. 16A shows a scan of DCT coefficients of 8 ⁇ 8 pixels. Signs representing plus and minus are encoded separately from absolute values. In this way, scan in the order of lower horizontal frequency and lower vertical frequency. As a result, the coding efficiency can be increased because the probability that "0" appears in the high frequency component is high.
  • hierarchical coding section 140 performs bit plane coding.
  • the MPEG-4 FGS performs a zero run length code and a nonman code for each bit plane from the upper bits as bit plane coding.
  • a bit plane is a bit string in which only the same bit positions of a plurality of binary numbers are arranged, and is also called a bit plane.
  • FIG. 17 is a diagram in which DCT coefficients are arranged with the horizontal axis in scan order and the vertical axis in bit units. One column represents one DCT coefficient, and one row represents one bit plane. The most significant “1” bit of the DCT coefficient is called MSB, and the blank cell in FIG. 17 is the one in which the description of “0” bit higher than MSB is omitted.
  • the bit plane containing the MSB with the largest value among the DCT coefficients is called the MSB plane.
  • Zero run length coding is to encode how many “0” appears before coefficients other than “0” appear, and by assigning one signal to a plurality of “0” s. It is a coding method that compresses the amount of information. Since only “0” or “1” appears in the bit plane, it becomes “0 length”. In addition, by encoding "whether it is the last 1 or not” in the bit plane, the "1" appearing after the "0” efficiently encodes the "0” that is biased in the second half by the scan. . In the zero run length code MPEG of MPEG-4 FGS illustrated in FIG. 17, referring to the area C, four “0” s follow and “1” s follow, but there is a “1” after that.
  • FIG. 16B is a diagram showing a bit plane of bit positions of DCT coefficients of 8 ⁇ 8 pixels. According to the scan order shown in FIG. 16A, bit B12 in FIG. 16B is "1" at the end of the bit plane, otherwise it is not “1" at the end of the bit plane.
  • Huffman coding is a type of variable-length coding, and in MPEG-4 FGS, "length following 0", ie, zero run length and "whether it is the last 1 or not", ie, bit plane end signal
  • the appearance probability is calculated in advance for each combination of numbers, and the information is compressed by assigning high probability of occurrence, short! Symbol for combination, low probability of occurrence, and long symbol for combination.
  • FIG. 15 shows an MPEG-4 FGS video decoding apparatus 150 that decodes the base layer stream and the enhancement layer stream generated by the video coding apparatus 130 to generate a decoded image.
  • the base layer decoding image generated by the base layer decoding block 156 by decoding the base layer stream and the differentially decoded image generated by the enhancement layer decoding section 160 by decoding the enhancement layer stream are added.
  • the quality of the decoded image is proportional to the amount of decoded enhancement layer stream.
  • the upper bit plane power of the DCT coefficient that has a strong influence on the quality of the decoded image is also coded and video stream is preferentially coded.
  • the image quality can be flexibly adjusted by storing in. For example, in a system for transmitting a video stream from a video transmitting terminal to a video receiving terminal, if the video receiving terminal can not receive the video stream completely, the video stream of the MPEG-4 FGS will be displayed in image quality. It is possible to decode the picture since only the lower bit planes of low influence are missing. In the case of video non-scaleable coding schemes such as MPEG-4 AVC, if the video stream can not be received completely, the subjective image quality is greatly degraded, such as the lower half of one screen can not be decoded. I have a problem.
  • the hierarchical coding unit 148 is improved in the MPEG-4 FGS video coding apparatus shown in FIG.
  • FIG. 18 shows the layer coding unit 148 described in the above publication.
  • bit separation unit 170 separates the input DCT coefficients into MSB and other non-MSBs.
  • the MSB encoding unit 172 encodes only the MSB of the DCT coefficient
  • the non-MSB encoding unit 174 encodes the non-MSB
  • the combining unit 176 combines and outputs the MSB and non-MSB codes.
  • the coding efficiency is improved by giving priority to the video stream by giving priority to the MSB which most affects the image quality of the decoded image among one DCT coefficient. “The code efficiency is good” means that when two video streams of the same data amount, which code the same original video, are decoded, the image quality of one decoded video is superior to the other. Then, the former has better coding efficiency.
  • the differential image to be encoded by the enhancement layer encoding unit 140 is a difference between the original image and the basic layer decoded image and includes many edges, so a large absolute value DCT coefficient appears in a high frequency region. There is a case.
  • the MSB of one DCT coefficient is 11 bits and the MSBs of the other DCT coefficients are all 5 bits or less, the DCT coefficients coding from 11 MSB planes to 6 bits are Even though it is one, it is assumed that a large number of “1” appears as a Huffman table, and the expected probability of occurrence and the actual deviation become large and the coding efficiency becomes worse.
  • the present invention decodes a video coding apparatus and coding data that can perform efficient coding according to the state of bits of transform coefficients to be encoded.
  • An object of the present invention is to provide a video decoding device.
  • a video encoding apparatus is a conversion coefficient that frequency-converts an image and represents frequency components.
  • Transform coefficients generated by the transform coefficient generator and the transform coefficient generator into binary numbers, and the bit plane consisting of the same order bits of the plurality of transform coefficients is from the most significant bit to the least significant bit A state in which the state of bits of transform coefficients to be encoded later is predicted based on information on a bitplane encoding unit that generates up to and encoding in order of upper bitplane power and transform coefficients that have been encoded earlier.
  • the bit plane coding unit performs coding according to the state of the bit predicted by the state prediction unit.
  • the state prediction unit may predict the state of the bits of the lower bit plane based on (1) information of the upper bit plane, or (2) the prediction target
  • the bit state may be predicted according to the number of transform coefficients in which a bit state “1” appears in the bit plane higher than the bit, or (3) the transform coefficient of the bit to be predicted is
  • the state of the bit may be predicted based on whether the state of the bit "1” appears in the upper bit or not, or (4) the state of the bit in the bit plane higher than the bit to be predicted Among the transform coefficients in which “1” appears, the state of the bit is predicted based on the distance to the closest transform coefficient where the code order is located later than the transform coefficient of the bit to be predicted. Or (5) a bit higher than the bit to be predicted.
  • the state of the bit may be predicted based on the transform coefficient at which the state of the bit “1” appears at the last position of the code sequence.
  • the state prediction unit is configured such that the state of the bit to be predicted is "1", and all the code coefficients belonging to the same bit plane belonging to the same bit plane of the transform coefficient located behind. It predicts the probability that the bit state is "0", and the bit-plane coder may perform zero run length code based on the probability.
  • the state prediction unit is configured to convert bits of the coding coefficients in the order of the codes after the predetermined order on the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and in accordance with the determination that the state “1” of the bit is not included, the state of the bit to be predicted is “1” and the order of the codes is The probability of all bits belonging to the same bit plane of the conversion coefficient located behind being "0" is predicted, and the bit plane code block performs a zero run length code block based on the probability. May be
  • the state prediction unit is a zero run length including bits to be predicted.
  • the bit plane coding unit may perform Huffman coding using a Huffman table selected based on the zero run length predicted by the state prediction unit.
  • the state prediction unit determines the number of zeros up to the point where the probability that the bit state "0" continuously appears falls below a preset threshold as the zero run length. May be
  • the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit-plane coding part uses the state prediction unit. Arithmetic coding may be performed using the occurrence probability of the symbol determined based on the predicted probability.
  • the state prediction unit is configured such that the state of the bit to be predicted is “1”, and the code order is all belonging to the same bit plane of the conversion coefficient located behind.
  • the probability of the bit state being "0" is predicted, and the bit plane coding unit performs arithmetic coding using the occurrence probability of the symbol determined based on the probability predicted by the state prediction unit. You may go.
  • the video coding apparatus is a video coding apparatus that performs hierarchical coding on a base layer capable of decoding video independently and an enhancement layer for improving the video quality of the base layer.
  • the prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on the information of the base layer, and the bit plane coding unit converts the enhancement layer according to the state of the bit predicted by the state prediction unit. You may sign the coefficients!
  • the state prediction unit is based on edge information included in the base layer or the base layer code amount, and the bit of the transform coefficient of the enhancement layer is You may predict the condition.
  • the state prediction unit predicts the state of the bit of the conversion coefficient of the frame constituting the video based on the information of the reference frame used for the motion prediction / compensation code
  • the plane code unit may code the transform coefficient of the frame according to the state of the bit predicted by the state prediction unit.
  • the state prediction unit determines the shape of bits of the transform coefficient of the frame based on edge information included in the reference frame or the code quantity of the reference frame.
  • the state may be predicted.
  • the video decoding apparatus comprises a bit plane decoding unit that decodes bit plane coded video encoded data in order of upper bit plane power, and a conversion coefficient that has been previously decoded. And a state prediction unit that predicts the state of bits of transform coefficients to be decoded later based on the information related to H.
  • the bit plane decoding unit decodes in accordance with the state of bits predicted by the state prediction unit. Do a bribe.
  • the state prediction unit may predict the state of the bits of the lower bit plane based on the information of the upper bit plane (1).
  • the bit state may be predicted according to the number of transform coefficients in which a bit state “1” appears in the bit plane higher than the bit, or (3) the transform coefficient of the bit to be predicted is The state of the bit may be predicted based on whether the state of the bit "1” appears in the upper bit or not, or (4) the state of the bit in the bit plane higher than the bit to be predicted Among the transform coefficients in which “1” appears, the state of the bit is predicted based on the distance to the closest transform coefficient to which the decoding order is located later than the transform coefficient of the bit to be predicted. (5) bits higher than the bit to be predicted In the top plane, the state of the bit is predicted based on the transform coefficient at which the state of the bit "1" appears at the position where the decoding order is the most backward.
  • the state prediction unit is configured such that the state of the bit to be predicted is “1”, and the decoding order is all belonging to the same bit plane of the conversion coefficient located behind. By predicting the probability that the bit state is "0", the bit plane decoding unit may perform zero run length decoding based on the probability.
  • the state prediction unit is configured to convert the decoding coefficients in the decoding order after the predetermined order in the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and the state of the bit to be predicted is “1” and the decoding order is determined according to the determination that the state “1” of the bit is not included. The probability of all bits belonging to the same bitplane of the conversion coefficient located behind being "0" is predicted, and the bitplane decoding unit performs zero run length decoding based on the probability. May be
  • the state prediction unit is a zero run length including bits to be predicted.
  • the bit plane decoding unit may perform Huffman decoding using the Huffman table selected based on the zero run length predicted by the state prediction unit.
  • the state prediction unit may obtain, as a zero run length, the number of zeros until the probability that the bit state “0” continuously appears falls below a preset threshold.
  • the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit plane decoding unit is operated by the state prediction unit. Arithmetic decoding may be performed using the probability of occurrence of the symbol determined based on the predicted probability.
  • the state prediction unit determines that the state of the bit to be predicted is “1”, and that the decoding order is all bits belonging to the same bit plane of the conversion coefficient positioned behind.
  • the bit-plane decoding unit predicts the probability that the state of “0” is “0”, and the bit-plane decoding unit performs arithmetic decoding using the occurrence probability of the symbol determined based on the zero run length predicted by the state prediction unit.
  • the state prediction unit predicts, based on the decoded information of the base layer, the state of the bit of the transform coefficient of the enhancement layer for improving the video quality of the base layer
  • the decoding unit may decode the transform coefficients of the enhancement layer according to the state of the bit predicted by the state prediction unit !.
  • the state prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on edge information included in the base layer or the base layer code amount. It is also good.
  • the state prediction unit predicts the state of the bit of the conversion coefficient of the frame constituting the video based on the information of the reference frame used for the motion prediction / compensation code. Then, the bit-plane decoding unit decodes the transform coefficients of the frame according to the state of the bit predicted by the state prediction unit.
  • the state prediction unit predicts the state of the bits of the frame based on edge information included in the reference frame or the amount of coding of the reference frame.
  • the bit states of the transform coefficients to be encoded later are And a state prediction step of predicting, and the bit plane coding step performs coding in accordance with the state of the bit predicted in the state prediction step.
  • the bit plane decoding step of decoding the bit plane coded picture data and the coded picture data of the picture in the order of higher bit plane power, and the conversion previously decoded are performed.
  • a state prediction step of predicting the state of bits of transform coefficients to be decoded later based on the information on the coefficient, and the bit plane decoding step is performed according to the state of the bits predicted by the state prediction unit. Perform decryption.
  • the program for video coding according to the present invention includes a conversion coefficient generation step of converting the frequency of the video into a conversion coefficient representing a frequency component and converting the video to a computer for coding the video.
  • the conversion coefficient generated in the coefficient generation step is converted to a binary number, and bit planes consisting of the same order bits of multiple conversion coefficients are generated from the most significant bit to the least significant bit.
  • the plane coding step performs coding according to the state of the predicted bit in the state prediction step.
  • the program for video decoding according to the present invention is a bit for decoding the upper bit plane power in order to the computer in order to decode the bit plane coded video data of the bit plane. Performing a plane decoding step and a state prediction step of predicting a state of a bit of a transform coefficient to be decoded later based on information on the previously decoded transform coefficient; Decoding is performed according to the state of the bit predicted by the prediction unit.
  • FIG. 1 is a diagram showing the configuration of a layer code portion according to a first embodiment of the present invention.
  • FIG. 2 is a diagram showing the configuration of a video encoding apparatus according to the first embodiment of the present invention.
  • FIG. 3 is a flowchart showing the operation of the video coding apparatus according to the first embodiment of the present invention.
  • FIG. 4 is a flow chart showing the operation of hierarchical code processing of the video coding apparatus of the first embodiment of the present invention.
  • FIG. 5 is an explanatory view of the state prediction of the present invention.
  • FIG. 6 is a diagram showing the configuration of a video decoding apparatus according to a second embodiment of the present invention.
  • FIG. 7 is a diagram showing the configuration of a hierarchical decoding module according to a second embodiment of the present invention.
  • FIG. 8 is a flowchart showing the operation of the video decoding apparatus according to the second embodiment of the present invention.
  • FIG. 9 is a flowchart showing an operation of hierarchical decoding processing of the video decoding apparatus according to the second embodiment of the present invention.
  • FIG. 10 is a diagram showing a configuration of a video encoding apparatus according to a third embodiment of the present invention.
  • FIG. 11 is a diagram showing a video encoding apparatus according to the third embodiment of the present invention. Flow chart showing operation
  • FIG. 12 shows a configuration of a video decoding apparatus according to a fourth embodiment of the present invention.
  • FIG. 13 shows a video decoding apparatus according to the fourth embodiment of the present invention.
  • FIG. 14 is a diagram showing the configuration of a video coding device of MPEG-4 FGS.
  • FIG. 15 is a diagram showing the configuration of the MPEG-4 FGS video decoding device.
  • FIG. 16A is an explanatory diagram of scanning of DCT coefficients.
  • FIG. 16B is an explanatory diagram of bit planes of DCT coefficients.
  • FIG. 17 is an explanatory diagram of a bit plane code line.
  • FIG. 18 is a diagram showing the configuration of a conventional hierarchical coding unit. BEST MODE FOR CARRYING OUT THE INVENTION
  • the video encoding apparatus is configured to convert the frequency of the video to generate a conversion coefficient representing a frequency component, and to convert the conversion coefficient generated by the conversion coefficient generation unit into a binary number.
  • a state prediction unit that predicts the state of bits of the conversion coefficient to be encoded later based on the information on the previously encoded conversion coefficient, and the bit plane coding unit is predicted by the state prediction unit. It performs coding according to the state of the bit.
  • the probability of occurrence of “1” and “0” in the state of the bit to be encoded later, ie, the bits to be encoded later, is predicted based on the information on the previously encoded transform coefficient.
  • efficient code can be obtained. For example, by assigning a code having a short code bit length to a combination having a high occurrence probability, the code efficiency is improved.
  • the state prediction unit may predict the state of the bits of the lower bit plane based on the information of the upper bit plane.
  • the state of bits can be predicted without using information other than the region to which the coefficient bits in the code belong.
  • the state prediction unit predicts the state of the bit according to the number of transform coefficients in which the state of the bit “1” appears in the bit plane higher than the bit to be predicted. It is also good.
  • the state prediction unit determines whether a bit state “1” appears in the high-order bit and V in the conversion coefficient of the bit to be predicted.
  • the state may be predicted. According to this configuration, the prediction accuracy of the bit state can be improved, and the coding efficiency can be improved.
  • the state prediction unit converts the bit to be predicted among the conversion coefficients in which the state of bit “1” appears in the bit plane higher than the bit to be predicted.
  • the state of the bit may be predicted based on the distance to the nearest transform coefficient where the code order is located behind the coefficient and the closest to the transform coefficient.
  • the state prediction unit is based on the conversion coefficient in which the state of the bit “1” appears at the last position of the code order in the bit plane higher than the bit to be predicted. First, let's predict the state of the bit.
  • the state prediction unit is configured such that the state of the bit to be predicted is “1”, and all the code coefficients belonging to the same bit plane belonging to the same bit plane of the conversion coefficient located behind It predicts the probability that the bit state is "0”, and the bit-plane coder may perform zero run length code based on the probability.
  • the state prediction unit is configured to convert bits of the coding coefficients in the order of the codes after the predetermined order in the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and in accordance with the determination that the state “1” of the bit is not included, the state of the bit to be predicted is “1” and the order of the codes is The probability of all bits belonging to the same bit plane of the conversion coefficient located behind being "0" is predicted, and the bit plane code block performs a zero run length code block based on the probability. May be
  • the state prediction unit predicts a zero run length including bits to be predicted, and the bit plane coding unit predicts a zero run predicted by the state prediction unit.
  • the state prediction unit determines the number of zeros up to the point where the probability that the bit state "0" appears continuously falls below a preset threshold as the zero run length. May be
  • the zero run length can be appropriately predicted.
  • the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit-plane coding part uses the state prediction unit. Arithmetic coding may be performed using the occurrence probability of the symbol determined based on the predicted probability.
  • the state prediction unit determines that the state of the bit to be predicted is "1", and that all the codes belonging to the same bit plane of the transform coefficient whose code sequence is located behind. The probability of the bit state being "0" is predicted, and the bit plane coding unit performs arithmetic coding using the occurrence probability of the symbol determined based on the probability predicted by the state prediction unit. You may go.
  • the video coding apparatus is a video coding apparatus that performs hierarchical coding on a base layer capable of decoding video independently and an enhancement layer for improving the video quality of the base layer.
  • the prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on the information of the base layer, and the bit plane coding unit converts the enhancement layer according to the state of the bit predicted by the state prediction unit. You may sign the coefficients! According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the enhancement layer based on the information of the enhancement layer and the base layer strongly correlated.
  • the state prediction unit is configured to use the bit of the transform coefficient of the enhancement layer based on the edge information included in the base layer or the code quantity of the base layer. You may predict the condition.
  • the state prediction unit predicts the state of the bit of the conversion coefficient of the frame constituting the video, based on the information of the reference frame used for the motion prediction / compensation code
  • the plane code unit may code the transform coefficient of the frame according to the state of the bit predicted by the state prediction unit.
  • the state of the bits of the transform coefficients of the frame in the code can be properly predicted. It is possible to predict the state of the residual signal of motion prediction compensation using the characteristics of the reference frame.
  • the code efficiency can be improved.
  • the state prediction unit predicts the bit state of the transform coefficient of the frame based on edge information included in the reference frame or the code amount of the reference frame. May be
  • the video decoding apparatus decodes a bit-plane coded video encoded data in the order of higher bit-plane coding, and a bit-plane decoding module that decodes the data first. And a state prediction unit for predicting the state of bits of the conversion coefficient to be decoded later based on the information on the converted conversion coefficient, the bit plane decoding unit including bits predicted by the state prediction unit. Decode according to the state of.
  • the state prediction unit may predict the state of the bits of the lower bit plane based on the information of the upper bit plane.
  • the state of the bits of the lower bit plane can be appropriately predicted, and coding can be performed efficiently.
  • the encoded data can be decoded.
  • the state of bits can be predicted without using information other than the region to which the bit of coefficient being decoded belongs.
  • the state prediction unit predicts the state of the bit according to the number of transform coefficients in which the state of the bit “1” appears in the bit plane higher than the bit to be predicted. It is also good.
  • the state prediction unit determines the conversion coefficient of the bit to be predicted.
  • the state of the bit may be predicted based on whether or not the state of the bit "1" appears in V, the upper bits.
  • the state prediction unit converts the bit to be predicted among the conversion coefficients in which the bit state “1” appears in the bit plane higher than the bit to be predicted.
  • the state of the bit may be predicted based on the distance to the nearest transform coefficient where the decoding order is located behind the coefficient and the nearest to it.
  • the state prediction unit is based on the conversion coefficient at which the state of the bit "1" appeared at the position of the rearmost in the decoding order in the bit plane higher than the bit to be predicted. First, let's predict the state of the bit.
  • the state prediction unit determines that the state of the bit to be predicted is “1”, and that the decoding order is all belonging to the same bit plane of the conversion coefficient located behind. By predicting the probability that the bit state is "0", the bit plane decoding unit may perform zero run length decoding based on the probability.
  • the state prediction unit is configured to convert the decoding coefficients in the decoding order after the predetermined order in the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and the state of the bit to be predicted is “1” and the decoding order is determined according to the determination that the state “1” of the bit is not included. The probability of all bits belonging to the same bitplane of the conversion coefficient located behind being "0" is predicted, and the bitplane decoding unit performs zero run length decoding based on the probability. May be
  • the state prediction unit predicts a zero run length including bits to be predicted, and the bit plane decoding unit is based on the zero run length predicted by the state prediction unit. Do the Huffman decoding using the selected Huffman table.
  • the state prediction unit may obtain, as the zero run length, the number of zeros until the probability that the bit state “0” appears continuously falls below a preset threshold.
  • the zero run length can be appropriately predicted.
  • the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit plane decoding unit uses the state prediction unit to Arithmetic decoding may be performed using the probability of occurrence of the symbol determined based on the predicted probability.
  • the code can be efficiently encoded using the occurrence probability selected according to the state of the bit.
  • the encoded data can be decoded.
  • the state prediction unit determines that the state of the bit to be predicted is “1”, and that the decoding order is all bits belonging to the same bit plane of the conversion coefficient positioned behind.
  • the bit-plane decoding unit predicts the probability that the state of “0” is “0”, and the bit-plane decoding unit performs arithmetic decoding using the occurrence probability of the symbol determined based on the zero run length predicted by the state prediction unit.
  • the state prediction unit predicts, based on the decoded information of the base layer, the state of the bit of the transform coefficient of the enhancement layer for improving the video quality of the base layer
  • the plane decoding unit may decode the transform coefficients of the enhancement layer according to the state of the bit predicted by the state prediction unit !.
  • the state prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on edge information included in the base layer or the code quantity of the base layer. It is also good.
  • the state prediction unit predicts the state of the bit of the conversion coefficient of the frame making up the video based on the information of the reference frame used for the motion prediction / compensation code, and the bit is
  • the plane decoding unit may decode the transform coefficients of the frame according to the state of the bit predicted by the state prediction unit.
  • the state prediction unit predicts the state of the bits of the frame based on edge information included in the reference frame or the amount of coding of the reference frame. According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the frame in the decoding process.
  • the state prediction step of predicting the state of the bits of the transform coefficient to be encoded later based on the information on the previously encoded transform coefficient, the bit plane encoding step comprising the steps of: Coding is performed according to the state of the predicted bit.
  • a bit-plane decoding step of decoding bit-code coded video coded data in order of upper bit-plane power, and decoding first.
  • a state prediction step of predicting the state of bits of the conversion coefficient to be decoded later based on the information on the converted conversion coefficient, the bit plane decoding step including the state of the bit predicted by the state prediction unit.
  • the program for video coding is a conversion coefficient generation step of frequency converting the video into a computer and generating a conversion coefficient representing a frequency component in order to code the video. Then, the conversion coefficient generated in the conversion coefficient generation step is converted into a binary number, and a bit plane consisting of the same order bits of a plurality of conversion coefficients is generated from the most significant bit to the least significant bit, Bit-Plane Force A bit-plane encoding step of encoding in order, and a state prediction step of predicting the state of bits of transform coefficients to be encoded later based on information on transform coefficients encoded earlier. Let the bit run The plane coding step performs coding according to the state of the predicted bit in the state prediction step.
  • the program for video decoding according to the present embodiment is to decode the upper bit plane power in order to the computer in order to decode the bit plane coded video data of the bit plane code.
  • the bit plane decoding step is performed, and the state prediction step is performed to predict the state of bits of the transform coefficient to be decoded later based on the information on the transform coefficient decoded earlier.
  • the decoding step performs decoding in accordance with the state of the bit predicted by the state prediction unit.
  • the state of the bit to be encoded later based on the information on the previously encoded transformation coefficient, ie, the bit to be encoded later.
  • Good efficiency by predicting the appearance probability of '1' and '0' in It has the excellent effect of being able to perform the ⁇ sign.
  • a video coding system in a scalable video coding system, a video coding system will be described which predicts the state of the enhancement layer using the information of the enhancement layer encoded earlier and performs coding.
  • the video encoding apparatus uses the encoded / decoded information, which is the information stored in the video stream as the encoded information, to the encoded information, which is the information in the encoded information.
  • a state prediction parameter representing a state of coding information is predicted, and code information is coded based on the state prediction parameter.
  • FIG. 1 is a block diagram showing the configuration of hierarchical code input unit 26 in video encoding apparatus 10
  • FIG. 1 is a block diagram showing the configuration of a video coding apparatus 10 according to a first embodiment of the present invention.
  • the video encoding device 10 includes a video signal input unit 12, a base layer coding unit 14, a base layer output unit 16, an enhancement layer coding unit 18, and an enhancement layer output unit 28.
  • the enhancement layer coding unit 18 has a difference unit 20, a DCT unit 22, a scan unit 24, and a hierarchical coding unit 26.
  • the video signal input unit 12 receives an image as an original image frame by frame from the outside of the video code processing apparatus 10 and outputs the video to the base layer code communication unit 14 and the enhancement layer coding circuit unit 18. The presence or absence of a video input from the outside of the video encoding device 10 is determined, and if there is no video input, the processing ends.
  • the base layer coding unit 14 codes the original image input from the video signal input unit 12 to generate a base layer stream, and outputs the generated base layer stream to the base layer output unit 16. Do.
  • the base layer stream is decoded to generate a base layer decoded image, and the generated base layer decoded image is output to the enhancement layer coding unit 18.
  • the base layer output unit 16 outputs the base layer stream input from the base layer code input unit 14 to the outside of the video code input device 10.
  • the difference unit 20 generates a difference image by calculating the difference between the original image input from the video signal input unit 12 and the base layer decoded image input from the base layer code input unit 14.
  • the difference image is output to the DCT unit 22.
  • the DCT unit 22 divides the difference image input from the difference unit 20 into blocks, which are areas of 8 ⁇ 8 pixels, performs DCT transform for each block, and generates DCT coefficients.
  • the DCT coefficients are output to the scan unit 24.
  • the DCT unit 22 corresponds to a transform coefficient generation unit of the present invention.
  • the scan unit 24 scans the DCT coefficients input from the DCT unit 22 in a predetermined order to generate scanned DCT coefficients, and outputs the generated scanned DCT coefficients to the hierarchical coding unit 26. Do.
  • the layer coding unit 26 in the first embodiment will be described.
  • the hierarchical coding unit 26 has a zero run length coding unit 30, a variable length coding unit 32, and a state prediction unit 34.
  • the zero run length coding unit 30 zero-scans the scanned DCT coefficients input from the scan unit 24 into a combination of a zero run length and a bit plane end signal for each bit plane, and performs variable-length code coding. Output to the conversion unit 32.
  • variable length coding unit 32 uses the state prediction parameter input from the state prediction unit 34 to combine the zero run length and bit plane end signal input from the zero run length coding unit 30. Variable-length code.
  • the variable-length coding unit 32 outputs the enhancement layer stream generated by the encoding to the enhancement layer output unit 28.
  • the state prediction unit 34 generates a state prediction parameter by predicting the state of code information which is a bit in the scanned DCT coefficient power input from the scan unit 24 to the hierarchical code unit 26. Then, the generated state prediction parameters are output to the variable-length coding unit 32.
  • the state prediction unit 34 corresponds to the state prediction unit of the present invention, and the variable length coding unit 32 corresponds to the bit plane coding unit.
  • FIG. 3 is a flow chart showing an example of the operation of the video encoding apparatus 10 of the first embodiment shown in FIGS. 1 and 2.
  • the flowchart shown in FIG. 3 causes a control program stored in a storage device (for example, ROM, flash memory, etc.) not shown to be executed as software by executing the program by the CPU not shown also executing the control program. It is also possible.
  • a storage device for example, ROM, flash memory, etc.
  • the image encoding device 10 performs video signal input processing (S10) o Specifically, a video signal input unit 12, one frame image from the outside of the video encoder spoon 10 original images As an input, it is outputted to the base layer code block 14 and the enhancement layer code block 18.
  • S10 video signal input processing
  • the video encoding device 10 performs base layer encoding processing (S12). Specifically, the base layer coding unit 14 codes the original image input from the video signal input unit 12 to generate a base layer stream, and the generated base layer stream is output to the base layer output unit 1. Output to 6. Base layer code block 14 units A base layer stream is decoded and a base layer decoded image is generated, and the generated base layer decoded image is an enhancement layer coding unit. Output to 18
  • a base layer decoded image is a decoded image generated by performing intermediate processing on a base layer stream if the base layer stream can not be generated by decoding the base layer stream, and an identical one can be generated. But good!
  • the video encoding device 10 performs differential processing (S14). Specifically, the difference unit 20 of the enhancement layer coding unit 18 calculates the difference between the original image input from the video signal input unit 12 and the base layer decoded image input from the base layer coding unit 14. As a result, a difference image is generated, and the generated difference image is output to the DCT unit 22.
  • the video encoding device 10 performs a DCT process (S16). Specifically, the DCT unit 22 divides the difference image input from the difference unit 20 into blocks of an area of 8 ⁇ 8 pixels, performs DCT transform for each block, and generates DCT coefficients. The DCT coefficient is output to the scan unit 24.
  • the division method of the area at the time of DCT conversion is not limited to 8 ⁇ 8 pixels.
  • the method of performing frequency conversion of video is not limited to DCT conversion, and other orthogonal conversion such as Wavelet conversion may be performed.
  • the coding efficiency is inferior, it is not necessary to perform any orthogonal transformation.
  • the video encoding device 10 performs a scan process (S18). Specifically, the scanning unit 24 scans the DCT coefficients input from the DCT unit 22 in a predetermined order to generate scanned DCT coefficients, and generates the scanned DCT coefficients in the hierarchical coding unit 26. Output.
  • the scan performed by the scan unit 24 is not limited to the order shown in FIG. 16, but may be performed in another order.
  • scanning may be performed after quantization processing to reduce the values by dividing the DCT coefficients by a predetermined number. In that case, it is necessary to perform inverse quantization processing for decoding.
  • FIG. 4 is a flowchart showing an example of the hierarchical encoding process (S20).
  • the hierarchical encoding unit 26 performs zero run length encoding processing (S30).
  • the scanned DCT coefficients input from the zero run length coding unit 30 to the hierarchical code unit 26 from the zero power scan unit 24 are divided into several zero run lengths and bit plane end signals for each bit plane.
  • the combination is zero-run length encoded and output to the variable-length encoder 32.
  • the bit plane end signal being ON indicates that "1" following the zero run signal is the last "1" of the bit plane.
  • the OF F indicates that it is not the last “1”.
  • a zero run length code may be obtained by putting together a plurality of bit planes into a scanned DCT coefficient.
  • a level signal indicating “a force that“ 0 ”follows and then what number has appeared” is required.
  • a level signal indicating “a force that“ 0 ”follows and then what number has appeared” is required.
  • 3 bit planes are grouped together and subjected to zero run length coding, there is a possibility that values “1” to “7” other than “0” may appear after the zero run.
  • a level signal indicating In the present embodiment, only the zero run length and the bit plane end signal are described for simplicity of description.
  • the hierarchical coding unit 26 performs state prediction processing (S32). Specifically, the state prediction unit
  • the state prediction parameter generated by the state prediction unit 34 is a zero run length formed by a number of bits included in the encoded information or a probability that each bit included in the code information is “1”, and a bit plane end signal is ON. The probability is
  • FIG. 5 is a diagram showing an example of the state prediction of the present invention.
  • FIG. 5 is a diagram in which the DCT coefficients of a certain block are arranged in the order of scanning on the horizontal axis and in units of bits on the vertical axis. For convenience, the distance between two points on the scan axis is called the scan distance, and the distance from the left edge of the scan axis is called the scan coordinates.
  • Region C in FIG. 5 indicates a bit group to be subjected to state prediction. The leftmost bit of bit B4 in region C is the bit currently in the code.
  • variable length coding unit 32 performs Huffman coding
  • Huffman code ⁇ a zero run length is predicted.
  • the sign information of the DCT coefficient in which the MSB appears in the upper bits for example, the bit B in FIG. 7 and B8 etc.
  • the appearance probability of “0” and “1” appears at the top with MSB appearing! /,! /, DCT coefficient bit B 4, B 5 etc. has higher value “1”
  • the probability is predicted to be 50%, for example.
  • bits B4 and B5 in FIG. 5 predict the probability that the value is "1" lower than 50%.
  • bit B4 the probability that the bit without MSB appearing is “1” is 50%, and the probability that the bit with MSB appearing is “1” is 25%, probability that “0” follows bit B5 0%
  • the number of consecutive “0s” is considered as the prediction of zero run length when it falls below.
  • the leftmost bit B4 of the region C is MSB at the upper end and is therefore ⁇ , so the probability of being “0” is 75%, and the probability of the bit B5 to its right is also “0” is 75%.
  • the next bit B6 also appears MSB at the upper level! /,!
  • the state prediction unit 34 may predict the zero run length to be shorter as the bit position becomes lower.
  • the zero run length may be set shorter as the number of MSBs included in the bit plane higher than the bit plane to which the zero run length belongs increases.
  • the high frequency coefficient of the DCT coefficient has many small absolute values, and the MSB does not appear.
  • the probability that the sign information of the DCT coefficient is “1” is low. Therefore, the larger the scan coordinate of the zero run start bit, the smaller the probability that the bit “1” will appear, and therefore the state prediction unit 34 may extend the zero run length to be predicted.
  • the state prediction unit 34 predicts the probability of the bit plane end signal being ON to be higher as the bit behind the scan axis.
  • the state prediction unit 34 reduces the probability of ON if the scan coordinate is smaller than the MSB located at the end of the scan axis in the upper bit plane with respect to the code information. Measure and predict the probability of ON high if it is large. For example, in FIG. 5, if the scan coordinate is smaller than bit B1, the ON probability is predicted higher if it is lower.
  • the bit plane end signal may not be predicted for the bit plane to which the code information belongs. For example, when the bit plane end signal of the upper bit plane is at the 62nd DCT coefficient in the 8 ⁇ 8 DCT conversion, the bit plane with a smaller number of “0” can be omitted by the bit plane end signal in lower bit planes. It is possible to improve the coding efficiency by not coding the end signal.
  • the state prediction unit 34 When predicting the state of code information, the state prediction unit 34 according to the present embodiment performs prediction using only the upper bit plane in the block to which the coding information belongs. As a result, in the case where the user who decodes the video stream decodes only the specific block, it is sufficient to decode the enhancement layer of the specific block only, so the whole is decoded. The processing load of decoding can be further reduced. In addition, it is possible to save the bit rate by transmitting only a part of the video stream over the network.
  • the hierarchical coding unit 26 performs variable-length coding processing (S34). Specifically, the variable length code input unit 32 combines the zero run length and bit plane end signal combination input from the zero run length code input unit 30 with the state prediction parameter input from the state prediction unit 34. Use variable-length coding.
  • the layer coding unit 26 outputs the enhancement layer stream generated by the encoding to the enhancement layer output unit 28.
  • variable-length coding unit 32 performs the Huffman coding.
  • the video encoding device 10 has a plurality of Huffman tables in advance.
  • the coding table is improved by assigning short codes to frequently appearing combinations and long codes to high combinations.
  • variable-length coding unit 32 uses a Huffman tape in which a short code is assigned to a combination having a length and a zero run length. Select and Huffman encode. Conversely, when coding a combination predicted to have a short zero run length, the Huffman table is selected by selecting a short and a Huffman table to which a short code is assigned to a combination having a zero run length. [0163] When coding a combination predicted that the bit plane end signal is likely to be OFF, the variable-length coding unit 32 sets a short code to the combination in which the bit plane end signal power is OFF. Select the assigned Nomfman table and perform Huffman coding.
  • the video encoding device 10 performs stream output processing (S22). Specifically, the base layer output unit 16 outputs the base layer stream input from the base layer code input unit 14 to the outside of the video code input unit 10.
  • the enhancement layer output unit 28 outputs the enhancement layer stream input from the enhancement layer coding unit 18 to the outside of the video coding unit 10.
  • the video encoding device 10 performs an end determination process (S24). Specifically, the video signal input unit 12 determines the presence or absence of the video input from the outside of the video encoding device 10. As a result of the determination, if there is no video input, the processing is ended, and if there is a video input, the processing returns to the video input processing (S10).
  • the video coding apparatus according to the first embodiment of the present invention has been described above.
  • the video encoding device 10 uses the state of the MSB of the higher bit plane of the enhancement layer for the encoding information of the enhancement layer to set the zero run length or “1 By predicting the probability that the bit plane end signal is ON, and replacing the Huffman table used for the Huffman code based on the prediction, the code of the variable-length code It is possible to improve the chewing efficiency.
  • the video encoding device 10 predicts the state of code information from the upper bit plane belonging to the same block, independence between blocks is maintained. It is possible to code only the block of interest of the user performing decoding. This can reduce the coding processing load. It is also possible to save bitrate by transmitting only the video stream that corresponds to the block of interest.
  • variable length coding unit 32 may perform arithmetic coding. State prediction in the case of arithmetic coding will be described.
  • the state prediction unit 34 predicts that the probability that the low-order bit of the DCT coefficient in which the MSB appears at the high order is "1" is 50%.
  • the other coding information has a higher probability of being "1” as the lower bit-planes. If the number of MSBs included in the bit plane higher than the bit plane to which the encoding information belongs is large, the probability of being “1” may be increased. The larger the scan coordinates of the zero run start bit, the smaller the probability that a "1" bit will appear may be.
  • the state prediction unit 34 predicts the probability that the bit plane end signal is ON for each bit. As in the case of the Huffman code, if the scan coordinate is smaller than the MSB located at the end of the scan axis in the upper bit plane, the probability of ON is predicted low, and if it is large, the probability of ON is predicted high. For example, in FIG. 5, if the scan coordinate is smaller than bit B1, the ON probability is predicted high if it is too low.
  • the variable-length coding unit 32 performs variable-length coding using the state prediction parameters generated by the state prediction unit 34.
  • the state prediction parameter used here the probability that the code information is “1” and the probability that the bit plane end signal is ON are used as the appearance probability of the symbol necessary for the arithmetic coding. It is arithmetically encoded with two kinds of symbols whether the encoding information is “0” or “1”, and the bit plane end signal of the encoding information “1” is ON. OFF is OFF If you use two different symbols, you need to use another arithmetic code. Whether the encoding information is “0” or whether the bit plane end signal is “1” of OFF or the bit plane end signal is “1” of ON can be arithmetically coded with three kinds of symbols. ,.
  • a state prediction parameter representing the state of the decoded information is predicted from the decoded information decoded earlier with respect to the decoded information which is information being decoded, and the state prediction is performed.
  • a video decoding apparatus that performs decoding based on parameters will be described.
  • a video decoding apparatus for decoding a video stream generated by the video coding apparatus of the first embodiment will be described.
  • FIG. 6 is a block diagram showing the configuration of a video decoding / decoding device 40 according to the second embodiment
  • FIG. 7 is a diagram showing the configuration of the hierarchical decoding unit 50.
  • the video decoding device 40 has a base layer input unit 42, a base layer decoding unit 44, an enhancement layer input unit 46, an enhancement layer decoding unit 48, and a video signal output unit 58.
  • the enhancement layer decoding unit 48 includes a hierarchical decoding unit 50, an inverse scan unit 52, an inverse DCT unit 54, and an addition unit 56.
  • Base layer input section 42 also inputs the base layer stream as an external force of video decoding / decoding apparatus 40, and outputs the same to base layer decoding section 44.
  • the base layer input unit 42 determines the presence or absence of the input of the base layer stream from the outside, and ends the processing if there is no input of the base layer stream.
  • Base layer decoding section 44 decodes the base layer stream input from base layer input section 42 to generate a base layer decoded image, and the generated base layer decoded image is subjected to enhancement layer decoding. It is output to the head unit 48 and the video signal output unit 58.
  • the enhancement layer input unit 46 also receives an enhancement layer stream as an external force of the video decoding / decoding device 40 and outputs the enhancement layer stream to the enhancement layer decoding unit 48.
  • the video signal output unit 58 includes the base layer decoded image input from the base layer decoding unit 44 and the enhancement layer decoded image input from the enhancement layer decoding unit 48 in the video decoding apparatus 40. Output to the outside.
  • reverse scanning unit 52 performs reverse scanning to rearrange scanned DCT coefficients input from hierarchical decoding unit 50 into defined river page numbers to generate DCT coefficients. And outputs the generated DCT coefficients to the inverse DCT unit 54.
  • the inverse DCT unit 54 generates a differentially decoded image by performing inverse DCT on the block basis and applying DCT coefficients input from the inverse scan unit 52 to the block.
  • the inverse DCT unit 54 outputs the generated differentially decoded image to the addition unit 56.
  • the addition unit 56 adds the inverse of the base layer decoded image input from the base layer decoding unit 44 and the inverse D
  • the differential decoded image input from the CT unit 54 is added to generate an enhancement layer decoded image.
  • the addition unit 56 outputs the generated enhancement layer decoded ⁇ image to the video signal output unit 58.
  • FIG. 7 is a block diagram showing a configuration of hierarchical decoding unit 50 in the second embodiment.
  • the hierarchical decoding unit 50 includes a variable-length decoding unit 60, a zero run length decoding unit 62, and a state prediction unit 64.
  • variable-length decoding unit 60 performs variable-length decoding on the enhancement layer stream input from the enhancement layer input unit 46 using the state prediction parameters input from the state prediction unit 64 to obtain a zero run length. Generate a combination of and the bit plane end signal. The variable length decoding unit 60 outputs the combination of the generated zero run length and bit plane end signal to the zero run length decoding unit 62.
  • the zero run length decoding unit 62 decodes the combination of the zero run length and the bit plane end signal input from the variable length decoding unit 60 to generate a scanned DCT coefficient.
  • the zero run length decoding unit 60 outputs the generated scanned DCT coefficients to the reverse scan unit 52.
  • the state prediction unit 64 uses the scanned DCT coefficients input from the zero run length decoding unit 62, and the zero run length or the decoded information during the variable length decoding unit 60 is decoded.
  • the state prediction parameter is generated by predicting the state of the bit plane end signal and the probability that “1” is “1.
  • the state prediction unit 60 outputs the generated state prediction parameter to the variable-length decoding unit 60.
  • FIG. 8 is a flowchart showing an example of the operation of the video decoding device 40 according to the second embodiment. Note that the flowchart shown in FIG. 8 is executed as software by executing a control program stored in a storage device (for example, ROM, flash memory, etc.) (not shown) by the CPU (not shown). It is also possible to execute a control program stored in a storage device (for example, ROM, flash memory, etc.) (not shown) by the CPU (not shown). It is also possible to
  • the video decoding apparatus 40 performs stream input processing (S40). Specifically, the base layer input unit 42 also inputs the base layer stream as an external force of the video decoding apparatus 40 and outputs the base layer stream to the base layer decoding unit 44.
  • the enhancement layer input unit 46 receives the enhancement layer stream from the outside of the video decoding device 40 and outputs the stream to the enhancement layer decoding unit 48.
  • the video decoding apparatus 40 performs base layer decoding processing (S42). Specifically, the base layer decoding unit 44 decodes the base layer stream input from the base layer input unit 42 to generate a base layer decoded image. The base layer decoding unit 44 outputs the generated base layer decoded image to the enhancement layer decoding unit 48 and the video signal output unit 58. Do.
  • the video decoding apparatus 40 performs hierarchical coding processing (S44). Specifically, the enhancement layer decoding unit 48 decodes the enhancement layer stream input from the enhancement layer input unit 46 to generate an enhancement layer decoded image. The enhancement layer decoding unit 48 outputs the generated enhancement layer decoded image to the video signal output unit 58.
  • FIG. 9 is a flowchart illustrating an example of the hierarchical decoding process (S44).
  • the hierarchy decoding unit 50 performs state prediction processing (S60). Specifically, using the scanned DCT coefficients input from the zero run length decoding unit 62, the state prediction unit 64 uses the scanned DCT coefficients to perform zero run length during decoding or decoding performed by the variable length decoding unit 60.
  • the state prediction parameter is generated by predicting the probability that the information is “1” and the state of the bit plane end signal, and the generated state prediction parameter is output to the variable length decoding unit 60.
  • the prediction performed by the state prediction unit 64 is the same as that performed in the first embodiment, and the state prediction parameter generated by the state prediction unit 64 is a zero run length formed by consecutive bits including decoding information.
  • the state prediction unit 64 needs to always input the latest scanned DCT coefficient from the zero run length decoding unit 62 in order to predict the state of the decoded information.
  • the hierarchical decoding unit 50 performs variable-length decoding processing (S62). Specifically, the variable-length decoding unit 60 performs variable-length decoding on the enhancement layer stream input from the enhancement layer input unit 46 using the state prediction parameters input from the state prediction unit 64. Generate a combination of zero run length and bit plane end signal. The variable length decoding unit 60 outputs the combination of the generated zero run length and bit plane end signal to the zero run length decoding unit 62.
  • the method of selecting a Huffman table used for variable length decoding based on the state prediction parameter is the same as that of the first embodiment.
  • layer coding section 50 performs zero run length decoding processing (S64) 0 Specifically, zero run length decoding section 62 receives an input from variable length decoding section 60. The combination of the zero run length and the bit plane end signal is decoded to generate scanned DCT coefficients. The zero run length decoding unit 62 outputs the generated scanned DCT coefficients to the reverse scan unit 52.
  • the hierarchical decoding process (S44) has been described above.
  • the video decoding device 40 performs reverse scan processing (S46). Specifically, the reverse scan unit 52 performs reverse scan to rearrange the scanned DCT coefficients input from the hierarchical decoding unit 52 in a predetermined order to generate DCT coefficients, and generates the generated DCT coefficients. Output to the inverse DCT unit 54.
  • the video decoding / decoding device 40 performs inverse DCT processing (S48). Specifically, the DCT coefficient input from the inverse DCT unit 54 1S inverse scan unit 52 is subjected to inverse DCT for each block, and the inverse DCT converted blocks are synthesized to generate a differentially decoded image. The inverse DCT unit 54 outputs the generated differentially decoded image to the addition unit 56.
  • the video decoding apparatus 40 performs addition processing (S50). Specifically, the addition unit 56 adds the base layer decoded image input to the enhancement layer decoding unit 48 by the base layer decoding unit 44 and the differentially decoded image input from the inverse DCT unit 54. To generate an enhancement layer decoded image. The addition unit 56 outputs the generated enhancement layer decoded image to the video signal output unit 58.
  • the video decoding / decoding device 40 performs video signal output processing (S 52). Specifically, the video signal output unit 58 decodes the base layer decoded image input from the base layer decoding unit 44 and the enhancement layer decoded image input from the enhancement layer decoding unit 48. Output to the outside of the device 40. The video signal output unit 58 may output only one of the base layer decoded image and the enhancement layer decoded image or the outer layer image to the outside.
  • the video decoding apparatus 40 performs an end determination process (S 54) 0 Specifically, the basic layer input unit 42 externally determines the presence / absence of the input of the base layer stream. As a result, if there is no input of the base layer stream, the processing ends, and if not, it returns to the stream input processing (S40).
  • the video decoding apparatus 40 according to the second embodiment of the present invention has been described above.
  • the video decoding apparatus 40 uses the state of the MSB of the bit plane higher than the bit in the decoding table to set the zero run length of the coded bit and the value “0”.
  • the code amount for decoding the decoded information is calculated by predicting the probability of being “1” or the probability that the bit plane end signal is ON, replacing the Huffman table used for the Huffman decoding based on the prediction. It is possible to reduce the number and to improve the image quality of the image.
  • since the video decoding apparatus 40 predicts the state of the decoded information from the upper bit plane belonging to the same block, independence between blocks is maintained. It is possible to decode only the block of interest of the user performing decoding. This makes it possible to lighten the processing load of decryption. It is also possible to save the bit rate by transmitting only the video stream that corresponds to the block of interest.
  • variable length decoding is performed.
  • a section 60 performs arithmetic decoding.
  • arithmetic decoding is performed by changing the appearance probability of symbols used for arithmetic decoding based on the predicted bit state.
  • a video coding system which predicts the state of the enhancement layer using the information of the base layer encoded earlier and performs coding. Based on the information previously encoded and stored in the video stream of the base layer V, a state prediction parameter representing the state of the enhancement layer is generated, and based on the generated state prediction parameter, the code information is Sign it.
  • FIG. 10 is a block diagram showing the configuration of a video coding apparatus 70 according to the third embodiment of the present invention.
  • the video coding device 70 has a video signal input unit 72, a basic layer coding unit 74, a base layer output unit 76, an enhancement layer coding unit 78, and an enhancement layer output unit 90.
  • the enhancement layer coding unit 78 includes a difference unit 80, a DCT unit 82, a scan unit 84, a layer coding unit 86, and a state prediction unit 88.
  • the configuration of the video encoding device 70 according to the third embodiment is basically the same as the video encoding device 10 according to the first embodiment, but in the third embodiment, the hierarchical encoding is performed.
  • the functions of the unit 86 and the state prediction unit 88 are different from those of the first embodiment.
  • the video signal input unit 72 receives an image as an original image frame by frame from the outside of the video code processing apparatus 70 and outputs the video to the base layer code processing unit 74 and the enhancement layer coding unit 78. It is determined whether there is a video input from the outside of the video encoding device 70, and if there is no video input, the processing is terminated.
  • Base layer coding section 74 encodes the original image input from video signal input section 72. The base layer stream is generated, and the generated base layer stream is output to the base layer output unit 76. Further, base layer coding section 74 decodes the base layer stream to generate a base layer decoded image, and outputs the generated base layer decoded image to enhancement layer coding section 78.
  • the base layer output unit 76 outputs the base layer stream input from the base layer coding unit 74 to the outside of the video coding unit 70.
  • the enhancement layer output unit 90 outputs the enhancement layer stream input from the enhancement layer coding unit 78 to the outside of the video coding unit 70.
  • the difference unit 80 generates a difference image by calculating the difference between the original image input from the video signal input unit 72 and the base layer decoded image input from the base layer coding unit 74.
  • the difference image is output to the DCT unit 82.
  • the DCT unit 82 divides the differential image input from the differential unit 80 into blocks which are areas of 8 ⁇ 8 pixels, performs DCT transform for each block, and generates DCT coefficients.
  • the CT coefficient is output to the scan unit 84.
  • the scan unit 84 scans the DCT coefficients input from the DCT unit 82 in a predetermined order to generate scanned DCT coefficients, and generates the scanned DCT coefficients in the hierarchical coding unit.
  • State prediction unit 90 generates a state prediction parameter based on the base layer decoded image input from base layer coding unit 74, and generates the state prediction parameter as a hierarchical code. Output to section 86.
  • FIG. 11 is a flow chart showing an example of the operation of the video coding apparatus 70 of the third embodiment.
  • the flowchart shown in FIG. 3 is executed as software by executing a control program stored in a storage device (for example, a ROM, a flash memory, etc.) (not shown) and executed by the CPU (not shown). It is also possible to
  • the video encoding device 70 performs video signal input processing (S70). Specifically, the video signal input unit 72 converts the video into an original image frame by frame from the outside of the video coding device 70. Input to the base layer code section 74 and the enhancement layer code section 78.
  • the video encoding device 70 performs base layer encoding processing (S72). Specifically, the base layer coding unit 74 encodes the original image input from the video signal input unit 72 to generate a base layer stream, and the generated base layer stream is output to the base layer output unit 76. Output. Base layer coding section 74 decodes the base layer stream to generate a base layer decoded image, and outputs the generated base layer decoded image to enhancement layer coding section 78. Similar to the first embodiment, the base layer coding method uses the existing method such as MPEG-4 AVC.
  • the video encoding device 70 performs differential processing (S 74). Specifically, the difference unit 80 obtains a difference between the original image input from the video signal input unit 72 and the base layer decoded image input from the base layer coding unit 74 to generate a difference image. The difference unit 80 outputs the generated difference image to the DCT unit 82.
  • the video encoding device 70 performs state prediction processing (S76). Specifically, the state prediction unit 88 generates a state prediction parameter based on the base layer decoded image input from the base layer coding unit 74, and the generated state prediction parameter is transmitted to the layer coding unit 86. Output to
  • the base layer decoded image is divided into blocks according to the processing block of the DCT unit 82, the characteristics are checked for each block, and the state of the DCT coefficient of the corresponding block of the enhancement layer Predict.
  • the characteristics of a block are the amount of code when a block is coded as a base layer stream, and the amount of edges included in the base layer decoded image of that block. When the number of edges included in the block is large, the code amount is also large.
  • the amount of edge is calculated by using a known Roberts filter, Sobel filter, DCT transform, and using the absolute sum of DCT coefficients.
  • the state of the block's enhancement layer is the amount of edges that the corresponding block of the difference image contains.
  • the state prediction unit 88 predicts that the DCT coefficient of the enhancement layer contains a large value at a high frequency as the certain block of the base layer decoded image contains more edges.
  • the bias to the high frequency coefficient of the DCT coefficient of the enhancement layer is used as the state prediction parameter.
  • the video code processing apparatus 70 performs a DCT process (S 78). Specifically, the DCT unit 82 divides the difference image input from the difference unit 80 into blocks which are regions of 8 ⁇ 8 pixels, performs DCT transform for each block, and generates DCT coefficients.
  • the DCT coefficient is output to the scan unit 84.
  • the video code processing apparatus 70 performs a scan process (S80). Specifically, the scanning unit 84 scans the DCT coefficients input from the four-force DCT unit 82 in a predetermined order to generate scanned DCT coefficients, and the generated scanned DCT coefficients are stored in the layer coding unit 86. Output to
  • the video encoding device 70 performs hierarchical coding processing (S 82). Specifically, the hierarchical encoding unit 86 encodes the scanned DCT coefficients input from the scanning unit 84 using the state prediction parameters input from the state prediction unit 88, and generates the encoded coefficients. The enhancement layer stream thus output is output to the enhancement layer output unit 90.
  • the scanned DCT coefficients for each block are subjected to zero run length coding for each bit plane to generate a combination of a zero run length and a bit plane end signal, Variable length coding is performed using state prediction parameters.
  • Hierarchical code unit 86 performs Huffman coding as variable-length coding.
  • state prediction unit 88 predicts that the deviation to the high frequency of the DCT coefficient is large, the bit “1” in which the bit plane end signal is ON appears behind the scan, compared to when the deviation is predicted to be small Probability is high. Therefore, when the hierarchical coding unit 86 codes the combination belonging to the block predicted to have a large deviation, the larger the scan coordinate of the bit that starts the zero run, the larger the bit plane end signal turns ON. Select the Noh-Fuman table where short codes have been assigned to the combinations.
  • the hierarchical code input unit 86 has a small scan coordinate of the bit that starts the zero run, and the bit plane end signal is Select a short table with codes assigned to short for combinations that are ON.
  • the hierarchical coding unit 86 When it is predicted that the high-frequency bias of the DCT coefficient is large, the hierarchical coding unit 86 performs arithmetic coding with a high probability of occurrence that the code information with large scan coordinates is "1". I do.
  • the video encoding device 70 performs stream output processing (S 84). Specifically, the base layer output unit 76 outputs the base layer stream input from the base layer coding unit 74 to the outside of the video coding unit 70.
  • the enhancement layer output unit 90 outputs the enhancement layer stream input from the enhancement layer coding unit 78 to the outside of the video coding device 70.
  • the video encoding device 70 performs an end determination process (S86). Specifically, the video signal input unit 72 determines the presence or absence of a video input from the outside of the video encoding device 70. As a result of the determination, if there is no video input, the process ends, and if there is a video input, the process returns to the video signal input process (S70).
  • the video encoding device 70 according to the third embodiment has been described above.
  • the video encoding device 70 has a probability of being a zero run length or “1” from the state of the edge of the base layer with respect to the code layer information of the enhancement layer. It is possible to predict the probability that the bit plane end signal is ON, and replace the Huffman table used for the Huffman code based on the prediction to improve the code efficiency of the variable length code. is there.
  • the state prediction unit 88 performs state prediction as follows. When it is predicted that the high-frequency bias of the DCT coefficient is large, the hierarchical coding unit 86 increases the probability that the bit plane end signal is ON as the scanning coordinate of the bit increases, thereby increasing the arithmetic code. Perform. Conversely, when the state prediction unit 88 predicts that the deviation of the DCT coefficients toward high frequencies is small, the probability that the bit plane end signal power is ON is made high even when the scan coordinates of the bit are small. Perform encoding.
  • the hierarchical coding scheme is based on the base layer.
  • the motion prediction / compensation code MPEG used for MPEG code ⁇ etc. divides the original image into several areas, searches the reference area from the previous and subsequent decoded images for each area to be encoded, and encodes the reference area and the encoding area.
  • the variable with the area to be It is also possible to predict the state of the difference image with the subsequent frame based on the reference frame in this motion prediction / compensation code system.
  • prediction is performed from a reference frame by using a reference frame instead of the base layer in the third embodiment and using a difference image instead of the enhancement layer.
  • the differential image can be efficiently encoded based on the state.
  • a state prediction parameter representing the state of the enhancement layer is predicted based on decoded information which is information of the base layer decoded earlier, and decoding is performed based on the state prediction parameter.
  • a video decoding apparatus that performs a communication will be described.
  • a video decoding apparatus for decoding a video stream generated by the video coding apparatus 70 according to the third embodiment will be described.
  • FIG. 12 is a block diagram showing a configuration of a video decoding / reproducing apparatus 100 according to a fourth embodiment of the present invention.
  • the video decoding apparatus 100 includes a base layer input unit 102, a base layer decoding unit 104, an enhancement layer input unit 106, an enhancement layer decoding unit 108, and a video signal output unit 120.
  • the enhancement layer decoding unit 108 includes a hierarchical decoding unit 110, a reverse scan unit 112, an inverse DCT unit 114, an addition unit 116, and a state prediction unit 118.
  • the basic configuration of the video decoding apparatus 100 according to the fourth embodiment is the same as that of the video decoding apparatus according to the second embodiment, but the fourth embodiment is a hierarchical decoding.
  • the functions of the unit 110 and the state prediction unit 118 are different.
  • Base layer input section 102 also receives the base layer stream as an external force of video decoding / decoding apparatus 100, and outputs the base layer stream to base layer decoding section 104.
  • the base layer input unit 102 determines the presence or absence of input of the base layer stream from the outside, and ends the processing if there is no input of the base layer stream.
  • Base layer decoding section 104 receives a base layer sequence input from base layer input section 102. The stream is decoded to generate a base layer decoded image, and the generated base layer decoded image is output to the enhancement layer decoding unit 108 and the video signal output unit 120.
  • Enhancement layer input section 106 also receives an enhancement layer stream as an external force of video decoding / decoding apparatus 100 and outputs the enhancement layer stream to enhancement layer decoding section 108.
  • Video signal output section 120 outputs the base layer decoded image input from base layer decoding section 104 and the enhancement layer decoded image input from enhancement layer decoding section 108 to video decoding apparatus 100. Output to the outside of
  • Hierarchical decoding section 110 decodes the enhancement layer stream input from enhancement layer input section 106 using the state prediction parameters input from state prediction section 118 to generate scanned DCT coefficients.
  • the generated scanned DCT coefficients are output to the inverse scan unit 112.
  • the reverse scan unit 112 performs reverse scan to rearrange the scanned DCT coefficients input from the hierarchical decoding unit 112 in a specified order to generate DCT coefficients, and reverses the generated DCT coefficients. Output to the DCT unit 114.
  • the inverse DCT unit 114 inverts the DCT coefficients input from the inverse scan unit 112 for each region.
  • T is applied to synthesize a region to generate a differentially decoded image, and the generated differentially decoded image is output to the addition unit 116.
  • the addition unit 116 adds the base layer decoded image input from the base layer decoding unit 104 and the differentially decoded image input from the inverse DCT unit 114 to generate an enhancement layer decoded image,
  • the generated enhancement layer decoded image is output to the video signal output unit 120.
  • State prediction section 118 generates state prediction parameters based on the base layer decoded image input from base layer decoding section 104 to enhancement layer decoding section 108, and is generated. The state prediction parameters are output to hierarchical decoding section 116.
  • FIG. 13 is a flowchart showing an example of the operation of the video decoding / decoding device 100 of the fourth embodiment shown in FIG.
  • the flowchart shown in FIG. 13 is executed by executing a control program stored in a storage device (not shown) (eg, ROM, flash memory, etc.) by executing the program by executing a program (not shown).
  • a storage device eg, ROM, flash memory, etc.
  • the video decoding / reading apparatus 100 performs stream input processing (S90). Specifically, the base layer input unit 102 inputs the base layer stream also from the external power of the video decoding apparatus 100 and outputs the base layer stream to the base layer decoding unit 104.
  • the enhancement layer input unit 106 also receives the enhancement layer stream from the external power of the video decoding apparatus 100 and outputs the enhancement layer stream to the enhancement layer decoding unit 108.
  • the video decoding apparatus 100 performs base layer decoding processing (S92). Specifically, base layer decoding section 104 decodes the base layer stream input from base layer input section 102 to generate a base layer decoded image, and the generated base layer decoded image is generated. It is output to the enhancement layer decoding unit 108 and the video signal output unit 120.
  • the video decoding apparatus 100 performs state prediction processing (S94). Specifically, the state prediction unit 118 also generates state prediction parameters for the base layer decoding / image power input from the base layer decoding unit 104, and outputs the generated state prediction parameters to the hierarchy decoding unit 116. To force. In the prediction of the state prediction unit 118, the base layer decoded image is divided according to the region to be synthesized by the inverse DCT unit 112, the characteristics are checked for each divided region, and the DCT coefficients of the corresponding extension layer region are Predict the state. The prediction performed by the state prediction unit 118 is the same as the prediction performed in the third embodiment.
  • the video decoding apparatus 100 performs hierarchical decoding processing (S96). Specifically, the hierarchical decoding unit 110 decodes the enhancement layer stream input from the enhancement layer input unit 106 using the state prediction parameters input from the state prediction unit 118, and the scanned DCT coefficients are decoded. , And outputs the generated scanned DCT coefficients to the inverse scan unit 112.
  • variable length decoding is performed using state prediction parameters, and a combination of zero run length and bit plane end signal for each bit plane of each area is generated, Generate scanned DCT coefficients by zero run length decoding.
  • the hierarchy decoding unit 110 selects a Huffman table based on the state prediction parameter, and performs decoding using the selected Huffman table.
  • reverse scan unit 112 determines the order in which the scanned DCT coefficients input from hierarchical decoding unit 112 are determined. Inverse scan is performed to rearrange numbers to generate DCT coefficients, and the generated DCT coefficients are output to the inverse DCT unit 114.
  • the video decoding / writing apparatus 100 performs inverse DCT processing (S 100). Specifically, the inverse DCT unit 114 performs inverse DCT on the DCT coefficients input from the inverse scan unit 112 for each area, synthesizes the areas, and generates a differentially decoded image, and the generated differential decoding The image is output to the addition unit 116.
  • the video decoding apparatus 100 performs an addition process (S112). Specifically, the addition unit 116 adds the base layer decoded image input to the enhancement layer decoding unit 108 by the base layer decoding unit 104 and the differentially decoded image input from the inverse DCT unit 114. To generate an enhancement layer decoded image. The addition unit 116 outputs the generated enhancement layer decoded image to the video signal output unit 120.
  • the video decoding / reproducing apparatus 100 performs video signal output processing (S114). Specifically, the video signal output unit 120 outputs the base layer decoded image input from the base layer decoding unit 104 and the enhancement layer decoded image input from the enhancement layer decoding unit 108 as video. Output to the outside of the decryption device 100.
  • the video signal output unit 120 may output only one of the base layer decoded image and the enhancement layer decoded ⁇ image to the outside.
  • the video decoding apparatus 100 performs an end determination process (S116) 0 Specifically, the basic layer input unit 102 determines whether there is an input of the external layer or the input of the basic layer stream. If there is no input of the basic layer stream, the processing ends, and if there is an input of the basic layer stream, the processing returns to the stream input processing (S90).
  • S116 end determination process
  • the basic layer input unit 102 determines whether there is an input of the external layer or the input of the basic layer stream. If there is no input of the basic layer stream, the processing ends, and if there is an input of the basic layer stream, the processing returns to the stream input processing (S90).
  • the video decoding apparatus 100 predicts the zero run length and the probability of being “1” from the state of the edge of the base layer with respect to the decoding information of the enhancement layer. Also, by predicting the probability that the bit plane end signal is ON, the Huffman table used for the Huffman decoding is replaced based on the prediction, and the code amount for decoding the decoding information is reduced. It is possible to improve the quality of the image.
  • the hierarchical decoding unit 110 may perform arithmetic decoding.
  • the state of the predicted enhancement layer Decoding is performed by changing the appearance probability of symbols used for arithmetic decoding according to the state.
  • the state of the enhancement layer is predicted from the base layer and decoding of the enhancement layer is performed based on the predicted state.
  • motion prediction / compensation decoding When performing chewing, the state may be predicted based on the reference image.
  • a reference frame is used instead of the base layer in the fourth embodiment, and a difference image is used instead of the enhancement layer to obtain a reference frame. The difference image can be decoded based on the predicted state.
  • the states of bits to be encoded later based on information on previously encoded transform coefficients, that is, “1” and “0” in bits to be encoded later.
  • the present invention is particularly useful for a video coding scheme of a system that transmits and receives video while dynamically changing the amount of video stream via a network with a variable communication speed.

Abstract

A video encoder (10) comprises a DCT section for creating a transform coefficient representing a frequency component by frequency-transforming a video, a variable length encoding section (32) for transforming the transform coefficient into a binary number, generating bit planes composed of bits at the same place of transform coefficients from the most significant bit to the least significant bit and encoding the bit planes sequentially from the high-order bit plane, and a state predicting section (34) for predicting the state of a bit of the transform coefficient to be encoded next on the basis of information the previously encoded transform coefficient. The variable length encoding section (32) performs encoding according to the predicted bit state. Thus, high-efficiency encoding can be performed according to the bit state of the transform coefficient to be encoded.

Description

明 細 書  Specification
映像符号化装置、映像復号化装置、映像符号化方法及び映像復号化方 法  Video coding apparatus, video decoding apparatus, video coding method and video decoding method
技術分野  Technical field
[0001] 本発明は、映像を符号化し映像ストリームを生成する映像符号化装置および方法と [0001] The present invention relates to a video coding apparatus and method for coding video and generating a video stream, and
、映像ストリームを復号ィ匕して復号ィ匕映像を生成する映像復号ィ匕装置および方法に 関する。 The present invention relates to a video decoding apparatus and method for decoding a video stream to generate a decoded video.
背景技術  Background art
[0002] 映像は、もはや我々の生活とは切り離せない関係にあり、インターネットや携帯電話 網、放送波、蓄積メディアなどの伝送部を通じ、パソコンや携帯端末、テレビ、ハイビ ジョンテレビなどの多様な表示端末にぉ 、て、視覚的な情報を享受させてくれる重要 な存在となった。  [0002] Images are no longer separated from our lives, and various displays such as personal computers, portable terminals, televisions, high-vision televisions, etc. are transmitted through transmission units such as the Internet, mobile phone networks, broadcast waves, and storage media. It has become an important presence that allows the terminal to enjoy visual information.
[0003] 伝送部を通じて伝送される映像は、映像符号ィ匕技術を用いてより少ないデータ量 を持つ映像ストリームに圧縮され、情報が効率良く伝達される。最近では、映像情報 を全てダウンロードし終えて力 再生するのではなぐ受信した映像符号ィ匕データを 順次再生する映像ストリーム伝送が盛んになってきている。 ISO/IEC 14496- 1 0に記載の MPEG— 4 AVC方式のような映像符号ィ匕技術では、一度符号化すると 復号ィ匕に用いられる符号量は一意に決定され、再生する映像の品質を変えることは できない。 1つの映像ストリームを通信帯域の異なる 2者に提供する場合には、映像 を 2回符号化しそれぞれの帯域にあわせて伝送するか、もしくは、狭い方の通信帯域 にあわせて映像の画質や解像度、フレームレートを下げて映像符号ィ匕して 、た。  [0003] Video transmitted through the transmission unit is compressed into a video stream having a smaller amount of data using video coding technology, and information is efficiently transmitted. Recently, video stream transmission has become increasingly popular, in which video code data that has been received is sequentially played back, rather than being completely downloaded after all video information has been downloaded. In the video coding technology such as the MPEG-4 AVC system described in ISO / IEC 14496-10, once coding, the code amount used for decoding is uniquely determined and changes the quality of the video to be reproduced. It is not possible. When one video stream is provided to two different communication bands, the video is encoded twice and transmitted according to each band, or the image quality and resolution of the video according to the narrow communication band, Reduce the frame rate and set the video code.
[0004] 幾つかの階層からなるデータ構造を持ち、符号ィ匕後も必要に応じてストリームの伝 送量を変更することのできるスケーラブル映像符号ィ匕方式が幾つか考案規格化され た。映像階層符号ィ匕方式では、画質、解像度、フレームレートなどを映像が符号化さ れた後に選択することができる。  [0004] Several scalable video coding schemes have been devised and standardized, which have a data structure consisting of several layers and can change the amount of stream transmission as needed even after coding. In the video hierarchical coding system, the image quality, resolution, frame rate, etc. can be selected after the video is coded.
[0005] 高度化されたカメラ技術により映像が高精細になるにしたがって、映像の有する情 報量は増し、ユーザ毎に伝送量を調整する必要性が高まり、符号ィ匕効率の良いスケ ーラブル映像符号ィ匕方式が求められて ヽる。 [0005] As video becomes higher definition due to advanced camera technology, the amount of information possessed by the video increases, and the need to adjust the amount of transmission for each user increases, resulting in a code-efficient schedule. A rubble video coding system is required and described.
[0006] MPEG -4 FGS (Fine Granularity Scalable coding)は ISO/IEC 1449 6- 2 Amendment 2に規定されているスケーラブル映像符号化方式の 1つで、と りわけストリームの画質を微細に選択することが可能な符号ィ匕方法として規格化され ている。  [0006] MPEG-4 FGS (Fine Granularity Scalable coding) is one of the scalable video coding methods defined in ISO / IEC 1449 6-2 Amendment 2, and in particular, it is necessary to finely select the picture quality of the stream. Is standardized as a possible coding method.
[0007] MPEG -4 FGSで符号化された映像ストリームは、基本レイヤストリームと拡張レ ィャストリームによって構成される。基本レイヤストリームは単体で復号ィ匕が可能な低 帯域で低画質の映像ストリームであり、拡張レイヤストリームは基本レイヤストリームの 画質を向上させるための映像ストリームである。 MPEG -4 FGSは、階層的に符号 化されたレイヤ構造と、拡張レイヤストリームに用いられるビットプレーン符号化と呼ば れる符号化処理などにより、伝送する符号量を 1フレーム(1画面、 1画像)ごとに制御 することが可能で、伝送速度や求められる画像品質に非常に柔軟に対応できる。  [0007] A video stream encoded by MPEG-4 FGS is composed of a base layer stream and an enhancement layer stream. The base layer stream is a low-band low-quality video stream that can be decoded alone, and the enhancement layer stream is a video stream for improving the quality of the base layer stream. In MPEG-4 FGS, the amount of code to be transmitted is 1 frame (1 screen, 1 image) by hierarchically encoded layer structure and encoding process called bit-plane encoding used for enhancement layer stream. It can be controlled on a case-by-case basis and can be very flexible in responding to the transmission rate and the required image quality.
[0008] 以下に、 MPEG— 4 FGSの拡張レイヤストリームの生成に用いられるビットプレー ン符号ィ匕の概念を簡単に説明する。  [0008] The following briefly describes the concept of bit plane coding used to generate an MPEG-4 FGS enhancement layer stream.
[0009] 図 14は、 MPEG— 4 FGSの映像符号化装置 130を示す図である。映像信号入 力部 132から原映像を入力し、基本レイヤ符号化部 134にお 、て基本レイヤストリ一 ムを、拡張レイヤ符号ィ匕部 140において拡張レイヤストリームを生成し、基本レイヤ出 力部 136と拡張レイヤ出力部 138から出力する。  [0009] FIG. 14 is a diagram showing a video encoding device 130 of MPEG-4 FGS. The original video is input from the video signal input unit 132, and the base layer coding unit 134 generates a base layer stream, and the enhancement layer coding unit 140 generates an enhancement layer stream, and the base layer output unit 136. And output from the enhancement layer output unit 138.
[0010] 拡張レイヤ符号ィ匕部 140では、原映像と基本レイヤストリームを復号ィ匕した基本レイ ャ復号映像の差分である差分映像を 8 X 8画素ごとに DCT変換 (Discrete Cosine [0010] In enhancement layer coding section 140, a DCT transform is performed on each differential pixel, which is the difference between a base layer decoded video obtained by decoding the original video and the base layer stream, every 8 X 8 pixels (Discrete Cosine
Transformation:離散コサイン変換)、スキャン、階層符号化を行う。 MPEG— 4 FGSでは、差分映像を DCT変換した DCT係数に多く現れる「0」の値をまとめて符 号ィ匕することにより符号ィ匕効率を上げる。 Transformation (discrete cosine transform), scan, hierarchical coding. In MPEG-4 FGS, the coding efficiency is improved by collectively coding the value of “0” that appears frequently in the DCT coefficients of the differential video.
[0011] 拡張レイヤ符号ィ匕部 140では、まず、 DCT部 144が差分映像を DCT変換して DC T係数を生成する。次に、スキャン部 146が 8 X 8画素の DCT係数をスキャンして、ス キャンした DCT係数を並べ替える。 DCT係数は統計的に大きな絶対値を持つ係数 が水平および垂直低周波に偏り、水平および垂直高周波係数が大きな絶対値を持 つことは統計的に少なくなる。スキャン部 146が、水平および垂直低周波係数から水 平および垂直高周波係数に向力つて DCT係数を並べ替えるスキャンを行うことにより 、並べ替えられたスキャン済 DCT係数の後半には「0」が多くなる。図 16Aは、 8 X 8 画素の DCT係数のスキャンを表す図である。プラスマイナスを表す符号は絶対値と は別途符号化する。このように、水平周波数、垂直周波数の低い方カゝら順にスキャン する。これにより、高周波成分では「0」が出現する確率の高ので、符号化効率を高め ることがでさる。 [0011] In the enhancement layer coding unit 140, first, the DCT unit 144 DCT-transforms the difference image to generate DCT coefficients. Next, the scan unit 146 scans DCT coefficients of 8 × 8 pixels and rearranges the scanned DCT coefficients. The DCT coefficients have a statistically large absolute value biased to horizontal and vertical low frequencies, and it becomes statistically less that the horizontal and vertical high frequency coefficients have large absolute values. The scanning unit 146 is used to By performing a scan that reorders the DCT coefficients towards the flat and vertical high frequency coefficients, there will be more "0" s in the second half of the reordered scanned DCT coefficients. FIG. 16A shows a scan of DCT coefficients of 8 × 8 pixels. Signs representing plus and minus are encoded separately from absolute values. In this way, scan in the order of lower horizontal frequency and lower vertical frequency. As a result, the coding efficiency can be increased because the probability that "0" appears in the high frequency component is high.
[0012] 次に、階層符号化部 140がビットプレーン符号化を行う。 MPEG -4 FGSは、ビッ トプレーン符号化として上位ビットからビットプレーンごとに零ラン長符号ィ匕およびノヽ フマン符号ィ匕を行う。ビットプレーンとは、 2進数で表された複数の数値の同じビット 位のみを並べたビット列のことであり、ビット平面とも呼ばれる。図 17は、 DCT係数を 、横軸をスキャン順に、縦軸をビットごとに並べた図である。 1つの列が 1つの DCT係 数を表し、 1つの行が 1つのビットプレーンを表す。 DCT係数の最上位の「1」のビット を MSBと呼び、図 17において空白セルは、 MSBより上位の「0」ビットの記述を省略 したものである。複数の DCT係数の中で最も大き 、値を持つ MSBを含むビットプレ ーンを MSB平面と呼ぶ。  Next, hierarchical coding section 140 performs bit plane coding. The MPEG-4 FGS performs a zero run length code and a nonman code for each bit plane from the upper bits as bit plane coding. A bit plane is a bit string in which only the same bit positions of a plurality of binary numbers are arranged, and is also called a bit plane. FIG. 17 is a diagram in which DCT coefficients are arranged with the horizontal axis in scan order and the vertical axis in bit units. One column represents one DCT coefficient, and one row represents one bit plane. The most significant “1” bit of the DCT coefficient is called MSB, and the blank cell in FIG. 17 is the one in which the description of “0” bit higher than MSB is omitted. The bit plane containing the MSB with the largest value among the DCT coefficients is called the MSB plane.
[0013] 零ラン長符号化とは、「0」以外の係数が現れる前に、「0」が幾つ現れるかを符号化 するものであり、複数の「0」に 1つの信号を割り当てることにより情報量を圧縮する符 号化手法である。ビットプレーンにおいては「0」か「1」しか現れないので「0の長さ」に なる。また、「0」の後に現れる「1」がビットプレーン内においてスキャン順序で「最後 の 1か否か」を符号化することにより、スキャンによって後半に偏った「0」を効率よく符 号化する。図 17に例示する MPEG—4 FGSの零ラン長符号ィ匕では、領域 Cを参照 すると、「0」が 4つ続いた後に「1」が続くがさらにその後に「1」があるので、「0が 4つ」 そして「最後の 1ではない」の組み合わせを割り当てる。図 16Bは、 8 X 8画素の DCT 係数のあるビット位のビットプレーンを示した図である。図 16Aで示したスキャン順序 によれば、図 16Bのビット B12は、ビットプレーン最後の「1」であり、それ以外はビット プレーン最後の「 1」ではな 、。  Zero run length coding is to encode how many “0” appears before coefficients other than “0” appear, and by assigning one signal to a plurality of “0” s. It is a coding method that compresses the amount of information. Since only "0" or "1" appears in the bit plane, it becomes "0 length". In addition, by encoding "whether it is the last 1 or not" in the bit plane, the "1" appearing after the "0" efficiently encodes the "0" that is biased in the second half by the scan. . In the zero run length code MPEG of MPEG-4 FGS illustrated in FIG. 17, referring to the area C, four “0” s follow and “1” s follow, but there is a “1” after that. Assign a combination of 0 is 4 and "not 1". FIG. 16B is a diagram showing a bit plane of bit positions of DCT coefficients of 8 × 8 pixels. According to the scan order shown in FIG. 16A, bit B12 in FIG. 16B is "1" at the end of the bit plane, otherwise it is not "1" at the end of the bit plane.
[0014] ハフマン符号化とは可変長符号化の一種で、 MPEG— 4 FGSにおいては、「0の 続く長さ」すなわち零ラン長と「最後の 1であるか否か」すなわちビットプレーン終了信 号の各組み合わせに対してあらかじめ出現確率を求めておき、出現確率の高!、組み 合わせには短! ヽ符号を、出現確率の低!、組み合わせには長 ヽ符号を割り当てて情 報を圧縮する。組み合わせに対して割り当てる符号の表をノ、フマンテーブルと!/、 、、 MPEG -4 FGSでは、 MSB平面用と、 MSBより 1ビット下位のビットプレーン用と、 2ビット下位のビットプレーン用と、それより下位のビットプレーン用の 4つのテープノレ を用意する。これらのハフマンテーブルでは、 8 X 8画素の DCT係数の MSB平面は 「1」を少なく含み、下位ビットプレーンになるに従って徐々に 1の出現確率が増えて 行くことを想定して、「零ラン長」と「最後の『1』」の組み合わせを予測して、効率良く情 報を圧縮する。 [0014] Huffman coding is a type of variable-length coding, and in MPEG-4 FGS, "length following 0", ie, zero run length and "whether it is the last 1 or not", ie, bit plane end signal The appearance probability is calculated in advance for each combination of numbers, and the information is compressed by assigning high probability of occurrence, short! Symbol for combination, low probability of occurrence, and long symbol for combination. Do. Table of codes to be assigned to combinations No., H. Mant. And! /,, MPEG-4 FGS, for MSB plane, for bit plane 1 bit lower than MSB, for bit plane 2 bit lower, Prepare 4 tape holes for lower bit planes. In these Huffman tables, assuming that the MSB plane of the DCT coefficient of 8 × 8 pixels contains less “1”, and assuming that the probability of occurrence of 1 gradually increases as it becomes the lower bit plane, “zero run length Compress information efficiently by predicting the combination of “1” and “last [1]”.
[0015] 図 15は、映像符号ィ匕装置 130が生成した基本レイヤストリームと拡張レイヤストリー ムを復号化して復号化画像を生成する MPEG— 4 FGSの映像復号化装置 150で ある。基本レイヤ復号ィ匕部 156が基本レイヤストリームを復号ィ匕して生成した基本レイ ャ復号化画像と、拡張レイヤ復号化部 160が拡張レイヤストリームを復号化して生成 した差分復号化画像とを加算して復号化画像を生成する。復号化画像の画質は、復 号ィ匕した拡張レイヤストリームの量に比例する。  [0015] FIG. 15 shows an MPEG-4 FGS video decoding apparatus 150 that decodes the base layer stream and the enhancement layer stream generated by the video coding apparatus 130 to generate a decoded image. The base layer decoding image generated by the base layer decoding block 156 by decoding the base layer stream and the differentially decoded image generated by the enhancement layer decoding section 160 by decoding the enhancement layer stream are added. To generate a decoded image. The quality of the decoded image is proportional to the amount of decoded enhancement layer stream.
[0016] 以上のように符号ィ匕および復号ィ匕する MPEG— 4 FGSでは、復号ィ匕画像の画質 に強い影響を及ぼす DCT係数の上位ビットプレーン力も優先的に符号ィ匕して映像ス トリームに格納することにより、画質を柔軟に調整することが可能である。たとえば、映 像送信端末から映像受信端末に映像ストリームを伝送するシステムにお ヽて、映像 受信端末が映像ストリームをすベて受信できな力つた場合、 MPEG -4 FGSの映 像ストリームでは画質への影響度の低い下位ビットプレーンが欠落するだけなので、 映像を復号化することが可能である。し力し、 MPEG— 4 AVCなどの映像非スケー ラブル符号ィ匕方式では、映像ストリームをすベて受信できな力つた場合、 1画面の下 半分が復号できないなどといった主観画質を大幅に低下させる問題が発生する。  [0016] As described above, in MPEG-4 FGS that performs coding and decoding, the upper bit plane power of the DCT coefficient that has a strong influence on the quality of the decoded image is also coded and video stream is preferentially coded. The image quality can be flexibly adjusted by storing in. For example, in a system for transmitting a video stream from a video transmitting terminal to a video receiving terminal, if the video receiving terminal can not receive the video stream completely, the video stream of the MPEG-4 FGS will be displayed in image quality. It is possible to decode the picture since only the lower bit planes of low influence are missing. In the case of video non-scaleable coding schemes such as MPEG-4 AVC, if the video stream can not be received completely, the subjective image quality is greatly degraded, such as the lower half of one screen can not be decoded. I have a problem.
[0017] 特開 2003— 274406号公報では、図 14に示す MPEG— 4 FGSの映像符号化 装置にお 、て、階層符号ィ匕部 148の改良を行って 、る。  In Japanese Patent Application Laid-Open No. 2003-274406, the hierarchical coding unit 148 is improved in the MPEG-4 FGS video coding apparatus shown in FIG.
[0018] 図 18は、上記公報に記載された階層符号化部 148を示す。階層符号化部 148で は、ビット分離部 170が、入力された DCT係数を MSBとそれ以外の非 MSBに分離 し、 MSB符号化部 172が DCT係数の MSBのみを符号化し、非 MSB符号化部 174 が非 MSBを符号ィ匕し、合成部 176が MSBと非 MSBの符号を合成し出力する。上 記公報では、 1つの DCT係数の中で復号化画像の画質に最も影響を及ぼす MSB を優先して映像ストリームに符号ィ匕することにより、符号ィ匕効率の向上を図っている。 「符号ィ匕効率が良い」とは、同じ原映像を符号ィ匕した同じデータ量の 2つの映像ストリ ームを復号ィ匕したときに、一方の復号ィ匕画像の画質が他方に勝っていれば、前者の 方の符号化効率が良 、と 、う。 FIG. 18 shows the layer coding unit 148 described in the above publication. In hierarchical coding unit 148, bit separation unit 170 separates the input DCT coefficients into MSB and other non-MSBs. The MSB encoding unit 172 encodes only the MSB of the DCT coefficient, the non-MSB encoding unit 174 encodes the non-MSB, and the combining unit 176 combines and outputs the MSB and non-MSB codes. In the above-mentioned publication, the coding efficiency is improved by giving priority to the video stream by giving priority to the MSB which most affects the image quality of the decoded image among one DCT coefficient. “The code efficiency is good” means that when two video streams of the same data amount, which code the same original video, are decoded, the image quality of one decoded video is superior to the other. Then, the former has better coding efficiency.
発明の開示  Disclosure of the invention
発明が解決しょうとする課題  Problem that invention tries to solve
[0019] 上記したように、 MPEG— 4 FGSはビットプレーンごとにハフマンテーブルを用意 して 、るものの、そのハフマンテーブルが「0の続く長さ」と「最後の『1』であるか否力」 の各組み合わせの出現確率を正確に反映しているとは限らず、符号化効率が良いと は言えない。拡張レイヤ符号化部 140で符号化する差分映像は、原映像と基本レイ ャ復号ィ匕画像との差分であり多くのエッジを含んでいるので、高周波領域に大きい絶 対値の DCT係数が現れる場合がある。例えば、ある 1つの DCT係数の MSBが 11ビ ット位で他の DCT係数の MSBが全て 5ビット位以下とすると、 11ビット位の MSB平 面から 6ビット位まで符号ィ匕する DCT係数は 1つであるにも関わらず、ハフマンテー ブルとして「1」が多く出現すると想定したものが用いられ、想定する出現確率と実際 のズレが大きくなり符号ィ匕効率が悪くなる。  As described above, although MPEG-4 FGS prepares a Huffman table for each bit plane, it does not determine whether the Huffman table has “a length of 0” and “last“ 1 ”. It does not necessarily reflect the appearance probability of each combination of “”, and it can not be said that the coding efficiency is good. The differential image to be encoded by the enhancement layer encoding unit 140 is a difference between the original image and the basic layer decoded image and includes many edges, so a large absolute value DCT coefficient appears in a high frequency region. There is a case. For example, if the MSB of one DCT coefficient is 11 bits and the MSBs of the other DCT coefficients are all 5 bits or less, the DCT coefficients coding from 11 MSB planes to 6 bits are Even though it is one, it is assumed that a large number of “1” appears as a Huffman table, and the expected probability of occurrence and the actual deviation become large and the coding efficiency becomes worse.
[0020] 上記公報で用いる DCT係数の MSBのみを優先する符号化方法では、必ずしも符 号ィ匕効率の改善は望めない。例えば、 MSBが 8ビット位と 2ビット位の DCT係数を考 える場合、後者の 2ビット位の MSBより、前者の 7ビット位の非 MSBの方が画質に与 える影響が大きいからである。  In the coding method in which only the MSB of the DCT coefficient used in the above publication is prioritized, improvement in coding efficiency can not always be expected. For example, when the MSB is considered to be a DCT coefficient of 8 bits and 2 bits, the 7-bit non-MSB of the former has a larger influence on the image quality than the MSB of the latter 2 bits.
[0021] 本発明は、上記背景に鑑み、符号化対象の変換係数のビットの状態に応じて効率 の良い符号ィ匕を行うことができる映像符号ィ匕装置および符号ィ匕データを復号ィ匕する 映像復号ィ匕装置を提供することを目的とする。  [0021] In view of the above background, the present invention decodes a video coding apparatus and coding data that can perform efficient coding according to the state of bits of transform coefficients to be encoded. An object of the present invention is to provide a video decoding device.
課題を解決するための手段  Means to solve the problem
[0022] 本発明の映像符号化装置は、映像を周波数変換して周波数成分を表す変換係数 を生成する変換係数生成部と、変換係数生成部にて生成された変換係数を 2進数に 変換し、複数の変換係数の同じ位のビットからなるビットプレーンを最上位ビットから 最下位ビットに至るまで生成し、上位ビットプレーン力 順に符号ィ匕するビットプレー ン符号化部と、先に符号化された変換係数に関する情報に基づいて、後に符号化さ れる変換係数のビットの状態を予測する状態予測部とを備え、ビットプレーン符号ィ匕 部は、状態予測部によって予測されたビットの状態に応じて符号ィ匕を行う。 A video encoding apparatus according to the present invention is a conversion coefficient that frequency-converts an image and represents frequency components. Transform coefficients generated by the transform coefficient generator and the transform coefficient generator into binary numbers, and the bit plane consisting of the same order bits of the plurality of transform coefficients is from the most significant bit to the least significant bit A state in which the state of bits of transform coefficients to be encoded later is predicted based on information on a bitplane encoding unit that generates up to and encoding in order of upper bitplane power and transform coefficients that have been encoded earlier. The bit plane coding unit performs coding according to the state of the bit predicted by the state prediction unit.
[0023] 上記映像符号化装置において、状態予測部は、(1)上位のビットプレーンの情報 に基づいて、下位のビットプレーンのビットの状態を予測してもよいし、(2)予測対象 のビットより上位のビットプレーンにお 、て、ビットの状態「1」が現れた変換係数の個 数に応じてビットの状態を予測してもよいし、(3)予測対象のビットの変換係数におい て上位ビットにビットの状態「1」が現れたか否かに基づ!/、て、ビットの状態を予測して もよいし、(4)予測対象のビットより上位のビットプレーンにおいてビットの状態「1」が 現れた変換係数のうちで、予測対象のビットの変換係数よりも符号ィ匕順序が後方に 位置し、かつ、最も近い変換係数までの距離に基づいて、ビットの状態を予測しても よいし、(5)予測対象のビットより上位のビットプレーンにおいて、符号ィ匕順序が最も 後方の位置でビットの状態「1」が現れた変換係数に基づいて、ビットの状態を予測し てもよい。 In the above-described video encoding device, the state prediction unit may predict the state of the bits of the lower bit plane based on (1) information of the upper bit plane, or (2) the prediction target The bit state may be predicted according to the number of transform coefficients in which a bit state “1” appears in the bit plane higher than the bit, or (3) the transform coefficient of the bit to be predicted is The state of the bit may be predicted based on whether the state of the bit "1" appears in the upper bit or not, or (4) the state of the bit in the bit plane higher than the bit to be predicted Among the transform coefficients in which “1” appears, the state of the bit is predicted based on the distance to the closest transform coefficient where the code order is located later than the transform coefficient of the bit to be predicted. Or (5) a bit higher than the bit to be predicted. In the case of the bit plane, the state of the bit may be predicted based on the transform coefficient at which the state of the bit “1” appears at the last position of the code sequence.
[0024] 上記映像符号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、符号ィ匕順序が後方に位置する変換係数の同じビットプレーンに属する全 てのビットの状態が「0」である確率を予測し、ビットプレーン符号ィ匕部は、確率に基づ いて、零ラン長符号ィ匕を行ってもよい。  [0024] In the video encoding device, the state prediction unit is configured such that the state of the bit to be predicted is "1", and all the code coefficients belonging to the same bit plane belonging to the same bit plane of the transform coefficient located behind. It predicts the probability that the bit state is "0", and the bit-plane coder may perform zero run length code based on the probability.
[0025] 上記映像符号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにお 、て、あら力じめ定められた順番以降の符号ィ匕順序の変換係数にビッ トの状態「1」を含む力否かを判定し、ビットの状態「1」を含まないとの判定に応じて、 予測対象のビットの状態が「1」であり、かつ、符号ィ匕順序が後方に位置する変換係 数の同じビットプレーンに属する全てのビットの状態が「0」である確率を予測し、ビット プレーン符号ィ匕部は、確率に基づいて、零ラン長符号ィ匕を行ってもよい。  [0025] In the above video coding device, the state prediction unit is configured to convert bits of the coding coefficients in the order of the codes after the predetermined order on the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and in accordance with the determination that the state “1” of the bit is not included, the state of the bit to be predicted is “1” and the order of the codes is The probability of all bits belonging to the same bit plane of the conversion coefficient located behind being "0" is predicted, and the bit plane code block performs a zero run length code block based on the probability. May be
[0026] 上記映像符号化装置にお!ヽて、状態予測部は、予測対象のビットを含む零ラン長 を予測し、ビットプレーン符号ィ匕部は、状態予測部にて予測された零ラン長に基づい て選択したハフマンテーブルを用いてハフマン符号化を行ってもょ 、。 [0026] In the above video encoding device, the state prediction unit is a zero run length including bits to be predicted. The bit plane coding unit may perform Huffman coding using a Huffman table selected based on the zero run length predicted by the state prediction unit.
[0027] 上記映像符号ィ匕装置において、状態予測部は、ビットの状態「0」が連続して出現 する確率があらかじめ設定された閾値を下回る箇所までのゼロの個数を零ラン長とし て求めてもよい。  [0027] In the above video coding device, the state prediction unit determines the number of zeros up to the point where the probability that the bit state "0" continuously appears falls below a preset threshold as the zero run length. May be
[0028] 上記映像符号化装置において、状態予測部は、予測対象のビットの状態が「1」又 は「0」である確率を予測し、ビットプレーン符号ィヒ部は、状態予測部にて予測された 確率に基づ 、て決定したシンボルの生起確率を用いて、算術符号ィ匕を行ってもよ!ヽ  In the above-described video encoding apparatus, the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit-plane coding part uses the state prediction unit. Arithmetic coding may be performed using the occurrence probability of the symbol determined based on the predicted probability.
[0029] 上記映像符号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、符号ィ匕順序が後方に位置する変換係数の同じビットプレーンに属する全 てのビットの状態が「0」である確率を予測し、ビットプレーン符号ィ匕部は、状態予測部 にて予測された確率に基づ ヽて決定したシンボルの生起確率を用いて、算術符号 化を行ってもよい。 [0029] In the above video encoding device, the state prediction unit is configured such that the state of the bit to be predicted is “1”, and the code order is all belonging to the same bit plane of the conversion coefficient located behind. The probability of the bit state being "0" is predicted, and the bit plane coding unit performs arithmetic coding using the occurrence probability of the symbol determined based on the probability predicted by the state prediction unit. You may go.
[0030] 上記映像符号化装置は、映像を単独で復号ィ匕可能な基本レイヤと基本レイヤの映 像品質を向上させる拡張レイヤとに階層符号ィ匕する映像符号ィ匕装置であって、状態 予測部は、基本レイヤの情報に基づいて、拡張レイヤの変換係数のビットの状態を 予測し、ビットプレーン符号ィ匕部は、状態予測部によって予測されたビットの状態に 応じて拡張レイヤの変換係数を符号ィ匕してもよ!、。  The video coding apparatus is a video coding apparatus that performs hierarchical coding on a base layer capable of decoding video independently and an enhancement layer for improving the video quality of the base layer. The prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on the information of the base layer, and the bit plane coding unit converts the enhancement layer according to the state of the bit predicted by the state prediction unit. You may sign the coefficients!
[0031] 上記映像符号化装置にお!、て、状態予測部は、基本レイヤに含まれるエッジの情 報または基本レイヤの符号ィ匕量に基づ 、て、拡張レイヤの変換係数のビットの状態 を予測してもよい。  [0031] In the above video coding apparatus, the state prediction unit is based on edge information included in the base layer or the base layer code amount, and the bit of the transform coefficient of the enhancement layer is You may predict the condition.
[0032] 上記映像符号ィ匕装置において、状態予測部は、映像を構成するフレームの変換係 数のビットの状態を、動き予測補償符号ィ匕に用いる参照フレームの情報に基づいて 予測し、ビットプレーン符号ィ匕部は、状態予測部によって予測されたビットの状態に 応じてフレームの変換係数を符号ィ匕してもょ 、。  In the video coding apparatus, the state prediction unit predicts the state of the bit of the conversion coefficient of the frame constituting the video based on the information of the reference frame used for the motion prediction / compensation code, The plane code unit may code the transform coefficient of the frame according to the state of the bit predicted by the state prediction unit.
[0033] 上記映像符号ィ匕装置において、状態予測部は、参照フレームに含まれるエッジの 情報または参照フレームの符号ィ匕量に基づ 、て、フレームの変換係数のビットの状 態を予測してもよい。 [0033] In the video coding apparatus, the state prediction unit determines the shape of bits of the transform coefficient of the frame based on edge information included in the reference frame or the code quantity of the reference frame. The state may be predicted.
[0034] 本発明の映像復号化装置は、ビットプレーン符号化された映像の符号化データを 上位ビットプレーン力 順に復号ィ匕するビットプレーン復号ィ匕部と、先に復号ィ匕され た変換係数に関する情報に基づいて、後に復号化される変換係数のビットの状態を 予測する状態予測部とを備え、ビットプレーン復号ィ匕部は、状態予測部によって予測 されたビットの状態に応じて復号ィ匕を行う。  The video decoding apparatus according to the present invention comprises a bit plane decoding unit that decodes bit plane coded video encoded data in order of upper bit plane power, and a conversion coefficient that has been previously decoded. And a state prediction unit that predicts the state of bits of transform coefficients to be decoded later based on the information related to H. The bit plane decoding unit decodes in accordance with the state of bits predicted by the state prediction unit. Do a bribe.
[0035] 上記映像復号化装置において、状態予測部は、(1)上位のビットプレーンの情報 に基づいて、下位のビットプレーンのビットの状態を予測してもよいし、(2)予測対象 のビットより上位のビットプレーンにお 、て、ビットの状態「1」が現れた変換係数の個 数に応じてビットの状態を予測してもよいし、(3)予測対象のビットの変換係数におい て上位ビットにビットの状態「1」が現れたか否かに基づ!/、て、ビットの状態を予測して もよいし、(4)予測対象のビットより上位のビットプレーンにおいてビットの状態「1」が 現れた変換係数のうちで、予測対象のビットの変換係数よりも復号ィ匕順序が後方に 位置し、かつ最も近い変換係数までの距離に基づいて、ビットの状態を予測してもよ いし、(5)予測対象のビットより上位のビットプレーンにおいて、復号ィ匕順序が最も後 方の位置でビットの状態「1」が現れた変換係数に基づいて、ビットの状態を予測して ちょい。  [0035] In the above video decoding apparatus, the state prediction unit may predict the state of the bits of the lower bit plane based on the information of the upper bit plane (1). The bit state may be predicted according to the number of transform coefficients in which a bit state “1” appears in the bit plane higher than the bit, or (3) the transform coefficient of the bit to be predicted is The state of the bit may be predicted based on whether the state of the bit "1" appears in the upper bit or not, or (4) the state of the bit in the bit plane higher than the bit to be predicted Among the transform coefficients in which “1” appears, the state of the bit is predicted based on the distance to the closest transform coefficient to which the decoding order is located later than the transform coefficient of the bit to be predicted. (5) bits higher than the bit to be predicted In the top plane, the state of the bit is predicted based on the transform coefficient at which the state of the bit "1" appears at the position where the decoding order is the most backward.
[0036] 上記映像復号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、復号ィ匕順序が後方に位置する変換係数の同じビットプレーンに属する全 てのビットの状態が「0」である確率を予測し、ビットプレーン復号ィ匕部は、確率に基づ いて、零ラン長復号ィ匕を行ってもよい。  In the video decoding apparatus, the state prediction unit is configured such that the state of the bit to be predicted is “1”, and the decoding order is all belonging to the same bit plane of the conversion coefficient located behind. By predicting the probability that the bit state is "0", the bit plane decoding unit may perform zero run length decoding based on the probability.
[0037] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにお 、て、あら力じめ定められた順番以降の復号ィ匕順序の変換係数にビッ トの状態「1」を含む力否かを判定し、ビットの状態「1」を含まないとの判定に応じて、 予測対象のビットの状態が「1」であり、かつ、復号ィ匕順序が後方に位置する変換係 数の同じビットプレーンに属する全てのビットの状態が「0」である確率を予測し、ビット プレーン復号ィ匕部は、確率に基づいて、零ラン長復号ィ匕を行ってもよい。  [0037] In the above video decoding apparatus, the state prediction unit is configured to convert the decoding coefficients in the decoding order after the predetermined order in the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and the state of the bit to be predicted is “1” and the decoding order is determined according to the determination that the state “1” of the bit is not included. The probability of all bits belonging to the same bitplane of the conversion coefficient located behind being "0" is predicted, and the bitplane decoding unit performs zero run length decoding based on the probability. May be
[0038] 上記映像復号化装置にお!ヽて、状態予測部は、予測対象のビットを含む零ラン長 を予測し、ビットプレーン復号ィ匕部は、状態予測部にて予測された零ラン長に基づい て選択したハフマンテーブルを用いてハフマン復号ィ匕を行うことを特徴としてもよ 、。 [0038] In the above video decoding apparatus, the state prediction unit is a zero run length including bits to be predicted. The bit plane decoding unit may perform Huffman decoding using the Huffman table selected based on the zero run length predicted by the state prediction unit.
[0039] 状態予測部は、ビットの状態「0」が連続して出現する確率があらかじめ設定された 閾値を下回る箇所までのゼロの個数を零ラン長として求めてもよい。  The state prediction unit may obtain, as a zero run length, the number of zeros until the probability that the bit state “0” continuously appears falls below a preset threshold.
[0040] 上記映像復号化装置において、状態予測部は、予測対象のビットの状態が、「1」 又は「0」である確率を予測し、ビットプレーン復号ィ匕部は、状態予測部にて予測され た確率に基づ!/、て決定したシンボルの生起確率を用いて、算術復号化を行ってもよ い。  In the above video decoding apparatus, the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit plane decoding unit is operated by the state prediction unit. Arithmetic decoding may be performed using the probability of occurrence of the symbol determined based on the predicted probability.
[0041] 上記映像復号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、復号化順序が、後方に位置する変換係数の同じビットプレーンに属する 全てのビットの状態が「0」である確率を予測し、ビットプレーン復号ィ匕部は、状態予測 部にて予測された零ラン長に基づ 、て決定したシンボルの生起確率を用いて、算術 復号化を行ってもよい。  [0041] In the video decoding apparatus, the state prediction unit determines that the state of the bit to be predicted is “1”, and that the decoding order is all bits belonging to the same bit plane of the conversion coefficient positioned behind. The bit-plane decoding unit predicts the probability that the state of “0” is “0”, and the bit-plane decoding unit performs arithmetic decoding using the occurrence probability of the symbol determined based on the zero run length predicted by the state prediction unit. You may
[0042] 上記映像復号化装置において、状態予測部は、復号化された基本レイヤの情報に 基づいて、基本レイヤの映像品質を向上させる拡張レイヤの変換係数のビットの状態 を予測し、ビットプレーン復号ィ匕部は、状態予測部によって予測されたビットの状態に 応じて拡張レイヤの変換係数を復号ィ匕してもよ!、。  In the above video decoding apparatus, the state prediction unit predicts, based on the decoded information of the base layer, the state of the bit of the transform coefficient of the enhancement layer for improving the video quality of the base layer, The decoding unit may decode the transform coefficients of the enhancement layer according to the state of the bit predicted by the state prediction unit !.
[0043] 上記映像復号ィ匕装置において状態予測部は、基本レイヤに含まれるエッジの情報 または基本レイヤの符号ィ匕量に基づ 、て、拡張レイヤの変換係数のビットの状態を 予測してもよい。  In the above video decoding device, the state prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on edge information included in the base layer or the base layer code amount. It is also good.
[0044] 上記映像復号化装置にお!ヽて、状態予測部は、映像を構成するフレームの変換係 数のビットの状態を、動き予測補償符号ィ匕に用いる参照フレームの情報に基づいて 予測し、ビットプレーン復号ィ匕部は、状態予測部によって予測されたビットの状態に 応じてフレームの変換係数を復号ィ匕してもょ 、。  In the video decoding apparatus, the state prediction unit predicts the state of the bit of the conversion coefficient of the frame constituting the video based on the information of the reference frame used for the motion prediction / compensation code. Then, the bit-plane decoding unit decodes the transform coefficients of the frame according to the state of the bit predicted by the state prediction unit.
[0045] 上記映像復号ィ匕装置において、状態予測部は、参照フレームに含まれるエッジの 情報または参照フレームの符号ィ匕量に基づ 、て、フレームのビットの状態を予測して ちょい。  In the video decoding apparatus, the state prediction unit predicts the state of the bits of the frame based on edge information included in the reference frame or the amount of coding of the reference frame.
[0046] 本発明の映像符号化方法は、映像を周波数変換して周波数成分を表す変換係数 を生成する変換係数生成ステップと、変換係数生成ステップにお ヽて生成された変 換係数を 2進数に変換し、複数の変換係数の同じ位のビットからなるビットプレーンを 最上位ビットから最下位ビットに至るまで生成し、上位ビットプレーン力も順に符号ィ匕 するビットプレーン符号化ステップと、先に符号化された変換係数に関する情報に基 づいて、後に符号化される変換係数のビットの状態を予測する状態予測ステップとを 備え、ビットプレーン符号化ステップは、状態予測ステップにおいて予測されたビット の状態に応じて符号ィヒを行う。 According to the video encoding method of the present invention, a transform coefficient representing a frequency component by frequency-converting the video Converting the conversion coefficients generated in the conversion coefficient generation step of generating the conversion coefficients and the conversion coefficient generation step into binary numbers, the bit plane consisting of the bits of the same order of the plurality of conversion coefficients is the most significant bit to the least significant bit Based on the information on the bit plane coding step of generating up to bits and coding the upper bit plane powers sequentially and the information on the previously coded transform coefficients, the bit states of the transform coefficients to be encoded later are And a state prediction step of predicting, and the bit plane coding step performs coding in accordance with the state of the bit predicted in the state prediction step.
[0047] 本発明の映像復号化方法は、ビットプレーン符号ィ匕された映像の符号ィ匕データを 上位ビットプレーン力 順に復号ィ匕するビットプレーン復号化ステップと、先に復号ィ匕 された変換係数に関する情報に基づいて、後に復号化される変換係数のビットの状 態を予測する状態予測ステップとを備え、ビットプレーン復号化ステップは、状態予 測部によって予測されたビットの状態に応じて復号ィ匕を行う。  According to the video decoding method of the present invention, the bit plane decoding step of decoding the bit plane coded picture data and the coded picture data of the picture in the order of higher bit plane power, and the conversion previously decoded are performed. And a state prediction step of predicting the state of bits of transform coefficients to be decoded later based on the information on the coefficient, and the bit plane decoding step is performed according to the state of the bits predicted by the state prediction unit. Perform decryption.
[0048] 本発明の映像符号ィ匕のためのプログラムは、映像を符号ィ匕するためにコンピュータ に、映像を周波数変換して周波数成分を表す変換係数を生成する変換係数生成ス テツプと、変換係数生成ステップにおいて生成された変換係数を 2進数に変換し、複 数の変換係数の同じ位のビットからなるビットプレーンを最上位ビットから最下位ビット に至るまで生成し、上位ビットプレーン力も順に符号ィ匕するビットプレーン符号化ステ ップと、先に符号化された変換係数に関する情報に基づいて、後に符号化される変 換係数のビットの状態を予測する状態予測ステップとを実行させ、ビットプレーン符号 ィ匕ステップは、状態予測ステップにお 、て予測されたビットの状態に応じて符号ィ匕を 行う。  [0048] The program for video coding according to the present invention includes a conversion coefficient generation step of converting the frequency of the video into a conversion coefficient representing a frequency component and converting the video to a computer for coding the video. The conversion coefficient generated in the coefficient generation step is converted to a binary number, and bit planes consisting of the same order bits of multiple conversion coefficients are generated from the most significant bit to the least significant bit. Perform a state prediction step of predicting the state of the bit of the transform coefficient to be encoded later based on the information on the previously encoded bit of the transform coefficient, and The plane coding step performs coding according to the state of the predicted bit in the state prediction step.
[0049] 本発明の映像復号ィ匕のためのプログラムは、ビットプレーン符号ィ匕された映像の符 号ィ匕データを復号ィ匕するために、コンピュータに、上位ビットプレーン力も順に復号 化するビットプレーン復号化ステップと、先に復号化された変換係数に関する情報に 基づいて、後に復号化される変換係数のビットの状態を予測する状態予測ステップと を実行させ、ビットプレーン復号化ステップは、状態予測部によって予測されたビット の状態に応じて復号ィヒを行う。  The program for video decoding according to the present invention is a bit for decoding the upper bit plane power in order to the computer in order to decode the bit plane coded video data of the bit plane. Performing a plane decoding step and a state prediction step of predicting a state of a bit of a transform coefficient to be decoded later based on information on the previously decoded transform coefficient; Decoding is performed according to the state of the bit predicted by the prediction unit.
[0050] 以下に説明するように、本発明には他の態様が存在する。したがって、この発明の 開示は、本発明の一部の態様の提供を意図しており、請求される発明の範囲を制限 することは意図していない。 [0050] As described below, there are other aspects of the present invention. Therefore, the present invention The disclosure is intended to provide some aspects of the present invention and is not intended to limit the scope of the claimed invention.
図面の簡単な説明 Brief description of the drawings
[図 1]図 1は、本発明の第 1の実施の形態の階層符号ィ匕部の構成を示す図 [FIG. 1] FIG. 1 is a diagram showing the configuration of a layer code portion according to a first embodiment of the present invention.
[図 2]図 2は、本発明の第 1の実施の形態の映像符号化装置の構成を示す図  [FIG. 2] FIG. 2 is a diagram showing the configuration of a video encoding apparatus according to the first embodiment of the present invention.
[図 3]図 3は、本発明の第 1の実施の形態の映像符号ィ匕装置の動作を示すフローチ ヤート  [FIG. 3] FIG. 3 is a flowchart showing the operation of the video coding apparatus according to the first embodiment of the present invention.
[図 4]図 4は、本発明の第 1の実施の形態の映像符号ィ匕装置の階層符号ィ匕処理の動 作を示すフローチャート  [FIG. 4] FIG. 4 is a flow chart showing the operation of hierarchical code processing of the video coding apparatus of the first embodiment of the present invention.
[図 5]図 5は、本発明の状態予測についての説明図  [FIG. 5] FIG. 5 is an explanatory view of the state prediction of the present invention.
[図 6]図 6は、本発明の第 2の実施の形態の映像復号化装置の構成を示す図  [FIG. 6] FIG. 6 is a diagram showing the configuration of a video decoding apparatus according to a second embodiment of the present invention.
[図 7]図 7は、本発明の第 2の実施の形態の階層復号ィ匕部の構成を示す図 [FIG. 7] FIG. 7 is a diagram showing the configuration of a hierarchical decoding module according to a second embodiment of the present invention.
[図 8]図 8は、本発明の第 2の実施の形態の映像復号ィ匕装置の動作を示すフローチ ヤート [FIG. 8] FIG. 8 is a flowchart showing the operation of the video decoding apparatus according to the second embodiment of the present invention.
[図 9]図 9は、本発明の第 2の実施の形態の映像復号ィ匕装置の階層復号ィ匕処理の動 作を示すフローチャート  [FIG. 9] FIG. 9 is a flowchart showing an operation of hierarchical decoding processing of the video decoding apparatus according to the second embodiment of the present invention.
[図 10]図 10は、本発明の第 3の実施の形態の映像符号化装置の構成を示す図 [図 11]図 11は、本発明の第 3の実施の形態の映像符号化装置の動作を示すフロー チャート  [FIG. 10] FIG. 10 is a diagram showing a configuration of a video encoding apparatus according to a third embodiment of the present invention. [FIG. 11] FIG. 11 is a diagram showing a video encoding apparatus according to the third embodiment of the present invention. Flow chart showing operation
[図 12]図 12は、本発明の第 4の実施の形態の映像復号化装置の構成を示す図 [図 13]図 13は、本発明の第 4の実施の形態の映像復号化装置の動作を示すフロー チャート  [FIG. 12] FIG. 12 shows a configuration of a video decoding apparatus according to a fourth embodiment of the present invention. [FIG. 13] FIG. 13 shows a video decoding apparatus according to the fourth embodiment of the present invention. Flow chart showing operation
[図 14]図 14は、 MPEG -4 FGSの映像符号ィ匕装置の構成を示す図  [FIG. 14] FIG. 14 is a diagram showing the configuration of a video coding device of MPEG-4 FGS.
[図 15]図 15は、 MPEG -4 FGSの映像復号ィ匕装置の構成を示す図  [FIG. 15] FIG. 15 is a diagram showing the configuration of the MPEG-4 FGS video decoding device.
[図 16A]図 16Aは、 DCT係数のスキャンについての説明図  [FIG. 16A] FIG. 16A is an explanatory diagram of scanning of DCT coefficients.
[図 16B]図 16Bは、 DCT係数のビットプレーンについての説明図  [FIG. 16B] FIG. 16B is an explanatory diagram of bit planes of DCT coefficients.
[図 17]図 17は、ビットプレーン符号ィ匕についての説明図  [FIG. 17] FIG. 17 is an explanatory diagram of a bit plane code line.
[図 18]図 18は、従来の階層符号化部の構成を示す図 発明を実施するための最良の形態 [FIG. 18] FIG. 18 is a diagram showing the configuration of a conventional hierarchical coding unit. BEST MODE FOR CARRYING OUT THE INVENTION
[0052] 以下に本発明の詳細な説明を述べる、ただし、以下の詳細な説明と添付の図面は 、発明を限定するものではない。発明の範囲は添付の請求の範囲により規定される。  Hereinafter, the present invention will be described in detail, but the following detailed description and the attached drawings do not limit the present invention. The scope of the invention is defined by the appended claims.
[0053] 本実施の形態の映像符号化装置は、映像を周波数変換して周波数成分を表す変 換係数を生成する変換係数生成部と、変換係数生成部にて生成された変換係数を 2進数に変換し、複数の変換係数の同じ位のビットからなるビットプレーンを最上位ビ ットから最下位ビットに至るまで生成し、上位ビットプレーン力も順に符号ィ匕するビット プレーン符号ィ匕部と、先に符号化された変換係数に関する情報に基づいて、後に符 号化される変換係数のビットの状態を予測する状態予測部とを備え、ビットプレーン 符号ィ匕部は、状態予測部によって予測されたビットの状態に応じて符号ィ匕を行う。  The video encoding apparatus according to the present embodiment is configured to convert the frequency of the video to generate a conversion coefficient representing a frequency component, and to convert the conversion coefficient generated by the conversion coefficient generation unit into a binary number. To generate bit planes consisting of the same-order bits of a plurality of transform coefficients from the most significant bit to the least significant bit, and the higher order bit plane powers in order as well. And a state prediction unit that predicts the state of bits of the conversion coefficient to be encoded later based on the information on the previously encoded conversion coefficient, and the bit plane coding unit is predicted by the state prediction unit. It performs coding according to the state of the bit.
[0054] このように先に符号化された変換係数に関する情報に基づいて後に符号化される ビットの状態、すなわち後に符号ィ匕されるビットにおける「1」と「0」の出現確率を予測 することによって、効率の良い符号ィ匕を行うことができる。例えば、出現確率の高い組 合せに符号ィ匕ビット長が短い符号を割り当てることにより、符号ィ匕効率が向上する。  As described above, the probability of occurrence of “1” and “0” in the state of the bit to be encoded later, ie, the bits to be encoded later, is predicted based on the information on the previously encoded transform coefficient. In this way, efficient code can be obtained. For example, by assigning a code having a short code bit length to a combination having a high occurrence probability, the code efficiency is improved.
[0055] 上記映像符号ィ匕装置において、状態予測部は、上位のビットプレーンの情報に基 づ 、て、下位のビットプレーンのビットの状態を予測してもよ 、。  In the video coding apparatus, the state prediction unit may predict the state of the bits of the lower bit plane based on the information of the upper bit plane.
[0056] このように上位のビットプレーンの状態と下位のビットプレーンの状態との相関関係 を利用することによって、下位のビットプレーンのビットの状態を適切に予測すること ができる。また、符号ィ匕中の係数のビットが属する領域以外の情報を使うことなくビット の状態を予測できる。  By utilizing the correlation between the state of the upper bit plane and the state of the lower bit plane in this manner, it is possible to appropriately predict the state of the bits of the lower bit plane. In addition, the state of bits can be predicted without using information other than the region to which the coefficient bits in the code belong.
[0057] 上記映像符号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにおいて、ビットの状態「1」が現れた変換係数の個数に応じてビットの状態 を予測してもよい。  In the video coding apparatus, the state prediction unit predicts the state of the bit according to the number of transform coefficients in which the state of the bit “1” appears in the bit plane higher than the bit to be predicted. It is also good.
[0058] この構成により、ビットの状態の予測精度が向上し、符号ィ匕効率を高めることができ る。  With this configuration, the prediction accuracy of the bit state can be improved, and the coding efficiency can be improved.
[0059] 上記映像符号ィ匕装置において、状態予測部は、予測対象のビットの変換係数にお V、て上位ビットにビットの状態「1」が現れたか否かに基づ ヽて、ビットの状態を予測し てもよい。 [0060] この構成により、ビットの状態の予測精度が向上し、符号ィ匕効率を高めることができ る。 [0059] In the video coding apparatus, the state prediction unit determines whether a bit state “1” appears in the high-order bit and V in the conversion coefficient of the bit to be predicted. The state may be predicted. According to this configuration, the prediction accuracy of the bit state can be improved, and the coding efficiency can be improved.
[0061] 上記映像符号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにお 、てビットの状態「1」が現れた変換係数のうちで、予測対象のビットの 変換係数よりも符号ィ匕順序が後方に位置し、かつ、最も近い変換係数までの距離に 基づいて、ビットの状態を予測してもよい。  [0061] In the video coding apparatus, the state prediction unit converts the bit to be predicted among the conversion coefficients in which the state of bit “1” appears in the bit plane higher than the bit to be predicted. The state of the bit may be predicted based on the distance to the nearest transform coefficient where the code order is located behind the coefficient and the closest to the transform coefficient.
[0062] この構成により、零ラン長を予測することができるので、効率の良い符号ィ匕を行うこ とがでさる。  According to this configuration, since the zero run length can be predicted, it is possible to perform efficient coding.
[0063] 上記映像符号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにおいて、符号ィ匕順序が最も後方の位置でビットの状態「1」が現れた変換 係数に基づ 、て、ビットの状態を予測してもよ 、。  In the above video coding apparatus, the state prediction unit is based on the conversion coefficient in which the state of the bit “1” appears at the last position of the code order in the bit plane higher than the bit to be predicted. First, let's predict the state of the bit.
[0064] この構成により、ビットの状態の予測精度が向上し、符号ィ匕効率を高めることができ る。  With this configuration, the prediction accuracy of the bit state can be improved, and the coding efficiency can be improved.
[0065] 上記映像符号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、符号ィ匕順序が後方に位置する変換係数の同じビットプレーンに属する全 てのビットの状態が「0」である確率を予測し、ビットプレーン符号ィ匕部は、確率に基づ いて、零ラン長符号ィ匕を行ってもよい。  [0065] In the above-mentioned video encoding device, the state prediction unit is configured such that the state of the bit to be predicted is “1”, and all the code coefficients belonging to the same bit plane belonging to the same bit plane of the conversion coefficient located behind It predicts the probability that the bit state is "0", and the bit-plane coder may perform zero run length code based on the probability.
[0066] この構成により、そのビットプレーンに現れる最後尾の「1」を予測して、効率の良い 符号ィ匕を行うことができる。  [0066] According to this configuration, it is possible to predict the last "1" appearing in the bit plane and perform efficient coding.
[0067] 上記映像符号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにお 、て、あら力じめ定められた順番以降の符号ィ匕順序の変換係数にビッ トの状態「1」を含む力否かを判定し、ビットの状態「1」を含まないとの判定に応じて、 予測対象のビットの状態が「1」であり、かつ、符号ィ匕順序が後方に位置する変換係 数の同じビットプレーンに属する全てのビットの状態が「0」である確率を予測し、ビット プレーン符号ィ匕部は、確率に基づいて、零ラン長符号ィ匕を行ってもよい。  [0067] In the above video coding apparatus, the state prediction unit is configured to convert bits of the coding coefficients in the order of the codes after the predetermined order in the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and in accordance with the determination that the state “1” of the bit is not included, the state of the bit to be predicted is “1” and the order of the codes is The probability of all bits belonging to the same bit plane of the conversion coefficient located behind being "0" is predicted, and the bit plane code block performs a zero run length code block based on the probability. May be
[0068] この構成により、符号ィ匕順序において最後の「1」を表すビットプレーン終了信号に よって省略できる最後尾の「1」以降の「0」の個数を予測でき、最後尾の「1」以降の「 0」の個数が少ない場合には、ビットプレーン終了信号の符号ィ匕を行わないことにより 、符号ィ匕効率を向上できる。 With this configuration, it is possible to predict the number of "0" s after the last "1" that can be omitted by the bit plane end signal representing the last "1" in the code sequence, and the last "1" If the number of subsequent “0s” is small, the bit plane end signal is not coded. The code efficiency can be improved.
[0069] 上記映像符号化装置にお!、て、状態予測部は、予測対象のビットを含む零ラン長 を予測し、ビットプレーン符号ィ匕部は、状態予測部にて予測された零ラン長に基づい て選択したハフマンテーブルを用いてハフマン符号化を行ってもょ 、。 In the video coding apparatus, the state prediction unit predicts a zero run length including bits to be predicted, and the bit plane coding unit predicts a zero run predicted by the state prediction unit. Perform Huffman coding using a Huffman table selected based on length.
[0070] この構成により、予測した零ラン長に基づいて、出現確率の高い零ラン長に短い符 号を割り当てることが可能で、符号化効率が向上する。 According to this configuration, it is possible to assign a short code to the zero run length with high occurrence probability based on the predicted zero run length, and the coding efficiency is improved.
[0071] 上記映像符号ィ匕装置において、状態予測部は、ビットの状態「0」が連続して出現 する確率があらかじめ設定された閾値を下回る箇所までのゼロの個数を零ラン長とし て求めてもよい。 [0071] In the video coding apparatus, the state prediction unit determines the number of zeros up to the point where the probability that the bit state "0" appears continuously falls below a preset threshold as the zero run length. May be
[0072] この構成により、零ラン長を適切に予測することができる。  According to this configuration, the zero run length can be appropriately predicted.
[0073] 上記映像符号化装置において、状態予測部は、予測対象のビットの状態が「1」又 は「0」である確率を予測し、ビットプレーン符号ィヒ部は、状態予測部にて予測された 確率に基づ 、て決定したシンボルの生起確率を用いて、算術符号ィ匕を行ってもよ!ヽ  In the above-described video encoding apparatus, the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit-plane coding part uses the state prediction unit. Arithmetic coding may be performed using the occurrence probability of the symbol determined based on the predicted probability.
[0074] この構成により、符号ィ匕中の変換係数に短い符号を割り当てることが可能で、符号 化効率が向上する。 According to this configuration, it is possible to assign a short code to the transform coefficients in the code, and coding efficiency is improved.
[0075] 上記映像符号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、符号ィ匕順序が後方に位置する変換係数の同じビットプレーンに属する全 てのビットの状態が「0」である確率を予測し、ビットプレーン符号ィ匕部は、状態予測部 にて予測された確率に基づ ヽて決定したシンボルの生起確率を用いて、算術符号 化を行ってもよい。  [0075] In the above video encoding device, the state prediction unit determines that the state of the bit to be predicted is "1", and that all the codes belonging to the same bit plane of the transform coefficient whose code sequence is located behind. The probability of the bit state being "0" is predicted, and the bit plane coding unit performs arithmetic coding using the occurrence probability of the symbol determined based on the probability predicted by the state prediction unit. You may go.
[0076] この構成により、符号ィ匕中の変換係数に短い符号を割り当てることが可能で、符号 化効率が向上する。  According to this configuration, it is possible to assign a short code to the transform coefficients in the code, and coding efficiency is improved.
[0077] 上記映像符号化装置は、映像を単独で復号ィ匕可能な基本レイヤと基本レイヤの映 像品質を向上させる拡張レイヤとに階層符号ィ匕する映像符号ィ匕装置であって、状態 予測部は、基本レイヤの情報に基づいて、拡張レイヤの変換係数のビットの状態を 予測し、ビットプレーン符号ィ匕部は、状態予測部によって予測されたビットの状態に 応じて拡張レイヤの変換係数を符号ィ匕してもよ!、。 [0078] この構成により、拡張レイヤと相関の強い基本レイヤの情報に基づいて、拡張レイ ャの変換係数のビットの状態を適切に予測することができる。 The video coding apparatus is a video coding apparatus that performs hierarchical coding on a base layer capable of decoding video independently and an enhancement layer for improving the video quality of the base layer. The prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on the information of the base layer, and the bit plane coding unit converts the enhancement layer according to the state of the bit predicted by the state prediction unit. You may sign the coefficients! According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the enhancement layer based on the information of the enhancement layer and the base layer strongly correlated.
[0079] 上記映像符号化装置にお!、て、状態予測部は、基本レイヤに含まれるエッジの情 報または基本レイヤの符号ィ匕量に基づ 、て、拡張レイヤの変換係数のビットの状態 を予測してもよい。 [0079] In the above video coding apparatus, the state prediction unit is configured to use the bit of the transform coefficient of the enhancement layer based on the edge information included in the base layer or the code quantity of the base layer. You may predict the condition.
[0080] この構成により、拡張レイヤの変換係数のビットの状態を適切に予測できる。  According to this configuration, it is possible to appropriately predict the state of bits of the transform coefficient of the enhancement layer.
[0081] 上記映像符号ィ匕装置において、状態予測部は、映像を構成するフレームの変換係 数のビットの状態を、動き予測補償符号ィ匕に用いる参照フレームの情報に基づいて 予測し、ビットプレーン符号ィ匕部は、状態予測部によって予測されたビットの状態に 応じてフレームの変換係数を符号ィ匕してもょ 、。 In the video coding apparatus, the state prediction unit predicts the state of the bit of the conversion coefficient of the frame constituting the video, based on the information of the reference frame used for the motion prediction / compensation code, The plane code unit may code the transform coefficient of the frame according to the state of the bit predicted by the state prediction unit.
[0082] この構成により、符号ィ匕中のフレームと相関の強い参照フレームの情報に基づいてAccording to this configuration, based on the information of the reference frame strongly correlated with the frame in the code.
、符号ィ匕中のフレームの変換係数のビットの状態を適切に予測することができる。参 照フレームの特性を用いて動き予測補償の残差信号の状態を予測することが可能で, The state of the bits of the transform coefficients of the frame in the code can be properly predicted. It is possible to predict the state of the residual signal of motion prediction compensation using the characteristics of the reference frame.
、符号ィ匕効率を向上できる。 The code efficiency can be improved.
[0083] 上記映像符号ィ匕装置において、状態予測部は、参照フレームに含まれるエッジの 情報または参照フレームの符号ィ匕量に基づ 、て、フレームの変換係数のビットの状 態を予測してもよい。 In the above video coding device, the state prediction unit predicts the bit state of the transform coefficient of the frame based on edge information included in the reference frame or the code amount of the reference frame. May be
[0084] この構成により、符号ィ匕中のフレームの変換係数のビットの状態を適切に予測でき る。  According to this configuration, it is possible to properly predict the bit state of the transform coefficient of the frame in the code.
[0085] 本実施の形態の映像復号化装置は、ビットプレーン符号化された映像の符号化デ ータを上位ビットプレーン力 順に復号ィ匕するビットプレーン復号ィ匕部と、先に復号 ィ匕された変換係数に関する情報に基づいて、後に復号化される変換係数のビットの 状態を予測する状態予測部とを備え、ビットプレーン復号ィ匕部は、状態予測部によつ て予測されたビットの状態に応じて復号ィ匕を行う。  [0085] The video decoding apparatus according to the present embodiment decodes a bit-plane coded video encoded data in the order of higher bit-plane coding, and a bit-plane decoding module that decodes the data first. And a state prediction unit for predicting the state of bits of the conversion coefficient to be decoded later based on the information on the converted conversion coefficient, the bit plane decoding unit including bits predicted by the state prediction unit. Decode according to the state of.
[0086] このように先に復号ィ匕された変換係数に関する情報に基づいて後に復号ィ匕される ビットの状態、すなわち後に復号ィ匕されるビットにおける「1」と「0」の出現確率を予測 することによって、出現確率に応じて効率良く符号化された符号化データを復号化す ることがでさる。 [0087] 上記映像復号ィ匕装置において、状態予測部は、上位のビットプレーンの情報に基 づ 、て、下位のビットプレーンのビットの状態を予測してもよ 、。 Thus, based on the information on the transform coefficient decoded earlier, the state of the bit to be decoded later, ie, the appearance probabilities of “1” and “0” in the bit to be decoded later By prediction, it is possible to decode encoded data efficiently encoded according to the appearance probability. [0087] In the video decoding apparatus, the state prediction unit may predict the state of the bits of the lower bit plane based on the information of the upper bit plane.
[0088] このように上位のビットプレーンの状態と下位のビットプレーンの状態との相関関係 を利用することによって、下位のビットプレーンのビットの状態を適切に予測すること ができ、効率良く符号化された符号化データを復号化することができる。また、復号 化中の係数のビットが属する領域以外の情報を使うことなくビットの状態を予測できる  By utilizing the correlation between the state of the upper bit plane and the state of the lower bit plane in this manner, the state of the bits of the lower bit plane can be appropriately predicted, and coding can be performed efficiently. The encoded data can be decoded. In addition, the state of bits can be predicted without using information other than the region to which the bit of coefficient being decoded belongs.
[0089] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにおいて、ビットの状態「1」が現れた変換係数の個数に応じてビットの状態 を予測してもよい。 In the video decoding device, the state prediction unit predicts the state of the bit according to the number of transform coefficients in which the state of the bit “1” appears in the bit plane higher than the bit to be predicted. It is also good.
[0090] この構成により、ビットの状態の予測精度を向上して効率良く符号化された符号ィ匕 データを復号化することができる。  According to this configuration, it is possible to decode coded data encoded efficiently by improving the prediction accuracy of the bit state.
[0091] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットの変換係数にお[0091] In the above video decoding apparatus, the state prediction unit determines the conversion coefficient of the bit to be predicted.
V、て上位ビットにビットの状態「1」が現れたか否かに基づ ヽて、ビットの状態を予測し てもよい。 The state of the bit may be predicted based on whether or not the state of the bit "1" appears in V, the upper bits.
[0092] この構成により、ビットの状態の予測精度を向上して効率良く符号化された符号ィ匕 データを復号化することができる。  [0092] With this configuration, it is possible to improve the prediction accuracy of the bit state and decode efficiently encoded code data.
[0093] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにお 、てビットの状態「1」が現れた変換係数のうちで、予測対象のビットの 変換係数よりも復号ィ匕順序が後方に位置し、かつ最も近い変換係数までの距離に基 づいて、ビットの状態を予測してもよい。 [0093] In the above video decoding apparatus, the state prediction unit converts the bit to be predicted among the conversion coefficients in which the bit state “1” appears in the bit plane higher than the bit to be predicted. The state of the bit may be predicted based on the distance to the nearest transform coefficient where the decoding order is located behind the coefficient and the nearest to it.
[0094] この構成により、零ラン長を予測して効率良く符号化された符号化データを復号ィ匕 することができる。 According to this configuration, it is possible to predict zero run length and decode coded data efficiently coded.
[0095] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにおいて、復号ィ匕順序が最も後方の位置でビットの状態「1」が現れた変換 係数に基づ 、て、ビットの状態を予測してもよ 、。  [0095] In the above video decoding apparatus, the state prediction unit is based on the conversion coefficient at which the state of the bit "1" appeared at the position of the rearmost in the decoding order in the bit plane higher than the bit to be predicted. First, let's predict the state of the bit.
[0096] この構成により、ビットの状態の予測精度を向上して効率良く符号化された符号ィ匕 データを復号化することができる。 [0097] 上記映像復号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、復号ィ匕順序が後方に位置する変換係数の同じビットプレーンに属する全 てのビットの状態が「0」である確率を予測し、ビットプレーン復号ィ匕部は、確率に基づ いて、零ラン長復号ィ匕を行ってもよい。 With this configuration, it is possible to improve the prediction accuracy of the bit state and decode efficiently encoded code data. [0097] In the above video decoding apparatus, the state prediction unit determines that the state of the bit to be predicted is “1”, and that the decoding order is all belonging to the same bit plane of the conversion coefficient located behind. By predicting the probability that the bit state is "0", the bit plane decoding unit may perform zero run length decoding based on the probability.
[0098] この構成により、そのビットプレーンに現れる最後の「1」を予測して効率良く符号ィ匕 された符号化データを復号化することができる。  According to this configuration, it is possible to efficiently decode coded data encoded by predicting the last “1” appearing in the bit plane.
[0099] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットより上位のビット プレーンにお 、て、あら力じめ定められた順番以降の復号ィ匕順序の変換係数にビッ トの状態「1」を含む力否かを判定し、ビットの状態「1」を含まないとの判定に応じて、 予測対象のビットの状態が「1」であり、かつ、復号ィ匕順序が後方に位置する変換係 数の同じビットプレーンに属する全てのビットの状態が「0」である確率を予測し、ビット プレーン復号ィ匕部は、確率に基づいて、零ラン長復号ィ匕を行ってもよい。  In the above video decoding apparatus, the state prediction unit is configured to convert the decoding coefficients in the decoding order after the predetermined order in the bit plane higher than the bit to be predicted. It is determined whether or not the state includes the state “1”, and the state of the bit to be predicted is “1” and the decoding order is determined according to the determination that the state “1” of the bit is not included. The probability of all bits belonging to the same bitplane of the conversion coefficient located behind being "0" is predicted, and the bitplane decoding unit performs zero run length decoding based on the probability. May be
[0100] この構成により、最後尾の「1」以降の「0」の個数が少ない場合に、ビットプレーン終 了信号の符号化を行わないことにより、効率良く符号化された符号化データを復号 化できる。  With this configuration, when the number of “0” s after “1” at the end is small, coding of the bit plane end signal is not performed, thereby efficiently decoding the coded data encoded. Can be
[0101] 上記映像復号ィ匕装置において、状態予測部は、予測対象のビットを含む零ラン長 を予測し、ビットプレーン復号ィ匕部は、状態予測部にて予測された零ラン長に基づい て選択したハフマンテーブルを用いてハフマン復号ィ匕を行ってもょ 、。  In the video decoding apparatus, the state prediction unit predicts a zero run length including bits to be predicted, and the bit plane decoding unit is based on the zero run length predicted by the state prediction unit. Do the Huffman decoding using the selected Huffman table.
[0102] この構成により、予測した零ラン長に応じて選択されたノヽフマンテーブルを用いて 効率良く符号化された符号化データを復号化することができる。  According to this configuration, it is possible to decode encoded data efficiently encoded using the Nofman table selected according to the predicted zero run length.
[0103] 状態予測部は、ビットの状態「0」が連続して出現する確率があらかじめ設定された 閾値を下回る箇所までのゼロの個数を零ラン長として求めてもよい。  The state prediction unit may obtain, as the zero run length, the number of zeros until the probability that the bit state “0” appears continuously falls below a preset threshold.
[0104] この構成により、零ラン長を適切に予測することができる。 According to this configuration, the zero run length can be appropriately predicted.
[0105] 上記映像復号化装置において、状態予測部は、予測対象のビットの状態が、「1」 又は「0」である確率を予測し、ビットプレーン復号ィ匕部は、状態予測部にて予測され た確率に基づ!/、て決定したシンボルの生起確率を用いて、算術復号化を行ってもよ い。  In the above video decoding apparatus, the state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit plane decoding unit uses the state prediction unit to Arithmetic decoding may be performed using the probability of occurrence of the symbol determined based on the predicted probability.
[0106] この構成により、ビットの状態に応じて選択された生起確率を用いて効率良く符号 化された符号化データを復号化できる。 According to this configuration, the code can be efficiently encoded using the occurrence probability selected according to the state of the bit. The encoded data can be decoded.
[0107] 上記映像復号化装置において、状態予測部は、予測対象のビットの状態が「1」で あり、かつ、復号化順序が、後方に位置する変換係数の同じビットプレーンに属する 全てのビットの状態が「0」である確率を予測し、ビットプレーン復号ィ匕部は、状態予測 部にて予測された零ラン長に基づ 、て決定したシンボルの生起確率を用いて、算術 復号化を行ってもよい。  In the video decoding apparatus, the state prediction unit determines that the state of the bit to be predicted is “1”, and that the decoding order is all bits belonging to the same bit plane of the conversion coefficient positioned behind. The bit-plane decoding unit predicts the probability that the state of “0” is “0”, and the bit-plane decoding unit performs arithmetic decoding using the occurrence probability of the symbol determined based on the zero run length predicted by the state prediction unit. You may
[0108] この構成により、ビットの状態に応じて選択された生起確率を用いて効率良く符号 化された符号化データを復号化できる。  According to this configuration, it is possible to decode encoded data efficiently encoded using the occurrence probability selected according to the state of the bit.
[0109] 上記映像復号ィ匕装置において、状態予測部は、復号化された基本レイヤの情報に 基づいて、基本レイヤの映像品質を向上させる拡張レイヤの変換係数のビットの状態 を予測し、ビットプレーン復号ィ匕部は、状態予測部によって予測されたビットの状態に 応じて拡張レイヤの変換係数を復号ィ匕してもよ!、。  In the above video decoding apparatus, the state prediction unit predicts, based on the decoded information of the base layer, the state of the bit of the transform coefficient of the enhancement layer for improving the video quality of the base layer, The plane decoding unit may decode the transform coefficients of the enhancement layer according to the state of the bit predicted by the state prediction unit !.
[0110] この構成により、拡張レイヤと相関の強い基本レイヤの情報に基づいて、拡張レイ ャの変換係数のビットの状態を適切に予測することができる。  According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the enhancement layer based on the information of the enhancement layer and the base layer strongly correlated.
[0111] 上記映像復号ィ匕装置において状態予測部は、基本レイヤに含まれるエッジの情報 または基本レイヤの符号ィ匕量に基づ 、て、拡張レイヤの変換係数のビットの状態を 予測してもよい。  In the above video decoding apparatus, the state prediction unit predicts the bit state of the transform coefficient of the enhancement layer based on edge information included in the base layer or the code quantity of the base layer. It is also good.
[0112] この構成により、復号ィ匕中の拡張レイヤの変換係数のビットの状態を適切に予測で きる。  According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the enhancement layer in decoding.
[0113] 上記映像復号ィ匕装置において、状態予測部は、映像を構成するフレームの変換係 数のビットの状態を、動き予測補償符号ィ匕に用いる参照フレームの情報に基づいて 予測し、ビットプレーン復号ィ匕部は、状態予測部によって予測されたビットの状態に 応じてフレームの変換係数を復号ィ匕してもょ 、。  In the above video decoding device, the state prediction unit predicts the state of the bit of the conversion coefficient of the frame making up the video based on the information of the reference frame used for the motion prediction / compensation code, and the bit is The plane decoding unit may decode the transform coefficients of the frame according to the state of the bit predicted by the state prediction unit.
[0114] この構成により、復号ィ匕中のフレームと相関の強い参照フレームの情報に基づいて 、復号ィ匕中のフレームの変換係数のビットの状態を適切に予測することができる。  According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the frame in decoding based on the information of the reference frame having a strong correlation with the frame in decoding.
[0115] 上記映像復号ィ匕装置において、状態予測部は、参照フレームに含まれるエッジの 情報または参照フレームの符号ィ匕量に基づ 、て、フレームのビットの状態を予測して ちょい。 [0116] この構成により、復号ィ匕中のフレームの変換係数のビットの状態を適切に予測でき る。 In the above video decoding apparatus, the state prediction unit predicts the state of the bits of the frame based on edge information included in the reference frame or the amount of coding of the reference frame. According to this configuration, it is possible to appropriately predict the bit state of the transform coefficient of the frame in the decoding process.
[0117] 本実施の形態の映像符号化方法は、映像を周波数変換して周波数成分を表す変 換係数を生成する変換係数生成ステップと、変換係数生成ステップにお ヽて生成さ れた変換係数を 2進数に変換し、複数の変換係数の同じ位のビットからなるビットプレ ーンを最上位ビットから最下位ビットに至るまで生成し、上位ビットプレーン力 順に 符号ィ匕するビットプレーン符号化ステップと、先に符号化された変換係数に関する情 報に基づいて、後に符号化される変換係数のビットの状態を予測する状態予測ステ ップとを備え、ビットプレーン符号化ステップは、状態予測ステップにおいて予測され たビットの状態に応じて符号化を行う。  In the video encoding method according to the present embodiment, a transform coefficient generating step of frequency converting a video to generate a transform coefficient representing a frequency component, and a transform coefficient generated in the transform coefficient generating step Is converted to a binary number, and a bit plane coding step of generating bit planes consisting of the same order bits of a plurality of transform coefficients from the most significant bit to the least significant bit and coding in the order of upper bit plane power The state prediction step of predicting the state of the bits of the transform coefficient to be encoded later based on the information on the previously encoded transform coefficient, the bit plane encoding step comprising the steps of: Coding is performed according to the state of the predicted bit.
[0118] この構成により、上記した映像符号ィ匕装置と同様に、効率の良い符号ィ匕を行うこと ができる。上記した映像符号化装置の各種の構成を、本実施の形態の映像符号ィ匕 方法に適用することも可能である。  According to this configuration, it is possible to perform efficient coding as in the video coding apparatus described above. It is also possible to apply various configurations of the above video coding apparatus to the video coding method of the present embodiment.
[0119] 本実施の形態の映像復号化方法は、ビットプレーン符号ィ匕された映像の符号ィ匕デ ータを上位ビットプレーン力 順に復号ィ匕するビットプレーン復号化ステップと、先に 復号化された変換係数に関する情報に基づいて、後に復号化される変換係数のビッ トの状態を予測する状態予測ステップとを備え、ビットプレーン復号化ステップは、状 態予測部によって予測されたビットの状態に応じて復号ィ匕を行う。  In the video decoding method according to the present embodiment, a bit-plane decoding step of decoding bit-code coded video coded data in order of upper bit-plane power, and decoding first. And a state prediction step of predicting the state of bits of the conversion coefficient to be decoded later based on the information on the converted conversion coefficient, the bit plane decoding step including the state of the bit predicted by the state prediction unit. Perform decryption according to.
[0120] この構成により、上記した映像復号ィ匕装置と同様に、効率良く符号化された符号化 データを復号化することができる。上記した映像復号化装置の各種の構成を、本実 施の形態の映像復号ィ匕方法に適用することも可能である。  With this configuration, it is possible to decode encoded data that has been efficiently encoded, as in the video decoding device described above. It is also possible to apply various configurations of the video decoding apparatus described above to the video decoding method of the present embodiment.
[0121] 本実施の形態の映像符号ィ匕のためのプログラムは、映像を符号ィ匕するためにコン ピュータに、映像を周波数変換して周波数成分を表す変換係数を生成する変換係 数生成ステップと、変換係数生成ステップにお ヽて生成された変換係数を 2進数に 変換し、複数の変換係数の同じ位のビットからなるビットプレーンを最上位ビットから 最下位ビットに至るまで生成し、上位ビットプレーン力 順に符号ィ匕するビットプレー ン符号化ステップと、先に符号化された変換係数に関する情報に基づいて、後に符 号化される変換係数のビットの状態を予測する状態予測ステップとを実行させ、ビット プレーン符号化ステップは、状態予測ステップにお 、て予測されたビットの状態に応 じて符号化を行う。 The program for video coding according to the present embodiment is a conversion coefficient generation step of frequency converting the video into a computer and generating a conversion coefficient representing a frequency component in order to code the video. Then, the conversion coefficient generated in the conversion coefficient generation step is converted into a binary number, and a bit plane consisting of the same order bits of a plurality of conversion coefficients is generated from the most significant bit to the least significant bit, Bit-Plane Force A bit-plane encoding step of encoding in order, and a state prediction step of predicting the state of bits of transform coefficients to be encoded later based on information on transform coefficients encoded earlier. Let the bit run The plane coding step performs coding according to the state of the predicted bit in the state prediction step.
[0122] この構成により、上記した映像符号ィ匕装置と同様に、効率の良い符号ィ匕を行うこと ができる。上記した映像符号化装置の各種の構成を、本実施の形態のプログラムに 適用することも可能である。  According to this configuration, it is possible to perform efficient coding as in the video coding apparatus described above. It is also possible to apply the various configurations of the video encoding device described above to the program of the present embodiment.
[0123] 本実施の形態の映像復号ィ匕のためのプログラムは、ビットプレーン符号ィ匕された映 像の符号ィ匕データを復号ィ匕するために、コンピュータに、上位ビットプレーン力も順 に復号ィ匕するビットプレーン復号化ステップと、先に復号ィ匕された変換係数に関する 情報に基づいて、後に復号化される変換係数のビットの状態を予測する状態予測ス テツプとを実行させ、ビットプレーン復号化ステップは、状態予測部によって予測され たビットの状態に応じて復号化を行う。  The program for video decoding according to the present embodiment is to decode the upper bit plane power in order to the computer in order to decode the bit plane coded video data of the bit plane code. The bit plane decoding step is performed, and the state prediction step is performed to predict the state of bits of the transform coefficient to be decoded later based on the information on the transform coefficient decoded earlier. The decoding step performs decoding in accordance with the state of the bit predicted by the state prediction unit.
[0124] この構成により、上記した映像復号ィ匕装置と同様に、効率良く符号化された符号化 データを復号化することができる。本発明の映像復号化装置の各種の構成を、本発 明のプログラムに適用することも可能である。  With this configuration, it is possible to decode encoded data that has been efficiently encoded, as in the video decoding device described above. It is also possible to apply various configurations of the video decoding apparatus of the present invention to the program of the present invention.
[0125] 本実施の形態に係る映像符号ィ匕装置および方法によれば、先に符号化された変 換係数に関する情報に基づいて後に符号化されるビットの状態、すなわち後に符号 化されるビットにおける「 1」と「0」の出現確率を予測することによって、効率の良!ヽ符 号ィ匕を行うことができるというすぐれた効果を有する。  According to the video encoding device and method according to the present embodiment, the state of the bit to be encoded later based on the information on the previously encoded transformation coefficient, ie, the bit to be encoded later. Good efficiency by predicting the appearance probability of '1' and '0' in It has the excellent effect of being able to perform the ヽ sign.
[0126] 以下、本発明の映像符号ィ匕装置および映像復号ィ匕装置の実施の形態について、 図面を参照して詳細に説明する。  Hereinafter, embodiments of the video coding device and the video decoding device according to the present invention will be described in detail with reference to the drawings.
[0127] (第 1の実施の形態)  First Embodiment
第 1の実施の形態では、スケーラブル映像符号ィ匕において、先に符号化した拡張 レイヤの情報を用いて拡張レイヤの状態を予測して符号ィ匕を行う映像符号ィ匕装置に ついて説明する。第 1の実施の形態の映像符号化装置は、符号ィ匕中の情報である 符号ィ匕情報に対して、先に符号ィ匕して映像ストリームに格納した情報である符号ィ匕 済情報から符号化情報の状態を表す状態予測パラメータを予測し、その状態予測パ ラメータに基づいて符号ィ匕情報を符号ィ匕する。  In the first embodiment, in a scalable video coding system, a video coding system will be described which predicts the state of the enhancement layer using the information of the enhancement layer encoded earlier and performs coding. The video encoding apparatus according to the first embodiment uses the encoded / decoded information, which is the information stored in the video stream as the encoded information, to the encoded information, which is the information in the encoded information. A state prediction parameter representing a state of coding information is predicted, and code information is coded based on the state prediction parameter.
[0128] 図 1は映像符号ィ匕装置 10における階層符号ィ匕部 26の構成を示すブロック図、図 2 は本発明の第 1の実施の形態に係る映像符号ィ匕装置 10の構成を示すブロック図で ある。図 1を参照して、本実施の形態の映像符号化装置 10の符号化について説明 する前に、図 2を参照して映像符号ィ匕装置 10の全体構成について説明する。 FIG. 1 is a block diagram showing the configuration of hierarchical code input unit 26 in video encoding apparatus 10, FIG. FIG. 1 is a block diagram showing the configuration of a video coding apparatus 10 according to a first embodiment of the present invention. Before describing the encoding of the video encoding device 10 according to the present embodiment with reference to FIG. 1, the overall configuration of the video encoding device 10 will be described with reference to FIG.
[0129] 図 2において、映像符号化装置 10は、映像信号入力部 12、基本レイヤ符号化部 1 4、基本レイヤ出力部 16、拡張レイヤ符号化部 18、拡張レイヤ出力部 28を有する。 拡張レイヤ符号ィ匕部 18は、差分部 20、 DCT部 22、スキャン部 24、階層符号化部 2 6を有する。 In FIG. 2, the video encoding device 10 includes a video signal input unit 12, a base layer coding unit 14, a base layer output unit 16, an enhancement layer coding unit 18, and an enhancement layer output unit 28. The enhancement layer coding unit 18 has a difference unit 20, a DCT unit 22, a scan unit 24, and a hierarchical coding unit 26.
[0130] 映像信号入力部 12は、映像符号ィ匕装置 10の外部から映像を 1フレームずつ原画 像として入力し、基本レイヤ符号ィ匕部 14と拡張レイヤ符号ィ匕部 18に出力する。映像 符号化装置 10の外部から入力される映像の有無を判定し、映像の入力がなければ 処理を終了する。  The video signal input unit 12 receives an image as an original image frame by frame from the outside of the video code processing apparatus 10 and outputs the video to the base layer code communication unit 14 and the enhancement layer coding circuit unit 18. The presence or absence of a video input from the outside of the video encoding device 10 is determined, and if there is no video input, the processing ends.
[0131] 基本レイヤ符号ィ匕部 14は、映像信号入力部 12から入力された原画像を符号ィ匕し て基本レイヤストリームを生成し、生成された基本レイヤストリームを基本レイヤ出力 部 16に出力する。基本レイヤストリ一ムを復号化して基本レイヤ復号化画像を生成し 、生成された基本レイヤ復号化画像を拡張レイヤ符号化部 18に出力する。  The base layer coding unit 14 codes the original image input from the video signal input unit 12 to generate a base layer stream, and outputs the generated base layer stream to the base layer output unit 16. Do. The base layer stream is decoded to generate a base layer decoded image, and the generated base layer decoded image is output to the enhancement layer coding unit 18.
[0132] 基本レイヤ出力部 16は、基本レイヤ符号ィ匕部 14から入力された基本レイヤストリー ムを映像符号ィ匕装置 10の外部に出力する。  The base layer output unit 16 outputs the base layer stream input from the base layer code input unit 14 to the outside of the video code input device 10.
[0133] 差分部 20は、映像信号入力部 12から入力された原画像と基本レイヤ符号ィ匕部 14 から入力された基本レイヤ復号化画像との差分をとつて差分画像を生成し、生成され た差分画像を DCT部 22に出力する。  The difference unit 20 generates a difference image by calculating the difference between the original image input from the video signal input unit 12 and the base layer decoded image input from the base layer code input unit 14. The difference image is output to the DCT unit 22.
[0134] DCT部 22は、差分部 20から入力された差分画像を 8 X 8画素ごとの領域であるブ ロックに分割し、ブロックごとに DCT変換を施して DCT係数を生成し、生成された D CT係数をスキャン部 24に出力する。この DCT部 22が本発明の変換係数生成部に 相当する。  The DCT unit 22 divides the difference image input from the difference unit 20 into blocks, which are areas of 8 × 8 pixels, performs DCT transform for each block, and generates DCT coefficients. The DCT coefficients are output to the scan unit 24. The DCT unit 22 corresponds to a transform coefficient generation unit of the present invention.
[0135] スキャン部 24は、 DCT部 22から入力された DCT係数を定められた順番でスキャン してスキャン済 DCT係数を生成し、生成されたスキャン済 DCT係数を階層符号ィ匕部 26に出力する。  The scan unit 24 scans the DCT coefficients input from the DCT unit 22 in a predetermined order to generate scanned DCT coefficients, and outputs the generated scanned DCT coefficients to the hierarchical coding unit 26. Do.
[0136] 図 1を参照して、第 1の実施の形態における階層符号ィ匕部 26について説明する。 図 1に示すように、階層符号ィ匕部 26は、零ラン長符号化部 30、可変長符号化部 32、 状態予測部 34を有する。 With reference to FIG. 1, the layer coding unit 26 in the first embodiment will be described. As shown in FIG. 1, the hierarchical coding unit 26 has a zero run length coding unit 30, a variable length coding unit 32, and a state prediction unit 34.
[0137] 零ラン長符号化部 30は、スキャン部 24から入力されたスキャン済 DCT係数をビット プレーンごとに、零ラン長とビットプレーン終了信号との組み合わせに零ラン長符号 化し、可変長符号化部 32に出力する。 The zero run length coding unit 30 zero-scans the scanned DCT coefficients input from the scan unit 24 into a combination of a zero run length and a bit plane end signal for each bit plane, and performs variable-length code coding. Output to the conversion unit 32.
[0138] 可変長符号化部 32は、零ラン長符号化部 30から入力された零ラン長およびビット プレーン終了信号の組み合わせを、状態予測部 34から入力された状態予測パラメ ータを用いて可変長符号ィ匕する。可変長符号化部 32は、符号化によって生成された 拡張レイヤストリームを拡張レイヤ出力部 28に出力する。 The variable length coding unit 32 uses the state prediction parameter input from the state prediction unit 34 to combine the zero run length and bit plane end signal input from the zero run length coding unit 30. Variable-length code. The variable-length coding unit 32 outputs the enhancement layer stream generated by the encoding to the enhancement layer output unit 28.
[0139] 状態予測部 34は、スキャン部 24から階層符号ィ匕部 26に入力したスキャン済 DCT 係数力 符号ィ匕中のビットである符号ィ匕情報の状態を予測して状態予測パラメータ を生成し、生成された状態予測パラメータを可変長符号ィ匕部 32に出力する。この状 態予測部 34が本発明の状態予測部に相当し、可変長符号ィ匕部 32がビットプレーン 符号ィ匕部に相当する。 The state prediction unit 34 generates a state prediction parameter by predicting the state of code information which is a bit in the scanned DCT coefficient power input from the scan unit 24 to the hierarchical code unit 26. Then, the generated state prediction parameters are output to the variable-length coding unit 32. The state prediction unit 34 corresponds to the state prediction unit of the present invention, and the variable length coding unit 32 corresponds to the bit plane coding unit.
[0140] 以上のように構成された映像符号ィ匕装置 10の動作を説明する。図 3は、図 1および 図 2に示す第 1の実施の形態の映像符号ィ匕装置 10の動作の一例を表すフローチヤ ートである。図 3に示すフローチャートは、図示しない記憶装置(例えば ROMゃフラ ッシュメモリなど)に格納された制御プログラムを、同じく図示しない CPUが実行する ことにより、プログラムの実行によりソフトウェア的に実行されるようにすることも可能で ある。  The operation of the video coding apparatus 10 configured as described above will be described. FIG. 3 is a flow chart showing an example of the operation of the video encoding apparatus 10 of the first embodiment shown in FIGS. 1 and 2. The flowchart shown in FIG. 3 causes a control program stored in a storage device (for example, ROM, flash memory, etc.) not shown to be executed as software by executing the program by the CPU not shown also executing the control program. It is also possible.
[0141] まず、映像符号化装置 10は、映像信号入力処理を行う (S10) o具体的には、映像 信号入力部 12が、映像符号ィ匕装置 10の外部から映像を 1フレームずつ原画像とし て入力し、基本レイヤ符号ィ匕部 14と拡張レイヤ符号ィ匕部 18に出力する。 [0141] First, the image encoding device 10 performs video signal input processing (S10) o Specifically, a video signal input unit 12, one frame image from the outside of the video encoder spoon 10 original images As an input, it is outputted to the base layer code block 14 and the enhancement layer code block 18.
[0142] 次に、映像符号化装置 10は、基本レイヤ符号化処理を行う(S12)。具体的には、 基本レイヤ符号ィ匕部 14が、映像信号入力部 12から入力された原画像を符号ィ匕して 基本レイヤストリームを生成し、生成された基本レイヤストリームを基本レイヤ出力部 1 6に出力する。基本レイヤ符号ィ匕部 14力 基本レイヤストリームを復号ィ匕して基本レ ィャ復号化画像を生成し、生成された基本レイヤ復号化画像を拡張レイヤ符号化部 18に出力する。 Next, the video encoding device 10 performs base layer encoding processing (S12). Specifically, the base layer coding unit 14 codes the original image input from the video signal input unit 12 to generate a base layer stream, and the generated base layer stream is output to the base layer output unit 1. Output to 6. Base layer code block 14 units A base layer stream is decoded and a base layer decoded image is generated, and the generated base layer decoded image is an enhancement layer coding unit. Output to 18
[0143] 基本レイヤの符号化方法としては、 MPEG— 4 AVCなど既存の方法を用いること とする。基本レイヤ復号化画像は、基本レイヤストリームを復号ィ匕して生成しなくても、 全く同一のものが生成できるのであれば、基本レイヤストリームを符号ィ匕における中 間処理で生成した復号化画像でも良!ヽ。  As an encoding method of the base layer, an existing method such as MPEG-4 AVC is used. A base layer decoded image is a decoded image generated by performing intermediate processing on a base layer stream if the base layer stream can not be generated by decoding the base layer stream, and an identical one can be generated. But good!
[0144] 次に、映像符号化装置 10は、差分処理を行う(S14)。具体的には、拡張レイヤ符 号ィ匕部 18の差分部 20が、映像信号入力部 12から入力された原画像と基本レイヤ符 号化部 14から入力された基本レイヤ復号化画像との差分をとつて差分画像を生成し 、生成された差分画像を DCT部 22に出力する。  Next, the video encoding device 10 performs differential processing (S14). Specifically, the difference unit 20 of the enhancement layer coding unit 18 calculates the difference between the original image input from the video signal input unit 12 and the base layer decoded image input from the base layer coding unit 14. As a result, a difference image is generated, and the generated difference image is output to the DCT unit 22.
[0145] 次に、映像符号化装置 10は、 DCT処理を行う(S16)。具体的には、 DCT部 22が 、差分部 20から入力された差分画像を 8 X 8画素ごとの領域のブロックに分割し、ブ ロックごとに DCT変換を施して DCT係数を生成し、生成された DCT係数をスキャン 部 24に出力する。 DCT変換の際の領域の分割方法は 8 X 8画素に限らない。映像 の周波数変換を行う方法は、 DCT変換に限らず Wavelet変換など他の直交変換を 行ってもよい。また、符号ィ匕効率は劣ることになるが、直交変換を一切行わなくてもよ い。  Next, the video encoding device 10 performs a DCT process (S16). Specifically, the DCT unit 22 divides the difference image input from the difference unit 20 into blocks of an area of 8 × 8 pixels, performs DCT transform for each block, and generates DCT coefficients. The DCT coefficient is output to the scan unit 24. The division method of the area at the time of DCT conversion is not limited to 8 × 8 pixels. The method of performing frequency conversion of video is not limited to DCT conversion, and other orthogonal conversion such as Wavelet conversion may be performed. In addition, although the coding efficiency is inferior, it is not necessary to perform any orthogonal transformation.
[0146] 次に、映像符号化装置 10は、スキャン処理を行う(S18)。具体的には、スキャン部 24力 DCT部 22から入力された DCT係数を定められた順番でスキャンしてスキャン 済 DCT係数を生成し、生成されたスキャン済 DCT係数を階層符号ィ匕部 26に出力す る。スキャン部 24にて行うスキャンは、図 16に示す順序に限られず、他の順序で行つ てもよい。  Next, the video encoding device 10 performs a scan process (S18). Specifically, the scanning unit 24 scans the DCT coefficients input from the DCT unit 22 in a predetermined order to generate scanned DCT coefficients, and generates the scanned DCT coefficients in the hierarchical coding unit 26. Output. The scan performed by the scan unit 24 is not limited to the order shown in FIG. 16, but may be performed in another order.
[0147] DCT係数をそのままスキャンするのではなぐ DCT係数をあら力じめ定められた数 値で除算して値を小さくする量子化処理を行った後にスキャンしてもよい。その場合、 復号化にぉ 、て逆量子化処理を行う必要がある。  Instead of scanning DCT coefficients as they are, scanning may be performed after quantization processing to reduce the values by dividing the DCT coefficients by a predetermined number. In that case, it is necessary to perform inverse quantization processing for decoding.
[0148] 次に、映像符号化装置 10は、階層符号化処理を行う(S20)。具体的には、階層符 号ィ匕部 26が、スキャン部 24から入力されたスキャン済 DCT係数を符号ィ匕して拡張レ ィャストリームを生成し、生成された拡張レイヤストリームを拡張レイヤ出力部 28に出 力する。階層符号化処理 (S20)について詳しく説明する。 [0149] 図 4は、階層符号化処理 (S20)の一例を示すフローチャートである。まず、階層符 号ィ匕部 26は、零ラン長符号化処理を行う(S30)。具体的には、零ラン長符号化部 3 0力 スキャン部 24から階層符号ィ匕部 26に入力されたスキャン済 DCT係数をビット プレーンごとに幾つかの零ラン長とビットプレーン終了信号との組み合わせに零ラン 長符号化し、可変長符号化部 32に出力する。ビットプレーン終了信号が ONであると は、零ラン信号のあとに続く「1」がビットプレーンの最後の「1」であることを示す。 OF Fであるとは、最後の「1」ではないことを示す。 Next, the video encoding device 10 performs hierarchical encoding processing (S20). Specifically, the hierarchical encoding unit 26 encodes the scanned DCT coefficients input from the scanning unit 24 to generate an enhancement layer stream, and the generated enhancement layer stream is output to the enhancement layer output unit 28. Output to The hierarchical encoding process (S20) will be described in detail. FIG. 4 is a flowchart showing an example of the hierarchical encoding process (S20). First, the hierarchical encoding unit 26 performs zero run length encoding processing (S30). Specifically, the scanned DCT coefficients input from the zero run length coding unit 30 to the hierarchical code unit 26 from the zero power scan unit 24 are divided into several zero run lengths and bit plane end signals for each bit plane. The combination is zero-run length encoded and output to the variable-length encoder 32. The bit plane end signal being ON indicates that "1" following the zero run signal is the last "1" of the bit plane. The OF F indicates that it is not the last “1”.
[0150] スキャン済 DCT係数を複数のビットプレーンをまとめて零ラン長符号ィ匕してもょ 、。  [0150] A zero run length code may be obtained by putting together a plurality of bit planes into a scanned DCT coefficient.
この場合、零ラン長とビットプレーン終了信号の他に「『0』が続 、た後に何の数字が 出現した力」を表すレベル信号が必要である。例えば、 3ビットプレーンをまとめて零 ラン長符号化した場合、零ランの後には「0」以外の「 1」〜「7」の値が出現する可能 性があるので、どの値が出現したかを示すレベル信号が必要である。本実施の形態 では、説明の簡単のため、零ラン長とビットプレーン終了信号のみを扱って説明する  In this case, in addition to the zero run length and the bit plane end signal, a level signal indicating “a force that“ 0 ”follows and then what number has appeared” is required. For example, when 3 bit planes are grouped together and subjected to zero run length coding, there is a possibility that values “1” to “7” other than “0” may appear after the zero run. A level signal indicating In the present embodiment, only the zero run length and the bit plane end signal are described for simplicity of description.
[0151] 次に、階層符号ィ匕部 26は、状態予測処理を行う(S32)。具体的には、状態予測部 Next, the hierarchical coding unit 26 performs state prediction processing (S32). Specifically, the state prediction unit
34が、スキャン部 24から階層符号ィ匕部 26に入力されたスキャン済 DCT係数力も状 態予測パラメータを生成し、生成された状態予測パラメータを可変長符号化部 32に 出力する。状態予測部 34が生成する状態予測パラメータは、符号化情報の含む幾 つかのビットが成す零ラン長または符号ィ匕情報の含む各ビットが「1」である確率と、ビ ットプレーン終了信号が ONである確率である。  34 also generates a state prediction parameter from the scanned DCT coefficient power input from the scan unit 24 to the hierarchical code pair unit 26, and outputs the generated state prediction parameter to the variable-length coding unit 32. The state prediction parameter generated by the state prediction unit 34 is a zero run length formed by a number of bits included in the encoded information or a probability that each bit included in the code information is “1”, and a bit plane end signal is ON. The probability is
[0152] 図 5は、本発明の状態予測の一例を示す図である。図 5は、あるブロックの DCT係 数を、横軸をスキャン順に、縦軸をビットごとに並べた図である。便宜上、スキャン軸 上での 2点間の距離をスキャン距離と呼び、スキャン軸の左端からの距離をスキャン 座標と呼ぶ。図 5において、領域 Cは、状態予測対象のビット群を示す。領域 Cのビッ ト B4の左端のビットが現在符号ィ匕中のビットである。  FIG. 5 is a diagram showing an example of the state prediction of the present invention. FIG. 5 is a diagram in which the DCT coefficients of a certain block are arranged in the order of scanning on the horizontal axis and in units of bits on the vertical axis. For convenience, the distance between two points on the scan axis is called the scan distance, and the distance from the left edge of the scan axis is called the scan coordinates. Region C in FIG. 5 indicates a bit group to be subjected to state prediction. The leftmost bit of bit B4 in region C is the bit currently in the code.
[0153] 本実施の形態では、可変長符号ィ匕部 32がハフマン符号ィ匕を行う場合の予測につ いて説明する。ハフマン符号ィ匕の場合は、零ラン長を予測する。本実施の形態では 、上位ビットに MSBが現れた DCT係数の符号ィ匕情報、例えば、図 5においてビット B 7、 B8等は「0」と「1」の出現確率が上位に MSBが現れて!/、な!/、DCT係数のビット B 4、 B5などに比べて高ぐ値力「l」である確率を例えば 50%と予測する。逆に、 MSB が現れていない DCT係数の符号ィ匕情報、例えば、図 5においてビット B4、 B5等は、 値が「1」である確率を 50%より低く予測する。 In the present embodiment, prediction in the case where the variable length coding unit 32 performs Huffman coding will be described. In the case of the Huffman code 匕, a zero run length is predicted. In the present embodiment, the sign information of the DCT coefficient in which the MSB appears in the upper bits, for example, the bit B in FIG. 7 and B8 etc. The appearance probability of “0” and “1” appears at the top with MSB appearing! /,! /, DCT coefficient bit B 4, B 5 etc. has higher value “1” The probability is predicted to be 50%, for example. Conversely, the sign information of the DCT coefficient in which the MSB does not appear, for example, bits B4 and B5 in FIG. 5 predict the probability that the value is "1" lower than 50%.
[0154] 一例として、ビット B4から始まる複数のビットの状態を予測する場合を考える。ここで は、 MSBが現れていないビットが「1」である確率を 50%、 MSBが現れたビットが「1」 である確率が 25%として、ビット B5に「0」が続く確率力 0%を下回った時点で連続 する「0」の個数を零ラン長の予測とする。領域 Cの左端のビット B4は上位に MSBが 現れて ヽな 、ので「0」である確率は 75%、その右のビット B5も同様に「0」である確 率は 75%、ゆえに、零ランが 2以上である確率は 75% X 75% = 56. 25%である。 次のビット B6も上位に MSBが現れて!/、な!/、ので「0」である確率は 75%で、「0」が 3 個連続する確率は、 56. 25% X 75% =42. 1875%である。次のビット B7は、上位 に MSBが現れているので「0」である確率は 50%である。従って、ビット B7まで「0」が 続く確率は 21. 09375% < 40%,よって、予測する零ラン長は 3と求められる。  As an example, consider the case of predicting the states of a plurality of bits starting from bit B4. Here, assuming that the probability that the bit without MSB appearing is “1” is 50%, and the probability that the bit with MSB appearing is “1” is 25%, probability that “0” follows bit B5 0% The number of consecutive “0s” is considered as the prediction of zero run length when it falls below. The leftmost bit B4 of the region C is MSB at the upper end and is therefore 、, so the probability of being “0” is 75%, and the probability of the bit B5 to its right is also “0” is 75%. The probability of a run being 2 or more is 75% x 75% = 56. 25%. The next bit B6 also appears MSB at the upper level! /,! /, So the probability of being '0' is 75%, and the probability of 3 consecutive '0's is 56. 25% x 75% = 42 It is 1875%. The next bit B7 has a 50% probability of being "0" because the MSB appears in the upper part. Therefore, the probability that "0" will continue to bit B7 is 21. 09375% <40%, so the predicted zero run length is determined to be 3.
[0155] DCT係数は 0より大きい絶対値を持つことが多ぐ下位ビットプレーンになればなる ほど、それより上位ビットプレーンに MSBが出現している確率が高くなる。よって、下 位ビットプレーンになればなるほど「1」の出現する確率が高くなるので、状態予測部 3 4は、ビット位が低くなればなるほど零ラン長を短く予測してもよい。零ラン長が属する ビットプレーンより上位のビットプレーンが含む MSBの数が多ければ多いほど零ラン 長を短く設定してもよい。  [0155] The lower the bit plane that the DCT coefficient often has an absolute value larger than 0, the higher the probability that the MSB appears in the higher bit plane. Therefore, since the probability that “1” will appear increases as the lower bit plane is reached, the state prediction unit 34 may predict the zero run length to be shorter as the bit position becomes lower. The zero run length may be set shorter as the number of MSBs included in the bit plane higher than the bit plane to which the zero run length belongs increases.
[0156] DCT係数の高周波係数は絶対値が小さいものが多ぐ MSBが現れていない DCT 係数の符号ィ匕情報が「1」である確率は低くなる。よって、零ラン開始ビットのスキャン 座標が大きければ大きいほど「1」のビットが現れる確率が少なくなるので、状態予測 部 34は、予測する零ラン長を長くしてもよい。  The high frequency coefficient of the DCT coefficient has many small absolute values, and the MSB does not appear. The probability that the sign information of the DCT coefficient is “1” is low. Therefore, the larger the scan coordinate of the zero run start bit, the smaller the probability that the bit “1” will appear, and therefore the state prediction unit 34 may extend the zero run length to be predicted.
[0157] ビットプレーン終了信号の予測について説明する。ハフマン符号化の場合、状態予 測部 34は、スキャン軸の後ろのビットほど、ビットプレーン終了信号が ONの確率を高 く予測する。状態予測部 34は符号ィ匕情報に対して、上位ビットプレーンにおいてスキ ヤン軸の最後尾に位置する MSBよりスキャン座標が小さければ ONの確率を低く予 測し、大きければ ONの確率を高く予測する。例えば、図 5において、ビット B1よりス キャン座標が小さければ ONの確率を低ぐ大きければ高く予測する。 The prediction of the bit plane end signal will be described. In the case of Huffman coding, the state prediction unit 34 predicts the probability of the bit plane end signal being ON to be higher as the bit behind the scan axis. The state prediction unit 34 reduces the probability of ON if the scan coordinate is smaller than the MSB located at the end of the scan axis in the upper bit plane with respect to the code information. Measure and predict the probability of ON high if it is large. For example, in FIG. 5, if the scan coordinate is smaller than bit B1, the ON probability is predicted higher if it is lower.
[0158] 符号ィ匕情報に対する上位ビットプレーンにおいて、ある程度大きいスキャン座標の DCT係数が 1を含む場合、符号ィ匕情報の属するビットプレーンに対してビットプレー ン終了信号を予測しなくてもよい。例えば、 8 X 8DCT変換において上位ビットプレー ンのビットプレーン終了信号が 62番目の DCT係数にある場合、それより下位ビットプ レーンではビットプレーン終了信号によって省略できる「0」の数が少なぐビットプレ ーン終了信号を符号ィ匕しないことで符号ィ匕効率を向上することが可能である。  If the DCT coefficient of a somewhat large scan coordinate contains 1 in the upper bit plane for the code information, the bit plane end signal may not be predicted for the bit plane to which the code information belongs. For example, when the bit plane end signal of the upper bit plane is at the 62nd DCT coefficient in the 8 × 8 DCT conversion, the bit plane with a smaller number of “0” can be omitted by the bit plane end signal in lower bit planes. It is possible to improve the coding efficiency by not coding the end signal.
[0159] 本実施の形態の状態予測部 34は、符号ィ匕情報の状態の予測の際に、符号化情報 の属するブロック内の上位ビットプレーンのみを使って予測を行う。これにより、映像 ストリームを復号ィ匕するユーザが特定ブロックのみを映像を復号ィ匕した 、と 、う場合 に、特定ブロックのみの拡張レイヤを復号ィ匕するだけでよいので、全体を復号化する より復号ィ匕の処理負荷を小さくできる。また、映像ストリームの一部のみをネットワーク 上に伝送するなどしてビットレートを節約することが可能である。  When predicting the state of code information, the state prediction unit 34 according to the present embodiment performs prediction using only the upper bit plane in the block to which the coding information belongs. As a result, in the case where the user who decodes the video stream decodes only the specific block, it is sufficient to decode the enhancement layer of the specific block only, so the whole is decoded. The processing load of decoding can be further reduced. In addition, it is possible to save the bit rate by transmitting only a part of the video stream over the network.
[0160] 次に、階層符号ィ匕部 26は、可変長符号化処理を行う(S34)。具体的には、可変長 符号ィ匕部 32が、零ラン長符号ィ匕部 30から入力された零ラン長およびビットプレーン 終了信号の組み合わせを、状態予測部 34から入力された状態予測パラメータを用 いて可変長符号化する。階層符号ィ匕部 26は、符号化によって生成された拡張レイヤ ストリームを拡張レイヤ出力部 28に出力する。  Next, the hierarchical coding unit 26 performs variable-length coding processing (S34). Specifically, the variable length code input unit 32 combines the zero run length and bit plane end signal combination input from the zero run length code input unit 30 with the state prediction parameter input from the state prediction unit 34. Use variable-length coding. The layer coding unit 26 outputs the enhancement layer stream generated by the encoding to the enhancement layer output unit 28.
[0161] ここで、可変長符号化部 32が、ハフマン符号ィ匕を行う場合について説明する。この 場合、映像符号化装置 10は、あら力じめ、複数のハフマンテーブルを持つ。ハフマ ンテーブルによって、出現頻度の高い組み合わせに短い符号を、高い組み合わせ に長い符号を割り当て、符号化効率を向上させる。  Here, the case where the variable-length coding unit 32 performs the Huffman coding will be described. In this case, the video encoding device 10 has a plurality of Huffman tables in advance. The coding table is improved by assigning short codes to frequently appearing combinations and long codes to high combinations.
[0162] よって、可変長符号化部 32は、零ラン長が長 、と予測された組み合わせを符号ィ匕 する場合、長 、零ラン長を持つ組み合わせに短 、符号を割り当てたハフマンテープ ルを選択して、ハフマン符号化する。逆に、零ラン長が短いと予測された組み合わせ を符号化する場合、短 、零ラン長を持つ組み合わせに短 、符号を割り当てたハフマ ンテーブルを選択して、ハフマン符号ィ匕する。 [0163] 可変長符号化部 32は、ビットプレーン終了信号が OFFである可能性が高いと予測 された組み合わせを符号ィ匕する場合、ビットプレーン終了信号力 OFFである組み合 わせに短い符号を割り当てたノヽフマンテーブルを選択して、ハフマン符号化する。逆 に、ビットプレーン終了信号力 ONである可能性と予測されたが高 、組み合わせを符 号化する場合、ビットプレーン終了信号が ONである組み合わせに短 、符号を割り当 てたノヽフマンテーブルを選択して、ハフマン符号ィ匕する。以上、階層符号化処理(S 20)について説明した。 Therefore, when coding a combination in which the zero run length is predicted to be long, the variable-length coding unit 32 uses a Huffman tape in which a short code is assigned to a combination having a length and a zero run length. Select and Huffman encode. Conversely, when coding a combination predicted to have a short zero run length, the Huffman table is selected by selecting a short and a Huffman table to which a short code is assigned to a combination having a zero run length. [0163] When coding a combination predicted that the bit plane end signal is likely to be OFF, the variable-length coding unit 32 sets a short code to the combination in which the bit plane end signal power is OFF. Select the assigned Nomfman table and perform Huffman coding. Conversely, when it is predicted that the bit plane end signal power is likely to be on, but the combination is encoded as high, the bit plane end signal of the combination where the bit plane end signal is ON is assigned a short code. Select and Huffman code. The hierarchical encoding process (S20) has been described above.
[0164] 図 3に戻って、映像符号化装置 10は、ストリーム出力処理を行う(S22)。具体的に は、基本レイヤ出力部 16が、基本レイヤ符号ィ匕部 14から入力された基本レイヤストリ ームを映像符号ィ匕装置 10の外部に出力する。拡張レイヤ出力部 28が、拡張レイヤ 符号ィ匕部 18から入力された拡張レイヤストリームを映像符号ィ匕装置 10の外部に出 力する。  Returning to FIG. 3, the video encoding device 10 performs stream output processing (S22). Specifically, the base layer output unit 16 outputs the base layer stream input from the base layer code input unit 14 to the outside of the video code input unit 10. The enhancement layer output unit 28 outputs the enhancement layer stream input from the enhancement layer coding unit 18 to the outside of the video coding unit 10.
[0165] 次に、映像符号化装置 10は、終了判定処理を行う(S24)。具体的には、映像信号 入力部 12が、映像符号化装置 10の外部から入力する映像の有無を判定する。その 判定の結果、映像の入力がなければ処理を終了し、映像の入力があれば映像入力 処理 (S 10)の処理に戻る。以上、本発明の第 1の実施の形態の映像符号化装置に ついて説明した。  Next, the video encoding device 10 performs an end determination process (S24). Specifically, the video signal input unit 12 determines the presence or absence of the video input from the outside of the video encoding device 10. As a result of the determination, if there is no video input, the processing is ended, and if there is a video input, the processing returns to the video input processing (S10). The video coding apparatus according to the first embodiment of the present invention has been described above.
[0166] 第 1の実施の形態の映像符号化装置 10は、拡張レイヤの符号化情報に対して、拡 張レイヤのより上位ビットプレーンの MSBの状態を用いることによって、零ラン長や「 1」である確率を予測し、また、ビットプレーン終了信号が ONである確率を予測し、予 測をもとにしてハフマン符号ィ匕に用いるハフマンテーブルを入れ替えるにより、可変 長符号ィ匕の符号ィ匕効率を向上することが可能である。  [0166] The video encoding device 10 according to the first embodiment uses the state of the MSB of the higher bit plane of the enhancement layer for the encoding information of the enhancement layer to set the zero run length or “1 By predicting the probability that the bit plane end signal is ON, and replacing the Huffman table used for the Huffman code based on the prediction, the code of the variable-length code It is possible to improve the chewing efficiency.
[0167] 第 1の実施の形態によれば、映像符号化装置 10は、符号ィヒ情報の状態を同じプロ ックに属する上位ビットプレーンから予測するので、ブロック間の独立性が保たれ、復 号ィ匕を行うユーザの注目するブロックのみを符号ィ匕できる。これにより、符号化の処 理負荷を軽くすることが可能である。注目するブロックに該当する映像ストリームのみ を伝送することによりビットレートを節約することも可能である。  According to the first embodiment, since the video encoding device 10 predicts the state of code information from the upper bit plane belonging to the same block, independence between blocks is maintained. It is possible to code only the block of interest of the user performing decoding. This can reduce the coding processing load. It is also possible to save bitrate by transmitting only the video stream that corresponds to the block of interest.
[0168] 上記した第 1の実施の形態では、可変長符号ィ匕部 32においてハフマン符号ィ匕を行 う例について説明したが、可変長符号ィ匕部 32は算術符号ィ匕を行ってもよい。算術符 号ィ匕を行う場合の状態予測について説明する。 [0168] In the first embodiment described above, the variable length coding unit 32 The variable-length code unit 32 may perform arithmetic coding. State prediction in the case of arithmetic coding will be described.
[0169] 算術符号化の場合は、符号ィ匕情報が「1」である確率を予測する。状態予測部 34は 、上位に MSBが現れた DCT係数の下位ビットが「1」である確率は 50%と予測する。 それ以外の符号化情報は、下位のビットプレーンほど「1」である確率を高くする。符 号化情報が属するビットプレーンより上位のビットプレーンが含む MSBの数が多けれ ば多 、ほど「1」である確率を高くしてもょ 、。零ラン開始ビットのスキャン座標が大き ければ大きいほど、「1」のビットが現れる確率が少なくしてもよい。  In the case of arithmetic coding, it predicts the probability that the code information is “1”. The state prediction unit 34 predicts that the probability that the low-order bit of the DCT coefficient in which the MSB appears at the high order is "1" is 50%. The other coding information has a higher probability of being "1" as the lower bit-planes. If the number of MSBs included in the bit plane higher than the bit plane to which the encoding information belongs is large, the probability of being “1” may be increased. The larger the scan coordinates of the zero run start bit, the smaller the probability that a "1" bit will appear may be.
[0170] 算術符号化によって符号化を行う場合、状態予測部 34は、ビットごとにビットプレー ン終了信号が ONである確率を予測する。ハフマン符号ィ匕の場合と同様に、上位ビッ トプレーンにおいてスキャン軸の最後尾に位置する MSBよりスキャン座標が小さけれ ば ONの確率を低く予測し、大きければ ONの確率を高く予測する。例えば、図 5にお いて、ビット B1よりスキャン座標が小さければ ONの確率を低ぐ大きければ高く予測 する。  When encoding is performed by arithmetic coding, the state prediction unit 34 predicts the probability that the bit plane end signal is ON for each bit. As in the case of the Huffman code, if the scan coordinate is smaller than the MSB located at the end of the scan axis in the upper bit plane, the probability of ON is predicted low, and if it is large, the probability of ON is predicted high. For example, in FIG. 5, if the scan coordinate is smaller than bit B1, the ON probability is predicted high if it is too low.
[0171] 可変長符号化部 32は、状態予測部 34にて生成された状態予測パラメータを用い て可変長符号ィ匕を行う。ここで用いられる状態予測パラメータは、算術符号化で必要 な符号ィ匕する記号の出現確率として、符号ィ匕情報が「1」である確率とビットプレーン 終了信号が ONである確率を用いる。符号化情報が「0」である力「1」であるかを 2種 類の記号で算術符号化し、「1」である符号ィ匕情報のビットプレーン終了信号が ONで あるカゝ OFFであるかを 2種類の記号で別に算術符号ィ匕してもょ ヽ。符号化情報が「0 」であるかビットプレーン終了信号が OFFの「1」であるかビットプレーン終了信号が O Nの「 1」であるかを 3種類の記号で算術符号ィ匕してもょ 、。  The variable-length coding unit 32 performs variable-length coding using the state prediction parameters generated by the state prediction unit 34. As the state prediction parameter used here, the probability that the code information is “1” and the probability that the bit plane end signal is ON are used as the appearance probability of the symbol necessary for the arithmetic coding. It is arithmetically encoded with two kinds of symbols whether the encoding information is “0” or “1”, and the bit plane end signal of the encoding information “1” is ON. OFF is OFF If you use two different symbols, you need to use another arithmetic code. Whether the encoding information is “0” or whether the bit plane end signal is “1” of OFF or the bit plane end signal is “1” of ON can be arithmetically coded with three kinds of symbols. ,.
[0172] (第 2の実施の形態)  Second Embodiment
第 2の実施の形態では、復号化中の情報である復号化情報に対して、先に復号ィ匕 した復号化済情報から復号化情報の状態を表す状態予測パラメータを予測し、その 状態予測パラメータに基づいて復号ィ匕を行う映像復号ィ匕装置について説明する。第 2の実施の形態では、第 1の実施の形態の映像符号ィ匕装置で生成した映像ストリー ムを復号ィ匕する映像復号ィ匕装置について説明する。 [0173] 図 6は第 2の実施の形態に係る映像復号ィ匕装置 40の構成を示すブロック図、図 7 は階層復号化部 50の構成を示す図である。図 6において、映像復号化装置 40は、 基本レイヤ入力部 42、基本レイヤ復号化部 44、拡張レイヤ入力部 46,拡張レイヤ復 号化部 48、映像信号出力部 58を有する。拡張レイヤ復号ィ匕部 48は、階層復号化部 50、逆スキャン部 52、逆 DCT部 54、加算部 56を有する。 In the second embodiment, a state prediction parameter representing the state of the decoded information is predicted from the decoded information decoded earlier with respect to the decoded information which is information being decoded, and the state prediction is performed. A video decoding apparatus that performs decoding based on parameters will be described. In the second embodiment, a video decoding apparatus for decoding a video stream generated by the video coding apparatus of the first embodiment will be described. FIG. 6 is a block diagram showing the configuration of a video decoding / decoding device 40 according to the second embodiment, and FIG. 7 is a diagram showing the configuration of the hierarchical decoding unit 50. In FIG. 6, the video decoding device 40 has a base layer input unit 42, a base layer decoding unit 44, an enhancement layer input unit 46, an enhancement layer decoding unit 48, and a video signal output unit 58. The enhancement layer decoding unit 48 includes a hierarchical decoding unit 50, an inverse scan unit 52, an inverse DCT unit 54, and an addition unit 56.
[0174] 基本レイヤ入力部 42は、映像復号ィ匕装置 40の外部力も基本レイヤストリームを入 力し、基本レイヤ復号ィ匕部 44に出力する。基本レイヤ入力部 42は、外部から基本レ ィャストリームの入力の有無を判定し、基本レイヤストリームの入力がなければ処理を 終了する。  [0174] Base layer input section 42 also inputs the base layer stream as an external force of video decoding / decoding apparatus 40, and outputs the same to base layer decoding section 44. The base layer input unit 42 determines the presence or absence of the input of the base layer stream from the outside, and ends the processing if there is no input of the base layer stream.
[0175] 基本レイヤ復号ィ匕部 44は、基本レイヤ入力部 42から入力された基本レイヤストリー ムを復号化し基本レイヤ復号化画像を生成し、生成された基本レイヤ復号化画像を 拡張レイヤ復号ィ匕部 48と映像信号出力部 58に出力する。  [0175] Base layer decoding section 44 decodes the base layer stream input from base layer input section 42 to generate a base layer decoded image, and the generated base layer decoded image is subjected to enhancement layer decoding. It is output to the head unit 48 and the video signal output unit 58.
[0176] 拡張レイヤ入力部 46は、映像復号ィ匕装置 40の外部力も拡張レイヤストリームを入 力し、拡張レイヤ復号ィ匕部 48に出力する。 [0176] The enhancement layer input unit 46 also receives an enhancement layer stream as an external force of the video decoding / decoding device 40 and outputs the enhancement layer stream to the enhancement layer decoding unit 48.
[0177] 映像信号出力部 58は、基本レイヤ復号化部 44から入力された基本レイヤ復号ィ匕 画像と、拡張レイヤ復号化部 48から入力された拡張レイヤ復号化画像を映像復号化 装置 40の外部に出力する。 The video signal output unit 58 includes the base layer decoded image input from the base layer decoding unit 44 and the enhancement layer decoded image input from the enhancement layer decoding unit 48 in the video decoding apparatus 40. Output to the outside.
[0178] 階層復号ィ匕部 50において、逆スキャン部 52は、階層復号ィ匕部 50から入力された スキャン済 DCT係数を定められた川頁番に並べ替える逆スキャンを行って DCT係数を 生成し、生成された DCT係数を逆 DCT部 54に出力する。 [0178] In hierarchical decoding unit 50, reverse scanning unit 52 performs reverse scanning to rearrange scanned DCT coefficients input from hierarchical decoding unit 50 into defined river page numbers to generate DCT coefficients. And outputs the generated DCT coefficients to the inverse DCT unit 54.
[0179] 逆 DCT部 54は、逆スキャン部 52から入力された DCT係数をブロックごとに逆 DCT を施してブロックを合成することによって差分復号化画像を生成する。逆 DCT部 54 は、生成された差分復号化画像を加算部 56に出力する。 [0179] The inverse DCT unit 54 generates a differentially decoded image by performing inverse DCT on the block basis and applying DCT coefficients input from the inverse scan unit 52 to the block. The inverse DCT unit 54 outputs the generated differentially decoded image to the addition unit 56.
[0180] 加算部 56は、基本レイヤ復号部 44から入力された基本レイヤ復号化画像と、逆 D[0180] The addition unit 56 adds the inverse of the base layer decoded image input from the base layer decoding unit 44 and the inverse D
CT部 54から入力された差分復号化画像とを加算して拡張レイヤ復号化画像を生成 する。加算部 56は、生成された拡張レイヤ復号ィ匕画像を映像信号出力部 58に出力 する。 The differential decoded image input from the CT unit 54 is added to generate an enhancement layer decoded image. The addition unit 56 outputs the generated enhancement layer decoded 匕 image to the video signal output unit 58.
[0181] 図 7は、第 2の実施の形態における階層復号ィ匕部 50の構成を示すブロック図であ る。階層復号化部 50は、可変長復号化部 60、零ラン長復号化部 62、状態予測部 6 4を有する。 FIG. 7 is a block diagram showing a configuration of hierarchical decoding unit 50 in the second embodiment. Ru. The hierarchical decoding unit 50 includes a variable-length decoding unit 60, a zero run length decoding unit 62, and a state prediction unit 64.
[0182] 可変長復号化部 60は、状態予測部 64から入力された状態予測パラメータを用い て、拡張レイヤ入力部 46から入力された拡張レイヤストリームを可変長復号ィ匕して、 零ラン長とビットプレーン終了信号の組み合わせを生成する。可変長復号化部 60は 、生成された零ラン長とビットプレーン終了信号の組み合わせを零ラン長復号ィ匕部 6 2に出力する。  The variable-length decoding unit 60 performs variable-length decoding on the enhancement layer stream input from the enhancement layer input unit 46 using the state prediction parameters input from the state prediction unit 64 to obtain a zero run length. Generate a combination of and the bit plane end signal. The variable length decoding unit 60 outputs the combination of the generated zero run length and bit plane end signal to the zero run length decoding unit 62.
[0183] 零ラン長復号ィ匕部 62は、可変長復号ィ匕部 60から入力された零ラン長とビットプレ ーン終了信号の組み合わせを復号ィ匕し、スキャン済 DCT係数を生成する。零ラン長 復号ィ匕部 60は、生成されたスキャン済 DCT係数を逆スキャン部 52に出力する。  The zero run length decoding unit 62 decodes the combination of the zero run length and the bit plane end signal input from the variable length decoding unit 60 to generate a scanned DCT coefficient. The zero run length decoding unit 60 outputs the generated scanned DCT coefficients to the reverse scan unit 52.
[0184] 状態予測部 64は、零ラン長復号ィ匕部 62から入力されたスキャン済 DCT係数を用 いて、可変長復号ィ匕部 60が復号ィ匕中の零ラン長または復号ィ匕情報が「1」である確 率とビットプレーン終了信号の状態を予測して状態予測パラメータを生成する。状態 予測部 60は、生成された状態予測パラメータを可変長復号ィ匕部 60に出力する。  The state prediction unit 64 uses the scanned DCT coefficients input from the zero run length decoding unit 62, and the zero run length or the decoded information during the variable length decoding unit 60 is decoded. The state prediction parameter is generated by predicting the state of the bit plane end signal and the probability that “1” is “1. The state prediction unit 60 outputs the generated state prediction parameter to the variable-length decoding unit 60.
[0185] 以上のように構成された映像復号ィ匕装置 40の動作を説明する。  The operation of the video decoding apparatus 40 configured as described above will be described.
図 8は、第 2の実施の形態の映像復号ィ匕装置 40の動作の一例を表すフローチヤ一 トである。なお、図 8に示すフローチャートは、図示しない記憶装置(例えば ROMや フラッシュメモリなど)に格納された制御プログラムを、同じく図示しない CPUが実行 することにより、プログラムの実行によりソフトウェア的に実行されるようにすることも可 能である。  FIG. 8 is a flowchart showing an example of the operation of the video decoding device 40 according to the second embodiment. Note that the flowchart shown in FIG. 8 is executed as software by executing a control program stored in a storage device (for example, ROM, flash memory, etc.) (not shown) by the CPU (not shown). It is also possible to
[0186] まず、映像復号化装置 40は、ストリーム入力処理を行う(S40)。具体的には、基本 レイヤ入力部 42が、映像復号ィ匕装置 40の外部力も基本レイヤストリームを入力し、 基本レイヤ復号ィ匕部 44に出力する。拡張レイヤ入力部 46が、映像復号化装置 40の 外部から拡張レイヤストリームを入力し、拡張レイヤ復号ィ匕部 48に出力する。  First, the video decoding apparatus 40 performs stream input processing (S40). Specifically, the base layer input unit 42 also inputs the base layer stream as an external force of the video decoding apparatus 40 and outputs the base layer stream to the base layer decoding unit 44. The enhancement layer input unit 46 receives the enhancement layer stream from the outside of the video decoding device 40 and outputs the stream to the enhancement layer decoding unit 48.
[0187] 次に、映像復号化装置 40は、基本レイヤ復号化処理を行う(S42)。具体的には、 基本レイヤ復号ィ匕部 44が、基本レイヤ入力部 42から入力された基本レイヤストリーム を復号化して基本レイヤ復号化画像を生成する。基本レイヤ復号化部 44は、生成さ れた基本レイヤ復号化画像を、拡張レイヤ復号ィ匕部 48と映像信号出力部 58に出力 する。 Next, the video decoding apparatus 40 performs base layer decoding processing (S42). Specifically, the base layer decoding unit 44 decodes the base layer stream input from the base layer input unit 42 to generate a base layer decoded image. The base layer decoding unit 44 outputs the generated base layer decoded image to the enhancement layer decoding unit 48 and the video signal output unit 58. Do.
[0188] 次に、映像復号化装置 40は、階層符号化処理を行う(S44)。具体的には、拡張レ ィャ復号ィ匕部 48が、拡張レイヤ入力部 46から入力された拡張レイヤストリームを復号 化して拡張レイヤ復号化画像を生成する。拡張レイヤ復号化部 48は、生成された拡 張レイヤ復号ィ匕画像を映像信号出力部 58に出力する。  Next, the video decoding apparatus 40 performs hierarchical coding processing (S44). Specifically, the enhancement layer decoding unit 48 decodes the enhancement layer stream input from the enhancement layer input unit 46 to generate an enhancement layer decoded image. The enhancement layer decoding unit 48 outputs the generated enhancement layer decoded image to the video signal output unit 58.
[0189] 図 9は、階層復号ィ匕処理 (S44)の一例を示すフローチャートである。まず、階層復 号ィ匕部 50は、状態予測処理を行う(S60)。具体的には、状態予測部 64が、零ラン 長復号ィ匕部 62から入力されたスキャン済 DCT係数を用いて、可変長復号化部 60が 復号ィ匕中の零ラン長または復号ィ匕情報が「1」である確率とビットプレーン終了信号の 状態を予測して状態予測パラメータを生成し、生成された状態予測パラメータを可変 長復号ィ匕部 60に出力する。状態予測部 64が行う予測は、第 1の実施の形態で行つ たものと同様であり、状態予測部 64が生成する状態予測パラメータは、復号化情報 を含む連続するビットが成す零ラン長または符号ィ匕情報の含む各ビットが「1」である 確率と、ビットプレーン終了信号力 SONである確率である。状態予測部 64は、復号ィ匕 情報の状態を予測するために、常に最新のスキャン済 DCT係数を零ラン長復号ィ匕 部 62から入力する必要がある。  FIG. 9 is a flowchart illustrating an example of the hierarchical decoding process (S44). First, the hierarchy decoding unit 50 performs state prediction processing (S60). Specifically, using the scanned DCT coefficients input from the zero run length decoding unit 62, the state prediction unit 64 uses the scanned DCT coefficients to perform zero run length during decoding or decoding performed by the variable length decoding unit 60. The state prediction parameter is generated by predicting the probability that the information is “1” and the state of the bit plane end signal, and the generated state prediction parameter is output to the variable length decoding unit 60. The prediction performed by the state prediction unit 64 is the same as that performed in the first embodiment, and the state prediction parameter generated by the state prediction unit 64 is a zero run length formed by consecutive bits including decoding information. Or the probability that each bit included in the code information is “1” and the probability that the bit plane end signal strength is SON. The state prediction unit 64 needs to always input the latest scanned DCT coefficient from the zero run length decoding unit 62 in order to predict the state of the decoded information.
[0190] 次に、階層復号ィ匕部 50は、可変長復号化処理を行う(S62)。具体的には、可変長 復号化部 60が、状態予測部 64から入力された状態予測パラメータを用いて、拡張レ ィャ入力部 46からに入力された拡張レイヤストリームを可変長復号ィ匕して零ラン長と ビットプレーン終了信号の組み合わせを生成する。可変長復号化部 60は、生成され た零ラン長とビットプレーン終了信号の組み合わせを零ラン長復号ィ匕部 62に出力す る。状態予測パラメータに基づいて可変長復号ィ匕に用いるハフマンテーブルを選択 する方法は、第 1の実施の形態と同じである。  [0190] Next, the hierarchical decoding unit 50 performs variable-length decoding processing (S62). Specifically, the variable-length decoding unit 60 performs variable-length decoding on the enhancement layer stream input from the enhancement layer input unit 46 using the state prediction parameters input from the state prediction unit 64. Generate a combination of zero run length and bit plane end signal. The variable length decoding unit 60 outputs the combination of the generated zero run length and bit plane end signal to the zero run length decoding unit 62. The method of selecting a Huffman table used for variable length decoding based on the state prediction parameter is the same as that of the first embodiment.
[0191] 次に、階層符号ィ匕部 50は、零ラン長復号化処理を行う (S64) 0具体的には、零ラ ン長復号化部 62が、可変長復号ィ匕部 60から入力された零ラン長とビットプレーン終 了信号の組み合わせを復号化し、スキャン済 DCT係数を生成する。零ラン長復号化 部 62は、生成されたスキャン済 DCT係数を逆スキャン部 52に出力する。以上、階層 復号化処理 (S44)について説明した。 [0192] 図 8に戻って、映像復号ィ匕装置 40は逆スキャン処理を行う(S46)。具体的には、逆 スキャン部 52が、階層復号ィ匕部 52から入力されたスキャン済 DCT係数を定められた 順番に並べ替える逆スキャンを行って DCT係数を生成し、生成された DCT係数を 逆 DCT部 54に出力する。 [0191] Next, layer coding section 50 performs zero run length decoding processing (S64) 0 Specifically, zero run length decoding section 62 receives an input from variable length decoding section 60. The combination of the zero run length and the bit plane end signal is decoded to generate scanned DCT coefficients. The zero run length decoding unit 62 outputs the generated scanned DCT coefficients to the reverse scan unit 52. The hierarchical decoding process (S44) has been described above. Returning to FIG. 8, the video decoding device 40 performs reverse scan processing (S46). Specifically, the reverse scan unit 52 performs reverse scan to rearrange the scanned DCT coefficients input from the hierarchical decoding unit 52 in a predetermined order to generate DCT coefficients, and generates the generated DCT coefficients. Output to the inverse DCT unit 54.
[0193] 次に、映像復号ィ匕装置 40は逆 DCT処理を行う(S48)。具体的には、逆 DCT部 54 1S 逆スキャン部 52から入力された DCT係数をブロックごとに逆 DCTを施し、逆 DC T変換されたブロックを合成して差分復号化画像を生成する。逆 DCT部 54は、生成 された差分復号化画像を加算部 56に出力する。  Next, the video decoding / decoding device 40 performs inverse DCT processing (S48). Specifically, the DCT coefficient input from the inverse DCT unit 54 1S inverse scan unit 52 is subjected to inverse DCT for each block, and the inverse DCT converted blocks are synthesized to generate a differentially decoded image. The inverse DCT unit 54 outputs the generated differentially decoded image to the addition unit 56.
[0194] 次に、映像復号化装置 40は加算処理を行う(S50)。具体的には、加算部 56が、 基本レイヤ復号部 44が拡張レイヤ復号ィ匕部 48に入力した基本レイヤ復号ィ匕画像と 、逆 DCT部 54から入力された差分復号化画像とを加算して拡張レイヤ復号化画像 を生成する。加算部 56は、生成された拡張レイヤ復号ィ匕画像を映像信号出力部 58 に出力する。  Next, the video decoding apparatus 40 performs addition processing (S50). Specifically, the addition unit 56 adds the base layer decoded image input to the enhancement layer decoding unit 48 by the base layer decoding unit 44 and the differentially decoded image input from the inverse DCT unit 54. To generate an enhancement layer decoded image. The addition unit 56 outputs the generated enhancement layer decoded image to the video signal output unit 58.
[0195] 次に、映像復号ィ匕装置 40は映像信号出力処理を行う(S52)。具体的には、映像 信号出力部 58が、基本レイヤ復号化部 44から入力された基本レイヤ復号化画像と、 拡張レイヤ復号ィ匕部 48から入力された拡張レイヤ復号ィ匕画像を映像復号ィ匕装置 40 の外部に出力する。映像信号出力部 58は、基本レイヤ復号化画像か拡張レイヤ復 号ィ匕画像のどちらか一方のみを外部に出力してもよい。  Next, the video decoding / decoding device 40 performs video signal output processing (S 52). Specifically, the video signal output unit 58 decodes the base layer decoded image input from the base layer decoding unit 44 and the enhancement layer decoded image input from the enhancement layer decoding unit 48. Output to the outside of the device 40. The video signal output unit 58 may output only one of the base layer decoded image and the enhancement layer decoded image or the outer layer image to the outside.
[0196] 次に、映像復号化装置 40は、終了判定処理を行う (S54) 0具体的には、基本レイ ャ入力部 42が外部から基本レイヤストリームの入力の有無を判定する。その結果、 基本レイヤストリームの入力がなければ処理を終了し、そうでなければストリーム入力 処理 (S40)に戻る。以上、本発明の第 2の実施の形態の映像復号ィ匕装置 40につい て説明した。 Next, the video decoding apparatus 40 performs an end determination process (S 54) 0 Specifically, the basic layer input unit 42 externally determines the presence / absence of the input of the base layer stream. As a result, if there is no input of the base layer stream, the processing ends, and if not, it returns to the stream input processing (S40). The video decoding apparatus 40 according to the second embodiment of the present invention has been described above.
[0197] 第 2の実施の形態によれば、映像復号化装置 40は、復号ィ匕中のビットより上位のビ ットプレーンの MSBの状態を用いて、符号化ビットの零ラン長、値が「1」である確率、 あるいはビットプレーン終了信号が ONである確率を予測し、予測を元にしてハフマ ン復号ィ匕に用いるハフマンテーブルを入れ替えて、復号化情報を復号化するための 符号量を少なくすることができ、画像の画質を向上することが可能である。 [0198] 第 2の実施の形態によれば、映像復号化装置 40は、復号ィヒ情報の状態を同じプロ ックに属する上位ビットプレーンから予測するので、ブロック間の独立性が保たれ、復 号ィ匕を行うユーザの注目するブロックのみを復号ィ匕できる。これにより、復号化の処 理負荷を軽くすることが可能である。注目するブロックに該当する映像ストリームのみ を伝送することにより、ビットレートを節約することも可能である。 According to the second embodiment, the video decoding apparatus 40 uses the state of the MSB of the bit plane higher than the bit in the decoding table to set the zero run length of the coded bit and the value “0”. The code amount for decoding the decoded information is calculated by predicting the probability of being “1” or the probability that the bit plane end signal is ON, replacing the Huffman table used for the Huffman decoding based on the prediction. It is possible to reduce the number and to improve the image quality of the image. According to the second embodiment, since the video decoding apparatus 40 predicts the state of the decoded information from the upper bit plane belonging to the same block, independence between blocks is maintained. It is possible to decode only the block of interest of the user performing decoding. This makes it possible to lighten the processing load of decryption. It is also possible to save the bit rate by transmitting only the video stream that corresponds to the block of interest.
[0199] 上記した第 2の実施の形態では、復号化方式としてハフマン復号化を用いる例に ついて説明したが、符号ィ匕データが算術符号ィ匕によって生成されている場合には、 可変長復号ィ匕部 60は算術復号ィ匕を行う。この場合、予測したビットの状態に基づい て、算術復号ィ匕に用いるシンボルの出現確率を変更して、算術復号化を行う。  In the second embodiment described above, an example in which Huffman decoding is used as the decoding method has been described. However, if the code data is generated by arithmetic code, variable length decoding is performed. A section 60 performs arithmetic decoding. In this case, arithmetic decoding is performed by changing the appearance probability of symbols used for arithmetic decoding based on the predicted bit state.
[0200] (第 3の実施の形態)  Third Embodiment
第 3の実施の形態では、スケーラブル映像符号ィ匕において、先に符号化した基本 レイヤの情報を用いて拡張レイヤの状態を予測して符号ィ匕を行う映像符号ィ匕装置に ついて説明する。先に符号ィ匕して基本レイヤの映像ストリームに格納した情報に基づ V、て拡張レイヤの状態を表す状態予測パラメータを生成し、生成された状態予測パ ラメータに基づいて符号ィ匕情報を符号ィ匕する。  In the third embodiment, in a scalable video coding system, a video coding system will be described which predicts the state of the enhancement layer using the information of the base layer encoded earlier and performs coding. Based on the information previously encoded and stored in the video stream of the base layer V, a state prediction parameter representing the state of the enhancement layer is generated, and based on the generated state prediction parameter, the code information is Sign it.
[0201] 図 10は、本発明の第 3の実施の形態に係る映像符号ィ匕装置 70の構成を示すプロ ック図である。図 10において、映像符号化装置 70は、映像信号入力部 72、基本レイ ャ符号ィ匕部 74、基本レイヤ出力部 76、拡張レイヤ符号ィ匕部 78、拡張レイヤ出力部 9 0を有する。拡張レイヤ符号ィ匕部 78は、差分部 80、 DCT部 82、スキャン部 84、階層 符号化部 86、状態予測部 88を有する。第 3の実施の形態の映像符号化装置 70の 構成は第 1の実施の形態の映像符号ィ匕装置 10と基本的には同じであるが、第 3の実 施の形態では階層符号ィ匕部 86と状態予測部 88の機能が第 1の実施の形態と異なる  FIG. 10 is a block diagram showing the configuration of a video coding apparatus 70 according to the third embodiment of the present invention. In FIG. 10, the video coding device 70 has a video signal input unit 72, a basic layer coding unit 74, a base layer output unit 76, an enhancement layer coding unit 78, and an enhancement layer output unit 90. The enhancement layer coding unit 78 includes a difference unit 80, a DCT unit 82, a scan unit 84, a layer coding unit 86, and a state prediction unit 88. The configuration of the video encoding device 70 according to the third embodiment is basically the same as the video encoding device 10 according to the first embodiment, but in the third embodiment, the hierarchical encoding is performed. The functions of the unit 86 and the state prediction unit 88 are different from those of the first embodiment.
[0202] 映像信号入力部 72は、映像符号ィ匕装置 70の外部から映像を 1フレームずつ原画 像として入力し、基本レイヤ符号ィ匕部 74と拡張レイヤ符号ィ匕部 78に出力する。映像 符号化装置 70の外部から入力する映像の有無を判定し、映像の入力がなければ処 理を終了する。 The video signal input unit 72 receives an image as an original image frame by frame from the outside of the video code processing apparatus 70 and outputs the video to the base layer code processing unit 74 and the enhancement layer coding unit 78. It is determined whether there is a video input from the outside of the video encoding device 70, and if there is no video input, the processing is terminated.
[0203] 基本レイヤ符号ィ匕部 74は、映像信号入力部 72から入力された原画像を符号化し て基本レイヤストリームを生成し、生成された基本レイヤストリームを基本レイヤ出力 部 76に出力する。また、基本レイヤ符号ィ匕部 74は、基本レイヤストリームを復号化し て基本レイヤ復号化画像を生成し、生成された基本レイヤ復号化画像を拡張レイヤ 符号ィ匕部 78に出力する。 [0203] Base layer coding section 74 encodes the original image input from video signal input section 72. The base layer stream is generated, and the generated base layer stream is output to the base layer output unit 76. Further, base layer coding section 74 decodes the base layer stream to generate a base layer decoded image, and outputs the generated base layer decoded image to enhancement layer coding section 78.
[0204] 基本レイヤ出力部 76は、基本レイヤ符号ィ匕部 74から入力された基本レイヤストリー ムを映像符号ィ匕装置 70の外部に出力する。 The base layer output unit 76 outputs the base layer stream input from the base layer coding unit 74 to the outside of the video coding unit 70.
[0205] 拡張レイヤ出力部 90は、拡張レイヤ符号ィ匕部 78から入力された拡張レイヤストリー ムを映像符号ィ匕装置 70の外部に出力する。 [0205] The enhancement layer output unit 90 outputs the enhancement layer stream input from the enhancement layer coding unit 78 to the outside of the video coding unit 70.
[0206] 差分部 80は、映像信号入力部 72から入力された原画像と基本レイヤ符号化部 74 から入力された基本レイヤ復号化画像との差分をとつて差分画像を生成し、生成され た差分画像を DCT部 82に出力する。 The difference unit 80 generates a difference image by calculating the difference between the original image input from the video signal input unit 72 and the base layer decoded image input from the base layer coding unit 74. The difference image is output to the DCT unit 82.
[0207] DCT部 82は、差分部 80から入力された差分画像を 8 X 8画素ごとの領域であるブ ロックに分割し、ブロックごとに DCT変換を施して DCT係数を生成し、生成された DThe DCT unit 82 divides the differential image input from the differential unit 80 into blocks which are areas of 8 × 8 pixels, performs DCT transform for each block, and generates DCT coefficients. D
CT係数をスキャン部 84に出力する。 The CT coefficient is output to the scan unit 84.
[0208] スキャン部 84は、 DCT部 82から入力された DCT係数を定められた順番でスキャン してスキャン済 DCT係数を生成し、生成されたスキャン済 DCT係数を階層符号ィ匕部[0208] The scan unit 84 scans the DCT coefficients input from the DCT unit 82 in a predetermined order to generate scanned DCT coefficients, and generates the scanned DCT coefficients in the hierarchical coding unit.
86に出力する。 Output to 86.
[0209] 状態予測部 90は、基本レイヤ符号化部 74から入力された基本レイヤ復号化画像 に基づ!/、て状態予測パラメータを生成し、生成された状態予測パラメータを階層符 号ィ匕部 86に出力する。  State prediction unit 90 generates a state prediction parameter based on the base layer decoded image input from base layer coding unit 74, and generates the state prediction parameter as a hierarchical code. Output to section 86.
[0210] 次に、以上のように構成された映像符号ィ匕装置 70の動作を説明する。  Next, the operation of the video coding device 70 configured as described above will be described.
図 11は、第 3の実施の形態の映像符号ィ匕装置 70の動作の一例を示すフローチヤ ートである。なお、図 3に示すフローチャートは、図示しない記憶装置(例えば ROM やフラッシュメモリなど)に格納された制御プログラムを、同じく図示しない CPUが実 行することにより、プログラムの実行によりソフトウェア的に実行されるようにすることも 可能である。  FIG. 11 is a flow chart showing an example of the operation of the video coding apparatus 70 of the third embodiment. The flowchart shown in FIG. 3 is executed as software by executing a control program stored in a storage device (for example, a ROM, a flash memory, etc.) (not shown) and executed by the CPU (not shown). It is also possible to
[0211] まず、映像符号化装置 70は、映像信号入力処理を行う(S70)。具体的には、映像 信号入力部 72が、映像符号ィ匕装置 70の外部から映像を 1フレームずつ原画像とし て入力し、基本レイヤ符号ィ匕部 74と拡張レイヤ符号ィ匕部 78に出力する。 First, the video encoding device 70 performs video signal input processing (S70). Specifically, the video signal input unit 72 converts the video into an original image frame by frame from the outside of the video coding device 70. Input to the base layer code section 74 and the enhancement layer code section 78.
[0212] 次に、映像符号化装置 70は、基本レイヤ符号化処理を行う(S72)。具体的には、 基本レイヤ符号ィ匕部 74が、映像信号入力部 72から入力された原画像を符号化して 基本レイヤストリームを生成し、生成された基本レイヤストリームを基本レイヤ出力部 7 6に出力する。基本レイヤ符号ィ匕部 74が、基本レイヤストリームを復号ィ匕して基本レ ィャ復号化画像を生成し、生成された基本レイヤ復号化画像を拡張レイヤ符号化部 78に出力する。基本レイヤの符号化方法は、第 1の実施の形態と同様に MPEG— 4 AVCなど既存の方法を用いることとする。 Next, the video encoding device 70 performs base layer encoding processing (S72). Specifically, the base layer coding unit 74 encodes the original image input from the video signal input unit 72 to generate a base layer stream, and the generated base layer stream is output to the base layer output unit 76. Output. Base layer coding section 74 decodes the base layer stream to generate a base layer decoded image, and outputs the generated base layer decoded image to enhancement layer coding section 78. Similar to the first embodiment, the base layer coding method uses the existing method such as MPEG-4 AVC.
[0213] 次に、映像符号化装置 70は差分処理を行う(S74)。具体的には、差分部 80が、 映像信号入力部 72から入力された原画像と基本レイヤ符号化部 74から入力された 基本レイヤ復号化画像との差分をとり差分画像を生成する。差分部 80は、生成され た差分画像を DCT部 82に出力する。 Next, the video encoding device 70 performs differential processing (S 74). Specifically, the difference unit 80 obtains a difference between the original image input from the video signal input unit 72 and the base layer decoded image input from the base layer coding unit 74 to generate a difference image. The difference unit 80 outputs the generated difference image to the DCT unit 82.
[0214] 次に、映像符号化装置 70は、状態予測処理を行う(S76)。具体的には、状態予測 部 88が、基本レイヤ符号化部 74から入力された基本レイヤ復号化画像に基づ ヽて 状態予測パラメータを生成し、生成された状態予測パラメータを階層符号化部 86に 出力する。 [0214] Next, the video encoding device 70 performs state prediction processing (S76). Specifically, the state prediction unit 88 generates a state prediction parameter based on the base layer decoded image input from the base layer coding unit 74, and the generated state prediction parameter is transmitted to the layer coding unit 86. Output to
[0215] 状態予測部 88の予測では、 DCT部 82の処理ブロックにあわせて基本レイヤ復号 化画像をブロックに分割し、ブロックごとに特性を調べて対応する拡張レイヤのブロッ クの DCT係数の状態を予測する。ブロックの特性とは、ブロックを基本レイヤストリー ムとして符号ィ匕した際の符号量や、そのブロックの基本レイヤ復号ィ匕画像に含まれる エッジの量である。ブロックに含まれるエッジが多いと符号量も多くなる。エッジの量 は、公知の Robertsフィルタや Sobelフィルタ、 DCT変換して DCT係数の絶対和を 用いるなどして計算する。ブロックの拡張レイヤの状態とは、差分映像の対応するブ ロックが含むエッジの量である。差分映像のあるブロックにエッジが多いと、 DCT変換 などの直交変換を行った際に高周波に大きい絶対値を持つ係数が現れる。状態予 測部 88は、基本レイヤ復号ィ匕画像のあるブロックがエッジを多く含んでいればいるほ ど、拡張レイヤの DCT係数は高周波に大きい値を含むと予測する。拡張レイヤの D CT係数の高周波係数への偏りを状態予測パラメータとする。 [0216] 次に、映像符号ィ匕装置 70は DCT処理を行う(S78)。具体的には、 DCT部 82が、 差分部 80から入力された差分画像を 8 X 8画素ごとの領域であるブロックに分割し、 ブロックごとに DCT変換を施して DCT係数を生成し、生成された DCT係数をスキヤ ン部 84に出力する。 In the prediction of the state prediction unit 88, the base layer decoded image is divided into blocks according to the processing block of the DCT unit 82, the characteristics are checked for each block, and the state of the DCT coefficient of the corresponding block of the enhancement layer Predict. The characteristics of a block are the amount of code when a block is coded as a base layer stream, and the amount of edges included in the base layer decoded image of that block. When the number of edges included in the block is large, the code amount is also large. The amount of edge is calculated by using a known Roberts filter, Sobel filter, DCT transform, and using the absolute sum of DCT coefficients. The state of the block's enhancement layer is the amount of edges that the corresponding block of the difference image contains. When there are many edges in a block with difference video, a coefficient with a large absolute value appears at high frequency when performing orthogonal transformation such as DCT transformation. The state prediction unit 88 predicts that the DCT coefficient of the enhancement layer contains a large value at a high frequency as the certain block of the base layer decoded image contains more edges. The bias to the high frequency coefficient of the DCT coefficient of the enhancement layer is used as the state prediction parameter. Next, the video code processing apparatus 70 performs a DCT process (S 78). Specifically, the DCT unit 82 divides the difference image input from the difference unit 80 into blocks which are regions of 8 × 8 pixels, performs DCT transform for each block, and generates DCT coefficients. The DCT coefficient is output to the scan unit 84.
[0217] 次に、映像符号ィ匕装置 70はスキャン処理を行う(S80)。具体的には、スキャン部 8 4力 DCT部 82から入力された DCT係数を定められた順番でスキャンしてスキャン 済 DCT係数を生成し、生成されたスキャン済 DCT係数を階層符号ィ匕部 86に出力す る。  [0217] Next, the video code processing apparatus 70 performs a scan process (S80). Specifically, the scanning unit 84 scans the DCT coefficients input from the four-force DCT unit 82 in a predetermined order to generate scanned DCT coefficients, and the generated scanned DCT coefficients are stored in the layer coding unit 86. Output to
[0218] 次に、映像符号化装置 70は、階層符号化処理を行う(S82)。具体的には、階層符 号ィ匕部 86が、状態予測部 88から入力された状態予測パラメータを用いて、スキャン 部 84から入力されたスキャン済 DCT係数を符号ィ匕し、符号化によって生成された拡 張レイヤストリームを拡張レイヤ出力部 90に出力する。  Next, the video encoding device 70 performs hierarchical coding processing (S 82). Specifically, the hierarchical encoding unit 86 encodes the scanned DCT coefficients input from the scanning unit 84 using the state prediction parameters input from the state prediction unit 88, and generates the encoded coefficients. The enhancement layer stream thus output is output to the enhancement layer output unit 90.
[0219] 階層符号ィ匕部 86による符号ィ匕では、ブロックごとのスキャン済 DCT係数をビットプ レーンごとに零ラン長符号ィ匕して零ラン長とビットプレーン終了信号との組み合わせ を生成し、状態予測パラメータを用いて可変長符号化を行う。  [0219] In the coding by the hierarchical coding unit 86, the scanned DCT coefficients for each block are subjected to zero run length coding for each bit plane to generate a combination of a zero run length and a bit plane end signal, Variable length coding is performed using state prediction parameters.
[0220] 階層符号ィ匕部 86が可変長符号化としてハフマン符号ィ匕を行う場合を考える。状態 予測部 88が DCT係数の高周波への偏りが大きいと予測したとき、偏りが小さいと予 測したときに比べ、ビットプレーン終了信号が ONである「1」のビットはスキャンの後ろ に出現する可能性が高い。よって、階層符号ィ匕部 86は、偏りが大きいと予測された ブロックに属する組み合わせを符号ィ匕する場合、零ランを開始するビットのスキャン 座標が大きければ大き 、ほど、ビットプレーン終了信号が ONである組み合わせに短 い符号を割り当てたノ、フマンテーブルを選択する。逆に、階層符号ィ匕部 86は、偏り 力 S小さいと予測されたブロックに属する組み合わせを符号化する場合、零ランを開始 するビットのスキャン座標が小さ 、場合にも、ビットプレーン終了信号が ONである組 み合わせに短 、符号を割り当てたノヽフマンテーブルを選択する。  A case will be considered in which hierarchical code unit 86 performs Huffman coding as variable-length coding. When the state prediction unit 88 predicts that the deviation to the high frequency of the DCT coefficient is large, the bit “1” in which the bit plane end signal is ON appears behind the scan, compared to when the deviation is predicted to be small Probability is high. Therefore, when the hierarchical coding unit 86 codes the combination belonging to the block predicted to have a large deviation, the larger the scan coordinate of the bit that starts the zero run, the larger the bit plane end signal turns ON. Select the Noh-Fuman table where short codes have been assigned to the combinations. Conversely, when coding the combination belonging to the block predicted to have a small bias power S, the hierarchical code input unit 86 has a small scan coordinate of the bit that starts the zero run, and the bit plane end signal is Select a short table with codes assigned to short for combinations that are ON.
[0221] DCT係数の高周波への偏りが大きいと予測したとき、小さいと予測するときに比べ 、低いビット位以上のビットプレーンにおいて MSBが出現している可能性が高い。よ つて、偏りが大きいと予測されたブロックに属する組み合わせを符号ィ匕する場合、零 ラン長の短 、組み合わせに短 、符号を割り当てたノヽフマンテーブルを選択して、ノヽ フマン符号化する。逆に、偏りが小さいと予測されたブロックに属する組み合わせを 符号化する場合、零ラン長の長 、組み合わせに短 、符号を割り当てたハフマンテー ブルを選択して、ハフマン符号ィ匕する。 [0221] When it is predicted that the high-frequency bias of the DCT coefficient is large, there is a high possibility that the MSB appears in bit planes of lower bit positions or more than when it is predicted that the coefficient is small. Therefore, when coding a combination belonging to a block predicted to have a large bias, it is zero Select the Nofman table where the run length is short, the combination is short, and the code is assigned, and perform Nofman encoding. Conversely, when coding a combination belonging to a block predicted to have a small bias, a Huffman table with a length of zero run length, a short combination, and a code assigned is selected to perform Huffman coding.
[0222] DCT係数の高周波への偏りが大きいと予測したとき、階層符号ィ匕部 86は、スキヤ ン座標の大きい符号ィ匕情報が「1」である出現確率を高くして、算術符号化を行う。  [0222] When it is predicted that the high-frequency bias of the DCT coefficient is large, the hierarchical coding unit 86 performs arithmetic coding with a high probability of occurrence that the code information with large scan coordinates is "1". I do.
[0223] 次に、映像符号化装置 70は、ストリーム出力処理を行う(S84)。具体的には、基本 レイヤ出力部 76が、基本レイヤ符号ィ匕部 74から入力された基本レイヤストリームを映 像符号ィ匕装置 70の外部に出力する。拡張レイヤ出力部 90は、拡張レイヤ符号化部 78から入力された拡張レイヤストリームを映像符号ィ匕装置 70の外部に出力する。  Next, the video encoding device 70 performs stream output processing (S 84). Specifically, the base layer output unit 76 outputs the base layer stream input from the base layer coding unit 74 to the outside of the video coding unit 70. The enhancement layer output unit 90 outputs the enhancement layer stream input from the enhancement layer coding unit 78 to the outside of the video coding device 70.
[0224] 次に、映像符号化装置 70は、終了判定処理 (S86)を行う。具体的には、映像信号 入力部 72が、映像符号化装置 70の外部から入力する映像の有無を判定する。その 判定の結果、映像の入力がなければ処理を終了し、映像の入力があれば映像信号 入力処理 (S70)の処理に戻る。以上、第 3の実施の形態の映像符号化装置 70につ いて説明した。  Next, the video encoding device 70 performs an end determination process (S86). Specifically, the video signal input unit 72 determines the presence or absence of a video input from the outside of the video encoding device 70. As a result of the determination, if there is no video input, the process ends, and if there is a video input, the process returns to the video signal input process (S70). The video encoding device 70 according to the third embodiment has been described above.
[0225] 第 3の実施の形態によれば、映像符号化装置 70は、拡張レイヤの符号ィ匕情報に対 して、基本レイヤのエッジの状態から、零ラン長や「1」である確率、ビットプレーン終 了信号が ONである確率を予測し、予測をもとにしてハフマン符号ィ匕に用いるハフマ ンテーブルを入れ替え、可変長符号ィ匕の符号ィ匕効率を向上することが可能である。  According to the third embodiment, the video encoding device 70 has a probability of being a zero run length or “1” from the state of the edge of the base layer with respect to the code layer information of the enhancement layer. It is possible to predict the probability that the bit plane end signal is ON, and replace the Huffman table used for the Huffman code based on the prediction to improve the code efficiency of the variable length code. is there.
[0226] 上記した第 3の実施の形態では、可変長符号化としてハフマン符号ィ匕を行う例につ いて説明したが、可変長符号化として算術符号ィ匕を行ってもよい。この場合、状態予 測部 88は、以下のように状態予測を行う。 DCT係数の高周波への偏りが大きいと予 測したとき、階層符号ィ匕部 86は、ビットのスキャン座標が大きければ大きいほど、ビッ トプレーン終了信号が ONである出現確率を高くして、算術符号化を行う。逆に、状 態予測部 88が DCT係数の高周波への偏りが小さいと予測したとき、ビットのスキャン 座標が小さい場合にも、ビットプレーン終了信号力 ONである出現確率を高くして、算 術符号化を行う。  In the third embodiment described above, an example in which Huffman coding is performed as variable-length coding has been described, but arithmetic coding may be performed as variable-length coding. In this case, the state prediction unit 88 performs state prediction as follows. When it is predicted that the high-frequency bias of the DCT coefficient is large, the hierarchical coding unit 86 increases the probability that the bit plane end signal is ON as the scanning coordinate of the bit increases, thereby increasing the arithmetic code. Perform. Conversely, when the state prediction unit 88 predicts that the deviation of the DCT coefficients toward high frequencies is small, the probability that the bit plane end signal power is ON is made high even when the scan coordinates of the bit are small. Perform encoding.
[0227] 上記した第 3の実施の形態では、階層符号ィ匕方式において、基本レイヤに基づい て拡張レイヤの状態を予測する例について説明したが、他の画像情報に基づいて状 態予測を行ってもよい。 MPEG符号ィ匕などに用いる動き予測補償符号ィ匕は、原画像 を幾つ力の領域に分割し、符号化する領域ごとに前後の復号化画像から参照領域 を探索して、参照領域と符号化する領域との差分を可変長符号化する。この動き予 測補償符号ィ匕方式における参照フレームに基づいて後のフレームとの差分画像の 状態を予測することもできる。本発明を動き予測補償符号ィ匕に適用する場合には、 第 3の実施の形態における基本レイヤに代えて参照フレームを用い、拡張レイヤに代 えて差分画像を用いることにより、参照フレームから予測した状態に基づいて差分画 像を効率良く符号ィ匕できる。 In the third embodiment described above, the hierarchical coding scheme is based on the base layer. Although the example of predicting the state of the enhancement layer has been described, the state prediction may be performed based on other image information. The motion prediction / compensation code MPEG used for MPEG code 分割 etc. divides the original image into several areas, searches the reference area from the previous and subsequent decoded images for each area to be encoded, and encodes the reference area and the encoding area. The variable with the area to be It is also possible to predict the state of the difference image with the subsequent frame based on the reference frame in this motion prediction / compensation code system. When the present invention is applied to a motion prediction compensation code, prediction is performed from a reference frame by using a reference frame instead of the base layer in the third embodiment and using a difference image instead of the enhancement layer. The differential image can be efficiently encoded based on the state.
[0228] (第 4の実施の形態)  Fourth Embodiment
第 4の実施の形態では、先に復号化した基本レイヤの情報である復号化済情報に 基づ 、て拡張レイヤの状態を表す状態予測パラメータを予測し、その状態予測パラ メータに基づいて復号ィ匕を行う映像復号ィ匕装置について説明する。第 4の実施の形 態では、第 3の実施の形態の映像符号化装置 70で生成した映像ストリームを復号ィ匕 する映像復号化装置について説明する。  In the fourth embodiment, a state prediction parameter representing the state of the enhancement layer is predicted based on decoded information which is information of the base layer decoded earlier, and decoding is performed based on the state prediction parameter. A video decoding apparatus that performs a communication will be described. In the fourth embodiment, a video decoding apparatus for decoding a video stream generated by the video coding apparatus 70 according to the third embodiment will be described.
[0229] 図 12は、本発明の第 4の実施の形態に係る映像復号ィ匕装置 100の構成を示すブ ロック図である。図 12において、映像復号ィ匕装置 100は、基本レイヤ入力部 102、基 本レイヤ復号ィ匕部 104、拡張レイヤ入力部 106,拡張レイヤ復号ィ匕部 108、映像信 号出力部 120を有する。拡張レイヤ復号ィ匕部 108は、階層復号化部 110、逆スキヤ ン部 112、逆 DCT部 114、加算部 116、状態予測部 118を有する。第 4の実施の形 態の映像復号化装置 100の構成は、第 2の実施の形態の映像復号化装置と基本的 な構成は同じであるが、第 4の実施の形態は階層復号ィ匕部 110と状態予測部 118の 機能が異なる。  FIG. 12 is a block diagram showing a configuration of a video decoding / reproducing apparatus 100 according to a fourth embodiment of the present invention. In FIG. 12, the video decoding apparatus 100 includes a base layer input unit 102, a base layer decoding unit 104, an enhancement layer input unit 106, an enhancement layer decoding unit 108, and a video signal output unit 120. The enhancement layer decoding unit 108 includes a hierarchical decoding unit 110, a reverse scan unit 112, an inverse DCT unit 114, an addition unit 116, and a state prediction unit 118. The basic configuration of the video decoding apparatus 100 according to the fourth embodiment is the same as that of the video decoding apparatus according to the second embodiment, but the fourth embodiment is a hierarchical decoding. The functions of the unit 110 and the state prediction unit 118 are different.
[0230] 基本レイヤ入力部 102は、映像復号ィ匕装置 100の外部力も基本レイヤストリームを 入力し、基本レイヤ復号ィ匕部 104に出力する。基本レイヤ入力部 102は、外部から 基本レイヤストリームの入力の有無を判定し、基本レイヤストリームの入力がなければ 処理を終了する。  [0230] Base layer input section 102 also receives the base layer stream as an external force of video decoding / decoding apparatus 100, and outputs the base layer stream to base layer decoding section 104. The base layer input unit 102 determines the presence or absence of input of the base layer stream from the outside, and ends the processing if there is no input of the base layer stream.
[0231] 基本レイヤ復号ィ匕部 104は、基本レイヤ入力部 102から入力された基本レイヤスト リームを復号化し基本レイヤ復号化画像を生成し、生成された基本レイヤ復号化画 像を拡張レイヤ復号ィ匕部 108と映像信号出力部 120に出力する。 Base layer decoding section 104 receives a base layer sequence input from base layer input section 102. The stream is decoded to generate a base layer decoded image, and the generated base layer decoded image is output to the enhancement layer decoding unit 108 and the video signal output unit 120.
[0232] 拡張レイヤ入力部 106は、映像復号ィ匕装置 100の外部力も拡張レイヤストリームを 入力し、拡張レイヤ復号ィ匕部 108に出力する。 Enhancement layer input section 106 also receives an enhancement layer stream as an external force of video decoding / decoding apparatus 100 and outputs the enhancement layer stream to enhancement layer decoding section 108.
[0233] 映像信号出力部 120は、基本レイヤ復号化部 104から入力された基本レイヤ復号 化画像と、拡張レイヤ復号化部 108から入力された拡張レイヤ復号化画像を映像復 号ィ匕装置 100の外部に出力する。 [0233] Video signal output section 120 outputs the base layer decoded image input from base layer decoding section 104 and the enhancement layer decoded image input from enhancement layer decoding section 108 to video decoding apparatus 100. Output to the outside of
[0234] 階層復号化部 110は、状態予測部 118から入力された状態予測パラメータを用い て、拡張レイヤ入力部 106から入力された拡張レイヤストリームを復号ィ匕してスキャン 済 DCT係数を生成し、生成されたスキャン済 DCT係数を逆スキャン部 112に出力す る。 Hierarchical decoding section 110 decodes the enhancement layer stream input from enhancement layer input section 106 using the state prediction parameters input from state prediction section 118 to generate scanned DCT coefficients. The generated scanned DCT coefficients are output to the inverse scan unit 112.
[0235] 逆スキャン部 112は、階層復号ィ匕部 112から入力されたスキャン済 DCT係数を定 められた順番に並べ替える逆スキャンを行って DCT係数を生成し、生成された DCT 係数を逆 DCT部 114に出力する。  [0235] The reverse scan unit 112 performs reverse scan to rearrange the scanned DCT coefficients input from the hierarchical decoding unit 112 in a specified order to generate DCT coefficients, and reverses the generated DCT coefficients. Output to the DCT unit 114.
[0236] 逆 DCT部 114は、逆スキャン部 112から入力された DCT係数を領域ごとに逆 DC[0236] The inverse DCT unit 114 inverts the DCT coefficients input from the inverse scan unit 112 for each region.
Tを施し領域を合成して差分復号化画像を生成し、生成された差分復号化画像を加 算部 116に出力する。 T is applied to synthesize a region to generate a differentially decoded image, and the generated differentially decoded image is output to the addition unit 116.
[0237] 加算部 116は、基本レイヤ復号部 104から入力された基本レイヤ復号化画像と、逆 DCT部 114から入力された差分復号化画像とを加算して拡張レイヤ復号化画像を 生成し、生成された拡張レイヤ復号ィ匕画像を映像信号出力部 120に出力する。  [0237] The addition unit 116 adds the base layer decoded image input from the base layer decoding unit 104 and the differentially decoded image input from the inverse DCT unit 114 to generate an enhancement layer decoded image, The generated enhancement layer decoded image is output to the video signal output unit 120.
[0238] 状態予測部 118は、基本レイヤ復号ィ匕部 104から拡張レイヤ復号ィ匕部 108に入力 された基本レイヤ復号化画像に基づ!/ヽて状態予測パラメータを生成し、生成された 状態予測パラメータを階層復号化部 116に出力する。  State prediction section 118 generates state prediction parameters based on the base layer decoded image input from base layer decoding section 104 to enhancement layer decoding section 108, and is generated. The state prediction parameters are output to hierarchical decoding section 116.
[0239] 次に、以上のように構成された映像復号ィ匕装置 100の動作を説明する。  [0239] Next, the operation of the video decoding apparatus 100 configured as described above will be described.
図 13は、図 12に示す第 4の実施の形態の映像復号ィ匕装置 100の動作の一例を表 すフローチャートである。図 13に示すフローチャートは、図示しない記憶装置(例え ば ROMやフラッシュメモリなど)に格納された制御プログラムを、同じく図示しない CP Uが実行することにより、プログラムの実行によりソフトウェア的に実行されるようにす ることち可會である。 FIG. 13 is a flowchart showing an example of the operation of the video decoding / decoding device 100 of the fourth embodiment shown in FIG. The flowchart shown in FIG. 13 is executed by executing a control program stored in a storage device (not shown) (eg, ROM, flash memory, etc.) by executing the program by executing a program (not shown). To It is a good thing.
[0240] まず、映像復号ィ匕装置 100はストリーム入力処理を行う(S90)。具体的には、基本 レイヤ入力部 102が、映像復号ィ匕装置 100の外部力も基本レイヤストリームを入力し 、基本レイヤ復号ィ匕部 104に出力する。拡張レイヤ入力部 106が、映像復号化装置 100の外部力も拡張レイヤストリームを入力し、拡張レイヤ復号ィ匕部 108に出力する  First, the video decoding / reading apparatus 100 performs stream input processing (S90). Specifically, the base layer input unit 102 inputs the base layer stream also from the external power of the video decoding apparatus 100 and outputs the base layer stream to the base layer decoding unit 104. The enhancement layer input unit 106 also receives the enhancement layer stream from the external power of the video decoding apparatus 100 and outputs the enhancement layer stream to the enhancement layer decoding unit 108.
[0241] 次に、映像復号化装置 100は基本レイヤ復号化処理を行う(S92)。具体的には、 基本レイヤ復号ィ匕部 104が、基本レイヤ入力部 102から入力された基本レイヤストリ 一ムを復号化して基本レイヤ復号化画像を生成し、生成された基本レイヤ復号化画 像を拡張レイヤ復号ィ匕部 108と映像信号出力部 120に出力する。 Next, the video decoding apparatus 100 performs base layer decoding processing (S92). Specifically, base layer decoding section 104 decodes the base layer stream input from base layer input section 102 to generate a base layer decoded image, and the generated base layer decoded image is generated. It is output to the enhancement layer decoding unit 108 and the video signal output unit 120.
[0242] 次に、映像復号ィ匕装置 100は状態予測処理を行う(S94)。具体的には、状態予測 部 118が、基本レイヤ復号ィ匕部 104から入力された基本レイヤ復号ィ匕画像力も状態 予測パラメータを生成し、生成された状態予測パラメータを階層復号化部 116に出 力する。状態予測部 118の予測では、逆 DCT部 112が合成する領域にあわせて基 本レイヤ復号化画像を分割し、分割された領域ごとに特性を調べて、対応する拡張 レイヤの領域の DCT係数の状態を予測する。状態予測部 118が行う予測は、第 3の 実施の形態で行った予測と同じである。  Next, the video decoding apparatus 100 performs state prediction processing (S94). Specifically, the state prediction unit 118 also generates state prediction parameters for the base layer decoding / image power input from the base layer decoding unit 104, and outputs the generated state prediction parameters to the hierarchy decoding unit 116. To force. In the prediction of the state prediction unit 118, the base layer decoded image is divided according to the region to be synthesized by the inverse DCT unit 112, the characteristics are checked for each divided region, and the DCT coefficients of the corresponding extension layer region are Predict the state. The prediction performed by the state prediction unit 118 is the same as the prediction performed in the third embodiment.
[0243] 次に、映像復号ィ匕装置 100は階層復号ィ匕処理を行う(S96)。具体的には、階層復 号化部 110が、状態予測部 118から入力された状態予測パラメータを用いて、拡張 レイヤ入力部 106から入力された拡張レイヤストリームを復号ィ匕してスキャン済 DCT 係数を生成し、生成されたスキャン済 DCT係数を逆スキャン部 112に出力する。階 層復号ィ匕部 86の復号ィ匕では、状態予測パラメータを用いて可変長復号ィ匕を行い、 各領域のビットプレーンごとの零ラン長とビットプレーン終了信号の組み合わせを生 成し、それを零ラン長復号ィ匕することによってスキャン済 DCT係数を生成する。階層 復号ィ匕部 110は、状態予測パラメータに基づいて、ハフマンテーブルを選択し、選択 されたハフマンテーブルを用いて復号ィ匕を行う。  Next, the video decoding apparatus 100 performs hierarchical decoding processing (S96). Specifically, the hierarchical decoding unit 110 decodes the enhancement layer stream input from the enhancement layer input unit 106 using the state prediction parameters input from the state prediction unit 118, and the scanned DCT coefficients are decoded. , And outputs the generated scanned DCT coefficients to the inverse scan unit 112. In the decoding of the hierarchical decoding unit 86, variable length decoding is performed using state prediction parameters, and a combination of zero run length and bit plane end signal for each bit plane of each area is generated, Generate scanned DCT coefficients by zero run length decoding. The hierarchy decoding unit 110 selects a Huffman table based on the state prediction parameter, and performs decoding using the selected Huffman table.
[0244] 次に、映像復号ィ匕装置 100は逆スキャン処理を行う(S98)。具体的には、逆スキヤ ン部 112が、階層復号化部 112から入力されたスキャン済 DCT係数を定められた順 番に並べ替える逆スキャンを行って DCT係数を生成し、生成された DCT係数を逆 D CT部 114に出力する。 Next, the video decoding apparatus 100 performs reverse scan processing (S 98). Specifically, reverse scan unit 112 determines the order in which the scanned DCT coefficients input from hierarchical decoding unit 112 are determined. Inverse scan is performed to rearrange numbers to generate DCT coefficients, and the generated DCT coefficients are output to the inverse DCT unit 114.
[0245] 次に、映像復号ィ匕装置 100は逆 DCT処理を行う(S100)。具体的には、逆 DCT部 114が、逆スキャン部 112から入力された DCT係数を領域ごとに逆 DCTを施し、領 域を合成して差分復号化画像を生成し、生成された差分復号化画像を加算部 116 に出力する。  Next, the video decoding / writing apparatus 100 performs inverse DCT processing (S 100). Specifically, the inverse DCT unit 114 performs inverse DCT on the DCT coefficients input from the inverse scan unit 112 for each area, synthesizes the areas, and generates a differentially decoded image, and the generated differential decoding The image is output to the addition unit 116.
[0246] 次に、映像復号化装置 100は加算処理を行う(S112)。具体的には、加算部 116 が、基本レイヤ復号部 104が拡張レイヤ復号ィ匕部 108に入力した基本レイヤ復号ィ匕 画像と、逆 DCT部 114から入力された差分復号化画像とを加算して拡張レイヤ復号 化画像を生成する。加算部 116は、生成された拡張レイヤ復号化画像を映像信号出 力部 120に出力する。  [0246] Next, the video decoding apparatus 100 performs an addition process (S112). Specifically, the addition unit 116 adds the base layer decoded image input to the enhancement layer decoding unit 108 by the base layer decoding unit 104 and the differentially decoded image input from the inverse DCT unit 114. To generate an enhancement layer decoded image. The addition unit 116 outputs the generated enhancement layer decoded image to the video signal output unit 120.
[0247] 次に、映像復号ィ匕装置 100は映像信号出力処理を行う(S114)。具体的には、映 像信号出力部 120が、基本レイヤ復号ィ匕部 104から入力された基本レイヤ復号ィ匕画 像と、拡張レイヤ復号化部 108から入力された拡張レイヤ復号化画像を映像復号ィ匕 装置 100の外部に出力する。映像信号出力部 120は、基本レイヤ復号化画像か拡 張レイヤ復号ィ匕画像のどちらか一方のみを外部に出力してもよい。  Next, the video decoding / reproducing apparatus 100 performs video signal output processing (S114). Specifically, the video signal output unit 120 outputs the base layer decoded image input from the base layer decoding unit 104 and the enhancement layer decoded image input from the enhancement layer decoding unit 108 as video. Output to the outside of the decryption device 100. The video signal output unit 120 may output only one of the base layer decoded image and the enhancement layer decoded 匕 image to the outside.
[0248] 次に、映像復号化装置 100は終了判定処理を行う (S116) 0具体的には、基本レ ィャ入力部 102が外部力も基本レイヤストリームの入力の有無を判定する。基本レイ ヤストリームの入力がなければ処理を終了し、基本レイヤストリームの入力があればス トリーム入力処理 (S90)に戻る。以上、本発明の第 4の実施の形態の映像復号化装 置 100について説明した。 Next, the video decoding apparatus 100 performs an end determination process (S116) 0 Specifically, the basic layer input unit 102 determines whether there is an input of the external layer or the input of the basic layer stream. If there is no input of the basic layer stream, the processing ends, and if there is an input of the basic layer stream, the processing returns to the stream input processing (S90). The video decoding apparatus 100 according to the fourth embodiment of the present invention has been described above.
[0249] 第 4の実施の形態によれば、映像復号化装置 100は、拡張レイヤの復号化情報に 対して、基本レイヤのエッジの状態から、零ラン長や「1」である確率を予測し、また、 ビットプレーン終了信号が ONである確率を予測することによって、予測をもとにして ハフマン復号ィ匕に用いるハフマンテーブルを入れ替え、復号化情報を復号化するた めの符号量を少なくすることができ、画像の画質を向上することが可能である。  According to the fourth embodiment, the video decoding apparatus 100 predicts the zero run length and the probability of being “1” from the state of the edge of the base layer with respect to the decoding information of the enhancement layer. Also, by predicting the probability that the bit plane end signal is ON, the Huffman table used for the Huffman decoding is replaced based on the prediction, and the code amount for decoding the decoding information is reduced. It is possible to improve the quality of the image.
[0250] 上記した第 4の実施の形態では、ハフマン復号ィ匕を行う例について説明したが、階 層復号ィ匕部 110は、算術復号ィ匕を行ってもよい。この場合、予測した拡張レイヤの状 態に応じて、算術復号ィ匕に用いるシンボルの出現確率を変えて復号ィ匕を行う。 [0250] In the fourth embodiment described above, an example in which Huffman decoding is performed has been described, but the hierarchical decoding unit 110 may perform arithmetic decoding. In this case, the state of the predicted enhancement layer Decoding is performed by changing the appearance probability of symbols used for arithmetic decoding according to the state.
[0251] 上記した第 4の実施の形態では、基本レイヤから拡張レイヤの状態を予測し、予測 した状態に基づいて拡張レイヤの復号ィ匕を行う例について説明したが、動き予測補 償復号ィ匕を行う場合には、参照画像に基づいて状態を予測してもよい。本発明を動 き予測補償符号ィ匕に適用する場合には、第 4の実施の形態における基本レイヤに代 えて参照フレームを用い、拡張レイヤに代えて差分画像を用いることにより、参照フレ 一ムカも予測した状態に基づいて差分画像を復号ィ匕できる。  In the fourth embodiment described above, an example has been described in which the state of the enhancement layer is predicted from the base layer and decoding of the enhancement layer is performed based on the predicted state. However, motion prediction / compensation decoding When performing chewing, the state may be predicted based on the reference image. When the present invention is applied to a motion prediction compensation code, a reference frame is used instead of the base layer in the fourth embodiment, and a difference image is used instead of the enhancement layer to obtain a reference frame. The difference image can be decoded based on the predicted state.
[0252] 以上に、現時点で考えられる本発明の好適な実施の形態を説明した力 本実施の 形態に対して多様な変形が可能なことが理解され、本発明の真実の精神と範囲内に あるそのようなすべての変形を添付の請求の範囲が含むことが意図されている。 産業上の利用可能性  As described above, it is understood that various modifications can be made to the embodiment of the present invention which has described the presently preferred embodiment of the present invention, and it is within the true spirit and scope of the present invention. It is intended that the appended claims cover all such variations. Industrial applicability
[0253] 以上説明したように、本発明は、先に符号化された変換係数に関する情報に基づ いて後に符号化されるビットの状態、すなわち後に符号化されるビットにおける「1」と 「0」の出現確率を予測することによって、効率の良い符号ィ匕を行うことができるという すぐれた効果を有し、映像を符号ィ匕する映像符号ィ匕装置等として有用である。本発 明は、特に通信速度の変動するネットワークを介して映像ストリームの量を動的に変 化させながら映像を送受信するシステムの映像符号ィ匕方式等に有用である。 As described above, according to the present invention, the states of bits to be encoded later based on information on previously encoded transform coefficients, that is, “1” and “0” in bits to be encoded later. By predicting the appearance probability of “,” it has an excellent effect of being able to perform efficient coding, and is useful as a video coding apparatus that codes video. The present invention is particularly useful for a video coding scheme of a system that transmits and receives video while dynamically changing the amount of video stream via a network with a variable communication speed.

Claims

請求の範囲 The scope of the claims
[1] 映像を周波数変換して周波数成分を表す変換係数を生成する変換係数生成部と 前記変換係数生成部にて生成された変換係数を 2進数に変換し、複数の変換係 数の同じ位のビットからなるビットプレーンを最上位ビットから最下位ビットに至るまで 生成し、上位ビットプレーン力も順に符号ィ匕するビットプレーン符号ィ匕部と、  [1] A conversion coefficient generation unit that generates a conversion coefficient representing a frequency component by frequency-converting an image, and the conversion coefficient generated by the conversion coefficient generation unit are converted into binary numbers, and the same order of a plurality of conversion coefficients A bit plane code section that generates bit planes consisting of bits from the most significant bit to the least significant bit, and the higher order bit plane powers in order.
先に符号化された変換係数に関する情報に基づいて、後に符号化される変換係数 のビットの状態を予測する状態予測部と、  A state prediction unit that predicts a state of bits of a transform coefficient to be encoded later, based on information on the transform coefficient encoded earlier;
を備え、  Equipped with
前記ビットプレーン符号ィ匕部は、前記状態予測部によって予測されたビットの状態 に応じて符号ィ匕を行う映像符号化装置。  The video coding apparatus according to claim 1, wherein the bit-plane coding unit performs coding according to the state of the bit predicted by the state prediction unit.
[2] 前記状態予測部は、上位のビットプレーンの情報に基づいて、下位のビットプレー ンのビットの状態を予測する請求項 1に記載の映像符号ィ匕装置。 [2] The video coding device according to claim 1, wherein the state prediction unit predicts the state of the bits of the lower bit plane based on the information of the upper bit plane.
[3] 前記状態予測部は、予測対象のビットより上位のビットプレーンにおいて、ビットの 状態「1」が現れた変換係数の個数に応じてビットの状態を予測する請求項 2に記載 の映像符号化装置。 [3] The video code according to claim 2, wherein the state prediction unit predicts the state of the bit according to the number of transform coefficients in which the state “1” of the bit appears in the bit plane higher than the bit to be predicted. Device.
[4] 前記状態予測部は、予測対象のビットの変換係数にぉ 、て上位ビットにビットの状 態「1」が現れたか否かに基づいて、ビットの状態を予測する請求項 2に記載の映像 符号化装置。  [4] The state prediction unit according to claim 2, wherein the state prediction unit predicts the state of the bit based on whether or not the state of the bit “1” appears in the upper bits of the transform coefficient of the bit to be predicted. Video coding device.
[5] 前記状態予測部は、予測対象のビットより上位のビットプレーンにお 、てビットの状 態「1」が現れた変換係数のうちで、予測対象のビットの変換係数よりも符号ィ匕順序が 後方に位置し、かつ、最も近い変換係数までの距離に基づいて、ビットの状態を予測 する請求項 2に記載の映像符号化装置。  [5] The state prediction unit is configured to generate a code having a code state that is higher than the conversion coefficient of the prediction target bit among the conversion coefficients in which the bit state “1” appears in the bit plane higher than the prediction target bit. The video encoding apparatus according to claim 2, wherein the state of the bit is predicted based on the distance to the closest transform coefficient, which is located behind the order.
[6] 前記状態予測部は、予測対象のビットより上位のビットプレーンにおいて、符号ィ匕 順序が最も後方の位置でビットの状態「1」が現れた変換係数に基づいて、ビットの状 態を予測する請求項 2に記載の映像符号化装置。  [6] The state prediction unit determines the state of the bit based on the conversion coefficient in which the state of the bit “1” appears at the last position of the code order in the bit plane higher than the bit to be predicted. The video coding apparatus according to claim 2, wherein the video coding apparatus predicts.
[7] 前記状態予測部は、予測対象のビットの状態力 「1」であり、かつ、符号ィ匕順序が後 方に位置する変換係数の同じビットプレーンに属する全てのビットの状態が「0」であ る確率を予測し、 [7] The state prediction unit is a state power “1” of the bit to be predicted, and the state of all bits belonging to the same bit plane of the conversion coefficient whose code order is located backward is “0”. " Predict the probability of
前記ビットプレーン符号化部は、前記確率に基づいて、零ラン長符号化を行う請求 項 2に記載の映像符号化装置。  The video coding apparatus according to claim 2, wherein the bit plane coding unit performs zero run length coding based on the probability.
[8] 前記状態予測部は、予測対象のビットより上位のビットプレーンにおいて、あらかじ め定められた順番以降の符号ィ匕順序の変換係数にビットの状態「1」を含む力否かを 判定し、ビットの状態「1」を含まないとの判定に応じて、予測対象のビットの状態が「1 」であり、かつ、符号ィヒ順序が後方に位置する変換係数の同じビットプレーンに属す る全てのビットの状態が「0」である確率を予測し、 [8] The state prediction unit determines whether or not the conversion coefficient of the code order after the predetermined order includes the bit state “1” in the bit plane higher than the bit to be predicted. And according to the determination that the bit state "1" is not included, the state of the bit to be predicted is "1" and the code order belongs to the same bit plane of the conversion coefficient located behind. Predict the probability that the state of all bits is "0",
前記ビットプレーン符号化部は、前記確率に基づいて、零ラン長符号化を行う請求 項 2に記載の映像符号化装置。  The video coding apparatus according to claim 2, wherein the bit plane coding unit performs zero run length coding based on the probability.
[9] 前記状態予測部は、予測対象のビットを含む零ラン長を予測し、 [9] The state prediction unit predicts a zero run length including a bit to be predicted,
前記ビットプレーン符号化部は、前記状態予測部にて予測された零ラン長に基づ V、て選択したノヽフマンテーブルを用いてハフマン符号ィ匕を行う請求項 2に記載の映 像符号化装置。  3. The image coding method according to claim 2, wherein the bit plane coding unit performs Huffman coding using the selected table based on the zero run length predicted by the state prediction unit. apparatus.
[10] 前記状態予測部は、ビットの状態「0」が連続して出現する確率があらかじめ設定さ れた閾値を下回る箇所までのゼロの個数を零ラン長として求める請求項 9に記載の 映像符号化装置。  10. The image according to claim 9, wherein the state prediction unit obtains, as a zero run length, the number of zeros up to a point where the probability that the bit state “0” appears continuously falls below a preset threshold. Encoding device.
[11] 前記状態予測部は、予測対象のビットの状態が「1」又は「0」である確率を予測し、 前記ビットプレーン符号化部は、前記状態予測部にて予測された前記確率に基づ V、て決定したシンボルの生起確率を用いて、算術符号ィ匕を行う請求項 2に記載の映 像符号化装置。  [11] The state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bitplane coding unit is configured to calculate the probability predicted by the state prediction unit. 3. The video encoding apparatus according to claim 2, wherein arithmetic coding is performed using the occurrence probability of the symbol determined based on V,.
[12] 前記状態予測部は、予測対象のビットの状態力 「l」であり、かつ、符号ィ匕順序が後 方に位置する変換係数の同じビットプレーンに属する全てのビットの状態が「0」であ る確率を予測し、  [12] The state prediction unit is a state power “1” of the bit to be predicted, and the state of all bits belonging to the same bit plane of the conversion coefficient whose code order is located backward is “0”. Predict the probability of
前記ビットプレーン符号化部は、前記状態予測部にて予測された前記確率に基づ V、て決定したシンボルの生起確率を用いて、算術符号ィ匕を行う請求項 2に記載の映 像符号化装置。  3. The video code according to claim 2, wherein the bit plane coding unit performs arithmetic coding using the occurrence probability of the symbol determined based on the probability predicted by the state prediction unit. Device.
[13] 映像を単独で復号ィ匕可能な基本レイヤと基本レイヤの映像品質を向上させる拡張 レイヤとに階層符号化する映像符号化装置であり、 [13] Enhancement to improve the video quality of base layer and base layer that can decode video independently It is a video coding device that performs hierarchical coding with layers,
前記状態予測部は、前記基本レイヤの情報に基づいて、前記拡張レイヤの変換係 数のビットの状態を予測し、  The state prediction unit predicts the state of the bit of the conversion coefficient of the enhancement layer based on the information of the base layer;
前記ビットプレーン符号ィ匕部は、前記状態予測部によって予測されたビットの状態 に応じて前記拡張レイヤの変換係数を符号化する請求項 1に記載の映像符号ィ匕装 置。  The video coding apparatus according to claim 1, wherein the bit-plane coding unit encodes the transform coefficient of the enhancement layer according to the state of the bit predicted by the state prediction unit.
[14] 前記状態予測部は、前記基本レイヤに含まれるエッジの情報または前記基本レイ ャの符号ィ匕量に基づ 、て、前記拡張レイヤの変換係数のビットの状態を予測する請 求項 13に記載の映像符号化装置。  [14] The condition prediction unit is a request to predict the state of the bit of the transform coefficient of the enhancement layer based on edge information included in the base layer or a code amount of the base layer. 13. The video encoding device according to 13.
[15] 前記状態予測部は、映像を構成するフレームの変換係数のビットの状態を、動き予 測補償符号ィ匕に用いる参照フレームの情報に基づいて予測し、 [15] The state prediction unit predicts the state of bits of transform coefficients of a frame making up an image based on information of a reference frame used for motion prediction / compensation code 、,
前記ビットプレーン符号ィ匕部は、前記状態予測部によって予測されたビットの状態 に応じて前記フレームの変換係数を符号ィ匕する請求項 1に記載の映像符号ィ匕装置。  The video coding apparatus according to claim 1, wherein the bit plane coding unit codes the conversion coefficient of the frame according to the state of the bit predicted by the state prediction unit.
[16] 前記状態予測部は、前記参照フレームに含まれるエッジの情報または前記参照フ レームの符号ィ匕量に基づ 、て、前記フレームの変換係数のビットの状態を予測する 請求項 15に記載の映像符号化装置。 [16] The state prediction unit predicts a state of bits of transform coefficients of the frame based on edge information included in the reference frame or a code amount of the reference frame. Video coding device according to claim 1.
[17] ビットプレーン符号ィ匕された映像の符号ィ匕データを上位ビットプレーン力も順に復 号ィ匕するビットプレーン復号ィ匕部と、 [17] A bit-plane decoding unit that decodes the bit-plane coded image data and the upper-bit-plane data sequentially in the same manner as the upper-bit-plane power, and
先に復号化された変換係数に関する情報に基づいて、後に復号化される変換係数 のビットの状態を予測する状態予測部と、  A state prediction unit that predicts the state of bits of the transform coefficient to be decoded later based on the information on the previously decoded transform coefficient;
を備え、  Equipped with
前記ビットプレーン復号ィ匕部は、前記状態予測部によって予測されたビットの状態 に応じて復号ィ匕を行う映像復号化装置。  The video decoding apparatus, wherein the bit plane decoding unit performs decoding according to the state of the bit predicted by the state prediction unit.
[18] 前記状態予測部は、上位のビットプレーンの情報に基づいて、下位のビットプレー ンのビットの状態を予測する請求項 17に記載の映像復号ィ匕装置。 [18] The video decoding apparatus according to claim 17, wherein the state prediction unit predicts the state of the bits of the lower bit plane based on the information of the upper bit plane.
[19] 前記状態予測部は、予測対象のビットより上位のビットプレーンにおいて、ビットの 状態「1」が現れた変換係数の個数に応じてビットの状態を予測する請求項 18に記 載の映像復号化装置。 [19] The image according to claim 18, wherein the state prediction unit predicts the state of the bit according to the number of transform coefficients in which the state of the bit “1” appears in the bit plane higher than the bit to be predicted. Decryption device.
[20] 前記状態予測部は、予測対象のビットの変換係数にぉ 、て上位ビットにビットの状 態「1」が現れたか否かに基づいて、ビットの状態を予測する請求項 18に記載の映像 復号化装置。 [20] The state prediction unit according to claim 18, wherein the state prediction unit predicts the state of the bit based on whether or not the state of the bit “1” appears in the upper bits of the transform coefficient of the bit to be predicted. Video decoder.
[21] 前記状態予測部は、予測対象のビットより上位のビットプレーンにお 、てビットの状 態「1」が現れた変換係数のうちで、予測対象のビットの変換係数よりも復号ィ匕順序が 後方に位置し、かつ最も近い変換係数までの距離に基づいて、ビットの状態を予測 する請求項 18に記載の映像復号化装置。  [21] The state prediction unit decodes more than the conversion coefficient of the prediction target bit among the conversion coefficients in which the bit state “1” appears in the bit plane higher than the prediction target bit. The video decoding apparatus according to claim 18, wherein the state of the bit is predicted based on the distance to the closest transform coefficient, which is located behind the order.
[22] 前記状態予測部は、予測対象のビットより上位のビットプレーンにおいて、復号ィ匕 順序が最も後方の位置でビットの状態「1」が現れた変換係数に基づいて、ビットの状 態を予測する請求項 18に記載の映像復号化装置。 [22] The state prediction unit determines the bit state based on the conversion coefficient in which the bit state “1” appears at the position at the rearmost position of the decoding order in the bit plane higher than the bit to be predicted. The video decoding apparatus according to claim 18, which predicts.
[23] 前記状態予測部は、予測対象のビットの状態力 「l」であり、かつ、復号ィヒ順序が後 方に位置する変換係数の同じビットプレーンに属する全てのビットの状態が「0」であ る確率を予測し、 [23] The state prediction unit is a state power “1” of the bit to be predicted, and the state of all bits belonging to the same bit plane of the conversion coefficient whose decoding order is located backward is “0”. Predict the probability of
前記ビットプレーン復号化部は、前記確率に基づいて、零ラン長復号化を行う請求 項 18に記載の映像復号化装置。  The video decoding apparatus according to claim 18, wherein the bit-plane decoding unit performs zero run length decoding based on the probability.
[24] 前記状態予測部は、予測対象のビットより上位のビットプレーンにおいて、あらかじ め定められた順番以降の復号ィ匕順序の変換係数にビットの状態「1」を含む力否かを 判定し、ビットの状態「1」を含まないとの判定に応じて、予測対象のビットの状態が「1 」であり、かつ、復号ィヒ順序が後方に位置する変換係数の同じビットプレーンに属す る全てのビットの状態が「0」である確率を予測し、 [24] The state prediction unit determines whether or not the conversion coefficient of the decoding order after the predetermined order includes the bit state “1” in the bit plane higher than the bit to be predicted. And according to the determination that the bit state "1" is not included, the state of the bit to be predicted is "1", and the decoding order belongs to the same bit plane of the transform coefficient located behind. Predict the probability that the state of all bits is "0",
前記ビットプレーン復号化部は、前記確率に基づいて、零ラン長復号化を行う請求 項 18に記載の映像復号化装置。  The video decoding apparatus according to claim 18, wherein the bit-plane decoding unit performs zero run length decoding based on the probability.
[25] 前記状態予測部は、予測対象のビットを含む零ラン長を予測し、 [25] The state prediction unit predicts a zero run length including a bit to be predicted,
前記ビットプレーン復号化部は、前記状態予測部にて予測された零ラン長に基づ いて選択したノヽフマンテーブルを用いてハフマン復号ィ匕を行う請求項 18に記載の映 像復号化装置。  The video decoding apparatus according to claim 18, wherein the bit-plane decoding unit performs Huffman decoding using a Nofman table selected based on the zero run length predicted by the state prediction unit.
[26] 前記状態予測部は、ビットの状態「0」が連続して出現する確率があらかじめ設定さ れた閾値を下回る箇所までのゼロの個数を零ラン長として求める請求項 25に記載の 映像復号化装置。 26. The state prediction unit according to claim 25, wherein the number of zeros until the probability that the state of the bit “0” appears continuously falls below a preset threshold is obtained as a zero run length. Video decoding device.
[27] 前記状態予測部は、予測対象のビットの状態が、「1」又は「0」である確率を予測し 前記ビットプレーン復号化部は、前記状態予測部にて予測された前記確率に基づ V、て決定したシンボルの生起確率を用いて、算術復号ィ匕を行う請求項 18に記載の 映像復号化装置。  [27] The state prediction unit predicts the probability that the state of the bit to be predicted is “1” or “0”, and the bit plane decoding unit calculates the probability predicted by the state prediction unit. The video decoding apparatus according to claim 18, wherein arithmetic decoding is performed using the occurrence probability of the symbol determined based on V,.
[28] 前記状態予測部は、予測対象のビットの状態力「l」であり、かつ、復号化順序が、 後方に位置する変換係数の同じビットプレーンに属する全てのビットの状態が「0」で ある確率を予測し、  [28] The state prediction unit is the state power “1” of the bit to be predicted, and the state of all bits belonging to the same bit plane of the conversion coefficient located behind in the decoding order is “0”. Predict the probability of being
前記ビットプレーン復号化部は、前記状態予測部にて予測された零ラン長に基づ V、て決定したシンボルの生起確率を用いて、算術復号ィ匕を行う請求項 18に記載の 映像復号化装置。  The video decoding according to claim 18, wherein the bit plane decoding unit performs arithmetic decoding using the occurrence probability of the symbol determined based on the zero run length predicted by the state prediction unit. Device.
[29] 前記状態予測部は、復号化された基本レイヤの情報に基づ 、て、基本レイヤの映 像品質を向上させる拡張レイヤの変換係数のビットの状態を予測し、  [29] The state prediction unit predicts, based on the decoded information of the base layer, the states of bits of transform coefficients of the enhancement layer for improving the video quality of the base layer,
前記ビットプレーン復号ィ匕部は、前記状態予測部によって予測されたビットの状態 に応じて前記拡張レイヤの変換係数を復号ィ匕する請求項 17に記載の映像復号ィ匕装 置。  The video decoding apparatus according to claim 17, wherein the bit plane decoding unit decodes the transform coefficient of the enhancement layer according to the state of the bit predicted by the state prediction unit.
[30] 前記状態予測部は、前記基本レイヤに含まれるエッジの情報または前記基本レイ ャの符号ィ匕量に基づ 、て、前記拡張レイヤの変換係数のビットの状態を予測する請 求項 29に記載の映像復号化装置。  [30] The state prediction unit is a request to predict the state of bits of the transform coefficient of the enhancement layer based on edge information included in the base layer or a code amount of the base layer. 29. The video decoding apparatus as described in 29.
[31] 前記状態予測部は、映像を構成するフレームの変換係数のビットの状態を、動き予 測補償符号ィ匕に用いる参照フレームの情報に基づいて予測し、 [31] The state prediction unit predicts the state of the bit of the transform coefficient of the frame making up the video based on the information of the reference frame used for the motion prediction / compensation code 、.
前記ビットプレーン復号ィ匕部は、前記状態予測部によって予測されたビットの状態 に応じて前記フレームの変換係数を復号ィ匕する請求項 17に記載の映像復号ィ匕装置  The video decoding apparatus according to claim 17, wherein the bit plane decoding unit decodes the transform coefficient of the frame according to the state of the bit predicted by the state prediction unit.
[32] 前記状態予測部は、前記参照フレームに含まれるエッジの情報または前記参照フ レームの符号ィ匕量に基づいて、前記フレームのビットの状態を予測する請求項 31に 記載の映像復号化装置。 32. The video decoding according to claim 31, wherein the state prediction unit predicts the state of the bits of the frame based on edge information included in the reference frame or a code amount of the reference frame. apparatus.
[33] 映像を周波数変換して周波数成分を表す変換係数を生成する変換係数生成ステ ップと、 [33] A transform coefficient generation step of frequency converting an image to generate transform coefficients representing frequency components;
前記変換係数生成ステップにお 、て生成された変換係数を 2進数に変換し、複数 の変換係数の同じ位のビットからなるビットプレーンを最上位ビットから最下位ビットに 至るまで生成し、上位ビットプレーン力も順に符号ィ匕するビットプレーン符号化ステツ プと、  In the conversion coefficient generation step, the generated conversion coefficients are converted into binary numbers, and bit planes consisting of the same-order bits of a plurality of conversion coefficients are generated from the most significant bit to the least significant bit, and the upper bits With the bit plane coding step where the plane force is also coded in order,
先に符号化された変換係数に関する情報に基づいて、後に符号化される変換係数 のビットの状態を予測する状態予測ステップと、  State prediction step of predicting the state of bits of the transform coefficient to be encoded later based on the information on the previously encoded transform coefficient;
を備え、  Equipped with
前記ビットプレーン符号化ステップは、前記状態予測ステップにお 、て予測された ビットの状態に応じて符号ィ匕を行う映像符号ィ匕方法。  A video coding method in which the bit plane coding step performs a coding according to a state of a bit predicted in the state prediction step.
[34] ビットプレーン符号ィ匕された映像の符号ィ匕データを上位ビットプレーン力も順に復 号ィ匕するビットプレーン復号化ステップと、 [34] A bit-plane decoding step of decoding the bit-plane coded image data and the code data of the video in sequence with the upper bit-plane power, and
先に復号化された変換係数に関する情報に基づいて、後に復号化される変換係数 のビットの状態を予測する状態予測ステップと、  State prediction step of predicting the state of bits of the transform coefficient to be decoded later based on information on the previously decoded transform coefficient;
を備え、  Equipped with
前記ビットプレーン復号化ステップは、前記状態予測部によって予測されたビットの 状態に応じて復号化を行う映像復号化方法。  The video decoding method, wherein the bit plane decoding step performs decoding in accordance with the state of the bit predicted by the state prediction unit.
[35] 映像を符号ィ匕するためにコンピュータに、 [35] To the computer to code the picture,
映像を周波数変換して周波数成分を表す変換係数を生成する変換係数生成ステ ップと、  A transform coefficient generation step of frequency-converting an image to generate transform coefficients representing frequency components;
前記変換係数生成ステップにお 、て生成された変換係数を 2進数に変換し、複数 の変換係数の同じ位のビットからなるビットプレーンを最上位ビットから最下位ビットに 至るまで生成し、上位ビットプレーン力も順に符号ィ匕するビットプレーン符号化ステツ プと、  In the conversion coefficient generation step, the generated conversion coefficients are converted into binary numbers, and bit planes consisting of the same-order bits of a plurality of conversion coefficients are generated from the most significant bit to the least significant bit, and the upper bits With the bit plane coding step where the plane force is also coded in order,
先に符号化された変換係数に関する情報に基づいて、後に符号化される変換係数 のビットの状態を予測する状態予測ステップと、  State prediction step of predicting the state of bits of the transform coefficient to be encoded later based on the information on the previously encoded transform coefficient;
を実行させ、 前記ビットプレーン符号化ステップは、前記状態予測ステップにお 、て予測された ビットの状態に応じて符号ィ匕を行うプログラム。 To run The bit plane coding step is a program for performing a code in accordance with a state of a bit predicted in the state prediction step.
ビットプレーン符号ィ匕された映像の符号ィ匕データを復号ィ匕するために、コンビユー タに、  In order to decode bit-plane-coded video code data, the computer can
上位ビットプレーン力も順に復号ィ匕するビットプレーン復号化ステップと、 先に復号化された変換係数に関する情報に基づいて、後に復号化される変換係数 のビットの状態を予測する状態予測ステップと、  A bit plane decoding step of sequentially decoding the upper bit plane power, and a state prediction step of predicting a bit state of a transform coefficient to be decoded later based on information on the previously decoded transform coefficient;
を実行させ、  To run
前記ビットプレーン復号化ステップは、前記状態予測部によって予測されたビットの 状態に応じて復号化を行うプログラム。  The said bit plane decoding step is a program which decodes according to the state of the bit estimated by the said state estimation part.
PCT/JP2005/013406 2004-10-26 2005-07-21 Video encoder, video decoder, video encoding method, and video decoding method WO2006046334A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2004310572A JP2006128759A (en) 2004-10-26 2004-10-26 Video coding device, video decoding device, video coding method and video decoding method
JP2004-310572 2004-10-26

Publications (1)

Publication Number Publication Date
WO2006046334A1 true WO2006046334A1 (en) 2006-05-04

Family

ID=36227585

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2005/013406 WO2006046334A1 (en) 2004-10-26 2005-07-21 Video encoder, video decoder, video encoding method, and video decoding method

Country Status (2)

Country Link
JP (1) JP2006128759A (en)
WO (1) WO2006046334A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010026351A1 (en) * 2008-09-05 2010-03-11 Commissariat à l'Energie Atomique Method for entropically transcoding a first binary data stream into a second compressed binary data stream, and corresponding computer program and image recording device
US10917663B2 (en) 2015-09-18 2021-02-09 Koninklijke Philips N.V. Method and apparatus for fast and efficient image compression and decompression
RU2765654C2 (en) * 2017-07-10 2022-02-01 Интопикс Method and device for digital data compression
RU2765654C9 (en) * 2017-07-10 2022-05-26 Интопикс Method and device for digital data compression

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20120038355A (en) * 2010-10-13 2012-04-23 삼성전자주식회사 Method and apparatus of entropy encoding/decoding and symbol endcoding/decoding method and apparatus therefor

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63191490A (en) * 1987-02-03 1988-08-08 Mitsubishi Electric Corp Image encoding and transmitting method
JPH06225274A (en) * 1993-01-21 1994-08-12 Oki Electric Ind Co Ltd Picture coding/decoding device
JPH11252573A (en) * 1998-03-05 1999-09-17 Nec Corp Hierarchical image coding system and hierarchical image decoding system
JP2001086004A (en) * 1999-08-02 2001-03-30 Samsung Electronics Co Ltd Variable length coding method and device
JP2001094982A (en) * 1999-09-20 2001-04-06 Nippon Telegr & Teleph Corp <Ntt> Hierarchical coding method and device, program recording medium used for realization of the method, hierarchical decoding method and device thereof, and program recording medium used for realization of the method
JP2002027472A (en) * 2000-07-10 2002-01-25 Fuji Xerox Co Ltd Image encoding device and method therefor, image decoding device and method therefor
JP2003250049A (en) * 2002-02-21 2003-09-05 Sony Corp Bit modeling apparatus

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63191490A (en) * 1987-02-03 1988-08-08 Mitsubishi Electric Corp Image encoding and transmitting method
JPH06225274A (en) * 1993-01-21 1994-08-12 Oki Electric Ind Co Ltd Picture coding/decoding device
JPH11252573A (en) * 1998-03-05 1999-09-17 Nec Corp Hierarchical image coding system and hierarchical image decoding system
JP2001086004A (en) * 1999-08-02 2001-03-30 Samsung Electronics Co Ltd Variable length coding method and device
JP2001094982A (en) * 1999-09-20 2001-04-06 Nippon Telegr & Teleph Corp <Ntt> Hierarchical coding method and device, program recording medium used for realization of the method, hierarchical decoding method and device thereof, and program recording medium used for realization of the method
JP2002027472A (en) * 2000-07-10 2002-01-25 Fuji Xerox Co Ltd Image encoding device and method therefor, image decoding device and method therefor
JP2003250049A (en) * 2002-02-21 2003-09-05 Sony Corp Bit modeling apparatus

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI W. ET AL.: "Overview of Fine granularity Scalability in MPEG-4 Video Standard.", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY., vol. 11, no. 3, March 2001 (2001-03-01), pages 301 - 317, XP000994715 *
PENG W. ET AL.: "Context-based binary arithmetic coding with stochastic bit reshuffling for advanced fine granularity scalability.", PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS'04), vol. 3, 23 May 2004 (2004-05-23), pages 953 - 956, XP010719708 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010026351A1 (en) * 2008-09-05 2010-03-11 Commissariat à l'Energie Atomique Method for entropically transcoding a first binary data stream into a second compressed binary data stream, and corresponding computer program and image recording device
FR2935865A1 (en) * 2008-09-05 2010-03-12 Commissariat Energie Atomique METHOD FOR ENTROPTICALLY TRANSCODING A FIRST BINARY DATA TRAIN TO A SECOND COMPRESSED BINARY DATA TRAIN, COMPUTER PROGRAM, AND CORRESPONDING IMAGE CAPTURE DEVICE
US8787455B2 (en) 2008-09-05 2014-07-22 Commissariat A L'energie Atomique Et Aux Energies Alternatives Method for entropically transcoding a first data stream into a second compressed binary data stream, and corresponding computer program and image recording device
US10917663B2 (en) 2015-09-18 2021-02-09 Koninklijke Philips N.V. Method and apparatus for fast and efficient image compression and decompression
RU2765654C2 (en) * 2017-07-10 2022-02-01 Интопикс Method and device for digital data compression
RU2765654C9 (en) * 2017-07-10 2022-05-26 Интопикс Method and device for digital data compression

Also Published As

Publication number Publication date
JP2006128759A (en) 2006-05-18

Similar Documents

Publication Publication Date Title
US10623781B2 (en) Variable length coding method and variable length decoding method
JP5590133B2 (en) Moving picture coding apparatus, moving picture coding method, moving picture coding computer program, moving picture decoding apparatus, moving picture decoding method, and moving picture decoding computer program
US9681139B2 (en) Method and apparatus for ROI coding using variable block size coding information
US8340179B2 (en) Methods and devices for coding and decoding moving images, a telecommunication system comprising such a device and a program implementing such a method
US20070242753A1 (en) Coding and decoding method and apparatus using plural scanning patterns
JP2013251903A (en) Rate control method for multi-layered video coding, and video encoding apparatus and video signal processing system using the same
JP2005295504A (en) Moving image encoder, decoder, control method, computer program and computer readable memory medium
EP2084907B1 (en) Method and system for scalable bitstream extraction
JP2005513928A (en) Complexity scalability (FGS) for fine grain video coding
JPWO2003079692A1 (en) Hierarchical encoding apparatus and decoding apparatus
JP2005318296A (en) Method, device, and program for encoding dynamic image
JP2005151391A (en) Method and apparatus for coding moving image, and program
WO2006046334A1 (en) Video encoder, video decoder, video encoding method, and video decoding method
JP2004048607A (en) Digital image coding device and method thereof
WO1998000977A1 (en) Method and apparatus for bound-based adaptive entropy encoding/decoding
WO2006038679A1 (en) Moving picture encoding device, method, program, and moving picture decoding device, method, and program
EP1843595A2 (en) Coding and decoding method and apparatus using plural scanning patterns
KR20020087810A (en) Apparatus and method for applying adaptive selective enhancement in the fine granular scalable coding
JP2008078979A (en) Allocation apparatus for hierarchical code volume of moving image, hierarchical encoder, allocation program for hierarchical code volume, and hierarchical encoding program
Pinho et al. L-infinity progressive image compression
JP2004134916A (en) Moving picture encoder and moving picture decoder
JP2004350072A (en) Image coder, image coding method, image coding program and recording medium with the program recorded thereon

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BW BY BZ CA CH CN CO CR CU CZ DK DM DZ EC EE EG ES FI GB GD GE GM HR HU ID IL IN IS KE KG KM KP KZ LC LK LR LS LT LU LV MA MD MK MN MW MX MZ NA NG NI NO OM PG PH PL PT RO RU SC SD SE SG SL SM SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SZ TZ UG ZM ZW AM AZ BY KG MD RU TJ TM AT BE BG CH CY DE DK EE ES FI FR GB GR HU IE IS IT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW MR NE SN TD TG

DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 05766332

Country of ref document: EP

Kind code of ref document: A1