CN103493494A - Image decoding apparatus, image encoding apparatus, and data structure of encoded data - Google Patents

Image decoding apparatus, image encoding apparatus, and data structure of encoded data Download PDF

Info

Publication number
CN103493494A
CN103493494A CN201280020067.0A CN201280020067A CN103493494A CN 103493494 A CN103493494 A CN 103493494A CN 201280020067 A CN201280020067 A CN 201280020067A CN 103493494 A CN103493494 A CN 103493494A
Authority
CN
China
Prior art keywords
decoding
mentioned
coefficient
conversion coefficient
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201280020067.0A
Other languages
Chinese (zh)
Inventor
八杉将伸
猪饲知宏
山本智幸
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Publication of CN103493494A publication Critical patent/CN103493494A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/129Scanning of coding units, e.g. zig-zag scan of transform coefficients or flexible macroblock ordering [FMO]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/88Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving rearrangement of data among different coding units, e.g. shuffling, interleaving, scrambling or permutation of pixel data or permutation of transform coefficient data among different blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A motion video decoding apparatus (1), which decodes a conversion coefficient that is obtained by executing a frequency conversion, for each of a plurality of conversion units, of pixel values of an image to be converted, from TU information (TUI) about encoded data obtained by encoding the conversion coefficient, is provided with: an area dividing unit (121) that divides the blocks to be converted, which are the above-mentioned conversion units, into a plurality of decoding areas; and an area decoding unit (122) that decodes the conversion units included in the decoding areas, by referring to VLC tables (TBL11) that are decoding information for obtaining the conversion coefficient from the TU information (TUI), and that are allotted to each of the decoding areas.

Description

The data structure of picture decoding apparatus, picture coding device and coded data
Technical field
The data structure of the picture decoding apparatus that the present invention relates to conversion coefficient is decoded, the picture coding device that conversion coefficient is encoded and coded data that transform coefficients encoding is obtained.
Background technology
In order to transmit efficiently or to record moving image, employing be by moving image being encoded to generate the dynamic image encoding device of coded data and by this coded data being decoded to generate the moving image decoding apparatus of decoded picture.
As concrete moving image encoding mode, such as enumerating H.264/MPEG-4.AVC, (Video Coding Expert Group: the joint development Video coding expert group) is adopted mode in KTA software, TMuC (Test Model under Consideration: the test model in consideration) adopted mode and by HEVC (the High-Efficiency Video Coding: (non-patent literatures 1,2) such as modes efficient video coding) proposed of the codec as its successor in software with codec to VCEG.
In above-mentioned moving image encoding mode, usually image is divided into to the piece of given size, by each piece, pixel value is carried out to frequency translation, induced transformation coefficient, and the conversion coefficient enforcement coding processing to being exported thus.At this, the size as the piece (coded object piece) that is made as the object that coding processes, although stipulated the size of various size, but have the code of the conversion coefficient of the coded object piece that size is larger to measure more trend.
In HM (HEVC TestModel:HEVC test model) 2.0, the technology of in the piece of the size more than 16 * 16 pixels, by 64 coefficients of at utmost only encoding, seeking the minimizing of code amount has been proposed.For example, about the piece of encoding more than 8 * 8, the technology (non-patent literature 3) in the zone of a kind of maximum 8 * 8 that is encoded to the low frequency component side has been proposed.
Yet, known: as to adopt above-mentioned technology, in the situation that omitted the coding of the coefficient that is positioned at the high fdrequency component side, reduce sometimes image quality in the coefficient coding of large-sized.
On the other hand, the piece of above size about 8 * 8 pixels, proposed that a kind of { run, the value of the combination of level} reduces the technology (non-patent literature 4) of code amount thus by calculating induced representation.In addition, run refers to the number (0 distance of swimming) of the zero coefficient continuous along the regulation scanning sequency, and level refers to the absolute value of coefficient.
Under above this background, in the HM3.0 of the follow-up specification as HM2.0, in order to reduce the code amount, adopt the proposal of non-patent literature 4, and in order to improve image quality, even if also encode all conversion coefficients under piece (change of scale) size of the size more than 16 * 16.
Technical literature formerly
Non-patent literature
Non-patent literature 1: " WD2:Working Draft2of High-Efficiency Video Coding (JCTVC-D503) ", Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16WP3and ISO/IEC JTC1/SC29/WGl14th Meeting:Daegu, KR, 1/2011 (in January, 2011 is open)
Non-patent literature 2: " WD3:Working Draft3of High-Efficiency Video Coding (JCTVC-E603) ", Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16WP3and ISO/IEC JTC1/SC29/WG115th Meeting:Geneva, CH, 3/2011 (in March, 2011 is open)
Non-patent literature 3: " Samsung ' s Response to the Call for Proposals on Video Compression Technology (JCTVC-A124) ", Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16WP3and ISO/IEC JTC1/SC29/WG111st Meeting:Dresden, DE, 4/2010 (in April, 2010 is open)
Non-patent literature 4: " CE5:coefficient coding with LCEC for large b1ocks (JCTVC-E383) ", Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16WP3and ISO/IEC JTC1/SC29/WG115th Meeting:Geneva, CH, 3/2011 (in March, 2011 is open)
Summary of the invention
Invent problem to be solved
Yet, in the situation that to 16 * 16 pixels with the larger-size first-class processing of encoding, exist size for the form of code coefficient to become large or for the many problems of computing quantitative change of the value of obtaining form.
For example, in the piece of 16 * 16 pixels, the number of coefficient is 256 to the maximum.In addition, in the piece of 32 * 32 pixels, the number of coefficient is 1024 to the maximum.
In addition, in the situation that to the processing of encoding of larger piece, as the form that there is size and become large trend, enumerate scanning form, distance of swimming amplitude coding (the run-level encoding of invisible scanning order; Run-length encoding) VLC form etc.
More specifically, the scanning form size that need to be directly proportional to the number of coefficient, the VLC form of distance of swimming amplitude coding need to the corresponding size of maximum run length.
The present invention has put just in view of the above-mentioned problems, and its purpose is to realize to reduce the data structure of picture decoding apparatus, picture coding device and coded data for obtain the amount of information of the decoded information of conversion coefficient, amount of calculation based on decoded information among coded data.
For solving the technological means of problem
About a side of the present invention, describe, as described below.; picture decoding apparatus involved in the present invention is in order to solve above-mentioned problem; it is characterized in that; this conversion coefficient of decoding among the coded data obtained from conversion coefficient encodes; this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains; above-mentioned picture decoding apparatus possesses: the change of scale cutting unit, and it is divided into a plurality of subunits by above-mentioned change of scale; With the conversion coefficient decoding unit, the conversion coefficient that it is decoded in above-mentioned subunit and comprise with reference to decoded information, this decoded information is for obtain the decoded information of above-mentioned conversion coefficient, the decoded information be assigned with according to each above-mentioned subunit among above-mentioned coded data.
In addition, picture coding device involved in the present invention is in order to solve above-mentioned problem, it is characterized in that, conversion coefficient is encoded, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, above-mentioned picture coding device possesses: the change of scale cutting unit, and it is divided into a plurality of subunits by above-mentioned change of scale; With the transform coefficients encoding unit, the conversion coefficient that it is encoded in above-mentioned change of scale and comprise with reference to coded message, this coded message is for the coded message of the above-mentioned conversion coefficient of encoding, the coded message that is assigned with according to each above-mentioned subunit.
In addition, the data structure of coded data involved in the present invention is in order to solve above-mentioned problem, it is characterized in that, by conversion coefficient is encoded and is generated, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, in the data structure of above-mentioned coded data, the position that comprises the above-mentioned conversion coefficient that becomes coded object is with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out, the picture decoding apparatus of thus above-mentioned coded data being decoded, according to the previous position of above-mentioned conversion coefficient in above-mentioned change of scale decoded, with above-mentioned relative position, determine the position of the above-mentioned conversion coefficient that becomes decoder object.
According to above-mentioned formation, the change of scale that at first will become the object of decoding in decoding is processed is divided into a plurality of subunits.
Change of scale refers to, pixel value is transformed into to the unit of frequency field.As change of scale, such as size of enumerating 64 * 64 pixels, 32 * 32 pixels, 16 * 16 pixels etc.
In the situation that change of scale is 16 * 16 sizes, subunit can be also for example the zone of 8 * 8 sizes.
In addition, according to above-mentioned formation, will be made as one by one the processing object, the conversion coefficient comprised in this subunit of decoding by cutting apart a plurality of subunits that obtain.Order to the decoding subunit is not particularly limited, also can be according to the random order processing of decoding.
In addition, in the above-described configuration, when the decoding conversion coefficient, reference is assigned to the decoded information of each subunit of a plurality of subunits.
Decoded information refers to, for the information of the parameter value of the regulation of reproduction conversion coefficient among the code from coded data (bit string).For example, decoded information means the form for the corresponding relation of the parameter value of the regulation of reproduction conversion coefficient among the code from coded data.Again for example, decoded information is the calculating formula for the parameter value of the regulation of induced transformation coefficient among the code from coded data.
That is to say, in the above-described configuration, use the less subunit about the size than original transform unit and the decoded information stipulated, the conversion coefficient of decoding.
Thereby the situation that the decoded information of stipulating than the size based on about original transform unit is decoded and processed, can bring into play amount of information and this effect of the amount of calculation based on decoded information that can reduce decoded information.
And then can say, because can reduce to become the number of the conversion coefficient of object in decoding is processed, so the size of the scanning form that the scanning sequency of conversion coefficient is defined also can be made as less.
In addition, furtherly, can suppress lowlyer by amount, the disposal ability of the memory that needs in decoding is processed.
In addition, subunit also can be consistent with any one of coding units in the technology of non-patent literature 1,2.In this case, can be in above-mentioned coding units general by predefined VLC form, be decoded information.
In addition, according to the picture coding device formed as described above or the data structure of coded data, can bring into play the effect same with picture decoding apparatus involved in the present invention.
Another side of the present invention is described, as described below.; picture decoding apparatus involved in the present invention is in order to solve above-mentioned problem; it is characterized in that; this conversion coefficient of decoding among the coded data obtained from conversion coefficient encodes; this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains; above-mentioned picture decoding apparatus possesses: the relative position decoding unit, and its decoding becomes the relative position of the conversion coefficient of decoder object with respect to the previous conversion coefficient decoded; And position determination unit, its according to the previous above-mentioned conversion coefficient decoded the position in above-mentioned change of scale and above-mentioned relative position determine the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.
In addition, picture coding device involved in the present invention is in order to solve above-mentioned problem, it is characterized in that, conversion coefficient is encoded, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, above-mentioned picture coding device possesses: the relative position coding unit, its coding becomes the position of above-mentioned conversion coefficient of coded object with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out.
In addition, the data structure of coded data involved in the present invention is in order to solve above-mentioned problem, its purpose is, by conversion coefficient is encoded and is generated, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, in the data structure of above-mentioned coded data, the position that comprises the above-mentioned conversion coefficient that becomes coded object is with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out, thus, the picture decoding apparatus that above-mentioned coded data is decoded, according to the previous position of above-mentioned conversion coefficient in above-mentioned change of scale decoded, with above-mentioned relative position, determine the position of the above-mentioned conversion coefficient that becomes decoder object.
According to above-mentioned formation, the position according to the previous above-mentioned conversion coefficient decoded in above-mentioned change of scale and above-mentioned relative position are determined the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.Thus, can determine based on relative position relevance ground the position of conversion coefficient.In addition, change of scale refers to, the unit of regulation converted.
In the situation that above-mentioned run is encoded, because scanning sequency is according to the rules counted the length of run, even if therefore close as the relative position of the nonzero coefficient of benchmark and the two-dimensional coordinate of next nonzero coefficient in change of scale, when result also exists run elongated, the situation that exists thus the code amount to increase.
This trend easily becomes in the zone of sparse high fdrequency component particularly remarkable at conversion coefficient.In addition, run is elongated to be meaned, must prepare larger form correspondingly.
With respect to this, if utilize relative position to determine the position of conversion coefficient, can cut down in this case the code amount.
According to above-mentioned formation, owing to having utilized relative position to determine the position of conversion coefficient, therefore can reduce the code amount that should decode.
Its result, can bring into play amount of information, this effect of the amount of calculation based on decoded information that can reduce decoded information.
In addition, furtherly, can suppress lowlyer by amount, the disposal ability of the memory that needs in decoding is processed.
In addition, according to the picture coding device formed as described above or the data structure of coded data, can bring into play the effect same with picture decoding apparatus involved in the present invention.
The invention effect
Picture decoding apparatus involved in the present invention is configured to, and possesses: the change of scale cutting unit, and it is divided into a plurality of subunits by change of scale; With the conversion coefficient decoding unit, the conversion coefficient that it is decoded in above-mentioned subunit and comprise with reference to decoded information, this decoded information is for obtain the decoded information of above-mentioned conversion coefficient, the decoded information be assigned with according to each above-mentioned subunit among coded data.
In addition, picture coding device involved in the present invention is configured to, and possesses: the change of scale cutting unit, and it is divided into a plurality of subunits by change of scale; With the transform coefficients encoding unit, the conversion coefficient that it is encoded in above-mentioned change of scale and comprise with reference to coded message, this coded message is for the coded message of the above-mentioned conversion coefficient of encoding, the coded message that is assigned with according to each above-mentioned subunit.
In addition, the data structure of coded data involved in the present invention is configured to, comprise the relative position of the position of the conversion coefficient that becomes coded object with respect to the position of previous above-mentioned conversion coefficient of encoding out, the picture decoding apparatus of thus above-mentioned coded data being decoded, position and above-mentioned relative position according to the previous above-mentioned conversion coefficient decoded in above-mentioned change of scale, come definite position that becomes the above-mentioned conversion coefficient of decoder object.
Picture decoding apparatus involved in the present invention is configured to, and possesses: the relative position decoding unit, and its decoding becomes the relative position of the conversion coefficient of decoder object with respect to the previous conversion coefficient decoded; And position determination unit, its according to the previous above-mentioned conversion coefficient decoded the position in change of scale and above-mentioned relative position determine the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.
In addition, picture coding device involved in the present invention is configured to, and possesses: the relative position coding unit, its coding becomes the position of conversion coefficient of coded object with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out.
In addition, the data structure of coded data involved in the present invention is following data structure, that is: the position that comprises the conversion coefficient that becomes coded object is with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out, thus, the picture decoding apparatus that above-mentioned coded data is decoded, position and above-mentioned relative position according to the previous above-mentioned conversion coefficient decoded in above-mentioned change of scale, come definite position that becomes the above-mentioned conversion coefficient of decoder object.
According to above-mentioned picture decoding apparatus, can bring into play and can reduce for obtain the amount of information of the decoded information of above-mentioned conversion coefficient, this effect of amount of calculation based on decoded information among coded data.In addition, according to the data structure of above-mentioned picture coding device or coded data, can bring into play the effect same with above-mentioned picture decoding apparatus.
The accompanying drawing explanation
Fig. 1 is the functional block diagram that the configuration example of the TU information decoding section that possesses of the moving image decoding apparatus related about one embodiment of the present invention is meaned.
Fig. 2 is about the functional block diagram shown in the concise and to the point formation of above-mentioned moving image decoding apparatus.
Fig. 3 means that the dynamic image encoding device related by one embodiment of the present invention generates and, by the figure that the data of the coded data of above-mentioned moving image decoding apparatus decoding form, (a)~(d) means respectively the figure of picture layer, slicing layer, tree piece layer and CU layer.
Fig. 4 comes the handling process of coding/decoding coefficient to carry out illustrative flow chart about the object piece being carried out to Region Segmentation.
Fig. 5 mean 16 * 16 sizes the object piece cut apart routine figure.
Fig. 6 means the example of decoding in situation about processing in the zone that the object piece is divided into to 48 * 8 sizes.
Fig. 7 is about in the situation that there is the decoding of nonzero coefficient and remaining 1 regional zero coefficient nothing but of decoding to process in 3 decoding zones in the middle of 4 zones of decoding that the object piece comprises, carrying out illustrative figure.
Fig. 8 is the figure that the example about change scan method according to the regional position of decoding is meaned.
Fig. 9 is the figure that the example about the object piece being divided into to the zone of non-square is meaned.
Figure 10 is the functional block diagram that the configuration example of the dynamic image encoding device related about one embodiment of the present invention is meaned.
Figure 11 is the block diagram that the configuration example of the variable-length encoding part that possesses about above-mentioned dynamic image encoding device is meaned.
Figure 12 means for will { run, the group of the parameter of level} is transformed into the figure of an example of the VLC form of Code Number (bit string).
Figure 13 is about above-mentioned VLC form is set up to the figure that the example in corresponding situation is meaned with each decoding zone.
Figure 14 means the figure of the example of dynamic optimization.
Figure 15 is the functional block diagram that the configuration example of the TU information decoding section related about other execution modes of the present invention is meaned.
Figure 16 is that the handling process about specify the coding/decoding nonzero coefficient by relative position is carried out illustrative flow chart.
Figure 17 means the figure of the execution example that the decoding of above-mentioned TU information decoding section is processed.
Figure 18 is the functional block diagram that the configuration example of the TU information coding section related about other execution modes of the present invention is meaned.
Figure 19 is the figure that the example about the coding of the nonzero coefficient based on the relative position appointment is meaned.
Figure 20 means relative position pattern that above-mentioned TU information decoding section the possesses figure by the configuration example of form.
Figure 21 is that the zone about the high fdrequency component side in the object piece by shown in Figure 17, Figure 19 further is subdivided into the figure that the example in 3 zones is meaned.
Figure 22 means to set up with above-mentioned 3 zones an example of corresponding VLC form.The value that this figure (a) is illustrated in x or y does not become the VLC form be referenced in the situation of the above positive value of regulation.In addition, the value that this figure (b) is illustrated in x or y does not become the VLC form be referenced in the situation of the following negative value of regulation.
Figure 23 carries out the flow chart that an example of the handling process of coding/decoding is meaned when switching 2 processing.
Figure 24 carries out illustrative figure about the processing of 64 coefficients of the low frequency component side in coding/decoding object piece only.This figure (a) means the situation of inter prediction, (b) means the situation of infra-frame prediction.
Figure 25 is the figure that the example about the data structure of coefficient coding data is meaned.
Figure 26 is that the situation about cutting apart about carrying out 2 levels is carried out illustrative figure.
Figure 27 means the performance example (quaternary tree performance) of the labelled tree of cutting apart situation and coefficient distribution situation of the object piece shown in Figure 26.
Figure 28 is the flow chart that the example about the decoding handling process in recursion zone is meaned.
Figure 29 is the figure that another example about the data structure of coefficient coding data is meaned.
Figure 30 means the figure of the concrete example of the coefficient coding data shown in Figure 29.
Figure 31 is the figure that the another example about the data structure of coefficient coding data is meaned.
Figure 32 means the figure of the concrete example of the coefficient coding data shown in Figure 31.
Figure 33 is that one of the 1st level coding region about shown in Figure 26 carries out the figure meaned in detail.
Figure 34 is that one of the 1st level coding region about shown in Figure 26 carries out the figure meaned in detail.
Figure 35 means the figure of the partitioning scheme in the object piece of 16 * 16 sizes.
Figure 36 means the figure of the partitioning scheme in the object piece of 16 * 16 sizes.
Figure 37 is the figure shown in the formation of the dispensing device about having carried above-mentioned dynamic image encoding device and the receiving system that has carried above-mentioned moving image decoding apparatus.(a) mean to have carried the dispensing device of dynamic image encoding device, (b) mean to have carried the receiving system of moving image decoding apparatus.
Figure 38 is the figure shown in the formation of the tape deck about having carried above-mentioned dynamic image encoding device and the replay device that has carried above-mentioned moving image decoding apparatus.(a) mean to have carried the tape deck of dynamic image encoding device, (b) mean to have carried the replay device of moving image decoding apparatus.
Embodiment
(1) execution mode 1
With reference to Fig. 1~Figure 14, one embodiment of the present invention is described.At first, with reference to Fig. 2, the summary of moving image decoding apparatus (picture decoding apparatus) 1 and dynamic image encoding device (picture coding device) 2 is described.Fig. 2 means the functional block diagram of the concise and to the point formation of moving image decoding apparatus 1.
H.264/MPEG-4AVC moving image decoding apparatus 1 shown in Fig. 2 and dynamic image encoding device 2 have installed adopted technology in standard, (Video Coding Expert Group: the joint development Video coding expert group) is adopted technology in KTA software with codec to VCEG, TMuC (Test Model under Consideration: the test model in consideration) adopted technology in software, and by HEVC (the High-Efficiency Video Coding: technology efficient video coding) proposed of the codec as its successor.
To the coded data #1 after moving image decoding apparatus 1 input motion picture coding device 2 encoding moving pictures.1 couple of coded data #1 inputted of moving image decoding apparatus is decoded, and exports moving image #2 to outside.Prior to the detailed description of moving image decoding apparatus 1, and the formation of description code data #1 below.
(formation of coded data)
Use Fig. 3, to being generated by dynamic image encoding device 2 and being described by the configuration example of the coded data #1 of moving image decoding apparatus 1 decoding.Coded data #1 comprises sequence and a plurality of pictures that form sequence illustratively.
Fig. 3 illustrates the structure of the level below the picture layer in coded data #1.Fig. 3 (a)~(d) means respectively the picture layer that picture PICT is stipulated, the slicing layer that section S is stipulated, the tree piece layer that tree piece (Tree block) TBLK is stipulated, coding units (the Coding Unit to comprising in tree piece TBLK; The figure of the CU layer of CU) being stipulated.
(picture layer)
In picture layer, process the picture PICT (following also referred to as the object picture) of object in order to decode, the set of having stipulated the data of 1 reference of moving image decoding apparatus.Picture PICT comprises picture header PH and section S as shown in Fig. 3 (a) 1~S nS(sum that NS is the section that comprises in picture PICT).
In addition, below without distinguishing section S 1~S nSeach situation under, the subscript of sometimes omitting code is described.In addition, the data that comprise in the coded data #1 about following explanation, be subscripting other data too.
For the coding/decoding method of decision objects picture, comprise the coding parameter group of 1 reference of moving image decoding apparatus in picture header PH.For example, the example that the coding mode information (entropy_coding_mode_flag) that is illustrated in the pattern of Variable Length Code used when dynamic image encoding device 2 coding is the coding parameter that comprises in picture header PH.
The low complex degree Entropy Coding) or CAVLC (Context-based Adaptive Variable Length Coding: the self-adapting changeable long codes of based on the context) encode in the situation that entropy_coding_mode_flag is 0, this picture PICT is by LCEC (Low Complexity Entropy Coding:.In addition, in the situation that entropy_coding_mode_flag is 1, this picture PICT is by CABAC (Context-based Adaptive Binary Arithmetic Coding: the adaptive binary arithmetic coding of based on the context) encode.
In addition, picture header PH is also referred to as image parameters collection (PPS:Picture Parameter Set).
(slicing layer)
In slicing layer, process the section S (also referred to as object slice) of object in order to decode, the set of having stipulated the data of 1 reference of moving image decoding apparatus.Section S comprises slice header SH and tree piece TBLK as shown in Fig. 3 (b) 1~TBLK ncthe sequence of (sum of the tree piece that NC comprises in S for section).
For the coding/decoding method of decision objects section, comprise the coding parameter group of 1 reference of moving image decoding apparatus in slice header SH.The example that the slice type appointed information (slice_type) that slice type is carried out to appointment is the coding parameter that comprises in slice header SH.
As by the assignable slice type of slice type appointed information, enumerate (1) and when coding, only using the I section of infra-frame prediction, (2) when coding, using the P section of single directional prediction or infra-frame prediction, (3) to use the B section etc. of single directional prediction, bi-directional predicted or infra-frame prediction when coding.
The filter parameter that in slice header SH, also can comprise in addition, loop filter (not shown) the institute reference possessed by moving image decoding apparatus 1.
(tree piece layer)
In tree piece layer, process the tree piece TBLK (following also referred to as the object tree piece) of object in order to decode, the set of having stipulated the data of 1 reference of moving image decoding apparatus.
Tree piece TBLK comprises tree piece header TBLKH and coding units information CU 1~CU nL(NL is for setting the sum of the coding units information comprised in piece TBLK).At this, at first the relation between specification tree piece TBLK and coding units information CU, as described below.
Tree piece TBLK is divided into the unit of the piece size of respectively processing use for determining infra-frame prediction or inter prediction and conversion.
The said units of tree piece TBLK is cut apart by recursion type 4 fork trees and divided.Below will cut apart the tree structure obtained by these recursion type 4 fork trees and be called code tree (coding tree).
Below, will be that the unit that leaf (1eaf) is corresponding carrys out reference as coding nodes (coding node) with the node of the end of code tree.In addition, because coding nodes becomes the base unit that coding is processed, therefore following by coding nodes also referred to as coding units (CU).
That is to say coding units information (below be called CU information) CU 1~CU nLinformation corresponding to each coding nodes (coding units) of obtaining with setting that piece TBLK carries out that recursion ground 4 fork trees are cut apart.
In addition, the root of code tree (root) is set up corresponding with tree piece TBLK.In other words, tree piece TBLK comprises that with recursion ground the upper node foundation of the tree structure that 4 fork trees of a plurality of coding nodes are cut apart is corresponding.
In addition, the size of each coding nodes is all directly in length and breadth half of the size of affiliated coding nodes (, the unit of the node of upper 1 level of this coding nodes) of this coding nodes.
In addition, the obtainable Size dependence of each coding nodes size appointed information that comprise in the sequence parameter set SPS of coded data #1, coding nodes and maximum hierarchical depth (maximum hierarchical depth).For example, in the situation that tree piece TBLK is of a size of 64 * 64 pixels and maximum hierarchical depth is 3, the coding nodes in the following level of this tree piece TBLK can obtain the size of 4 kinds, i.e. central any one of 64 * 64 pixels, 32 * 32 pixels, 16 * 16 pixels and 8 * 8 pixels.
(tree piece header)
For the coding/decoding method of decision objects tree piece, the coding parameter that comprises 1 reference of moving image decoding apparatus in tree piece header TBLKH.Particularly, as shown in Fig. 3 (c), comprise appointed object tree piece and be divided into the tree piece carve information SP_TBLK of parted pattern of each CU and the quantization parameter difference △ qp (qp_delta) that specifies the size of quantization step.
Tree piece carve information SP_TBLK means the information for the code tree of cut tree piece, the information of the position in shape, size and the object tree piece of each CU specifically comprised in appointed object tree piece.
In addition, tree piece carve information SP_TBLK explicitly does not comprise CU yet shape, size.For example, tree piece carve information SP_TBLK also can mean the set of four marks of cutting apart (split_coding_unit_flag) is carried out in the subregion of object tree piece integral body or tree piece.In this case, by shape, the size of tree piece, can determine thus shape, the size of each CU by also.
In addition, quantization parameter difference △ qp be quantization parameter qp in the object tree piece, and the quantization parameter qp ' in the tree piece immediately be encoded previously of this object tree piece between difference qp-qp '.
(CU layer)
Process the CU (following also referred to as object CU) of object in order to decode, the set of having stipulated the data of 1 reference of moving image decoding apparatus in the CU layer.
At this, before the explanation of the particular content of the data that comprise, the tree structure of the data that comprise in CU is described in carrying out CU information CU.Coding nodes becomes pre-assize (prediction tree; PT) and transforming tree (transform tree; The node of root TT).Pre-assize and transforming tree are described, as described below.
In pre-assize, coding nodes is divided into 1 or a plurality of prediction piece, and has stipulated respectively to predict position and the size of piece.If change a kind of expression way, predict that piece is 1 or a plurality of unduplicated zone that forms coding nodes.In addition, pre-assize comprises by above-mentioned 1 or a plurality of prediction piece obtained of cutting apart.
Carry out prediction processing according to each this prediction piece.Below, will as the prediction unit the prediction piece also referred to as the prediction (the prediction unit of unit; PU).
There are these two kinds of the situations of the situation of infra-frame prediction and inter prediction substantially in the kind of cutting apart in pre-assize.
In the situation that be infra-frame prediction, dividing method exist 2N * 2N (with coding nodes be same size) and N * N.
In addition, in the situation that be inter prediction, dividing method exist 2N * 2N (with coding nodes be same size), 2N * N, N * 2N and N * N etc.
In addition, in transforming tree, coding nodes is divided into 1 or a plurality of transform block, and has stipulated position and the size of each transform block.If change a kind of expression way, transform block is 1 or a plurality of unduplicated zone that forms coding nodes.In addition, transforming tree comprises by above-mentioned 1 or a plurality of transform block obtained of cutting apart.
Carry out conversion process according to each this transform block.Below, will be as the transform block of unit of conversion also referred to as change of scale (transform unit; TU).
(data structure of CU information)
Next, with reference to Fig. 3 (d), the particular content of the data that comprise in CU information CU is described.As shown in Fig. 3 (d), CU information CU particularly comprises skip flag SKIP, PT information PTI and TT information TTI.
Skip flag SKIP means the mark of whether applying dancing mode about the PU of object, in the situation that the value of skip flag SKIP is 1, is that dancing mode is applied to object CU, PT information PTI and TT information TTI in its CU information CU are omitted.In addition, skip flag SKIP is omitted in the I section.
PT information PTI is the information relevant to the PT comprised in CU.In other words, PT information PTI is and the set of each relevant information of 1 that comprises in PT or a plurality of PU, is referenced when by moving image decoding apparatus 1 generation forecast image.PT information PTI comprises prediction type PType and information of forecasting PInfo as shown in Fig. 3 (d).
Prediction type PType refer to have made to order for the corresponding predicted picture generation of object PU method be the information that adopts infra-frame prediction or adopt inter prediction.
Which kind of Forecasting Methodology information of forecasting PInfo specifies according to prediction type PType, and by intraframe prediction information or inter prediction information structure.Below, the PU that applied infra-frame prediction, also referred to as PU in frame, will have been applied to the PU of inter prediction also referred to as interframe PU.
The information of shape, size and position that in addition, information of forecasting PInfo has comprised appointed object PU.As mentioned above, take PU carries out the generation of predicted picture as unit.Detailed content about information of forecasting PInfo will be narrated in the back.
TT information TTI is the information relevant to the TT comprised in CU.In other words, TT information TTI is and the set of each relevant information of 1 that comprises in TT or a plurality of TU, is referenced when by moving image decoding apparatus 1 decoded residual data.In addition, below, sometimes also TU is called to piece.
TT information TTI comprises that as shown in Fig. 3 (d) appointed object CU is divided into the TT carve information SP_TU of parted pattern of each transform block and TU information TUI1~TUINT (sum that NT is the piece that comprises in object CU).
The information of the position in the shape of each TU that TT carve information SP_TU specifically comprises for decision objects CU, size and object CU.For example, TT carve information SP_TU can and mean that the information (trafoDepth) of the degree of depth that it is cut apart realizes by the information of cutting apart (split_transform_unit_flag) of the node that means whether to be become object.
In addition, for example, in the situation that CU is of a size of 64 * 64, can obtain the size of 32 * 32 pixels to 2 * 2 pixels by cutting apart each TU obtained.
TU information TUI 1~TUI nTeach relevant other information to 1 that comprises in TT or a plurality of TU.For example, TU information TUI comprises the quantitative prediction residual error.
Each quantitative prediction residual error is to implement by 2 pairs of object pieces as the piece of processing object of dynamic image encoding device the coded data that following processing 1~3 generates.
Process 1: the prediction residual deducted predicted picture is carried out to dct transform (Discrete Cosine Transform: discrete cosine transform) among the coded object image;
Process 2: to being quantized by processing 1 conversion coefficient obtained;
Process 3: to by processing 2 conversion coefficients that quantize, carrying out Variable Length Code;
Wherein, above-mentioned quantization parameter qp is illustrated in the size (QP=2 of quantization step QP used when dynamic image encoding device 2 quantization transform coefficients qp/6).
(information of forecasting PInfo)
As mentioned above, there are these 2 kinds of inter prediction information and intraframe prediction information in information of forecasting PInfo.
Comprise in inter prediction information and carry out the coding parameter be referenced when predicted picture between delta frame when moving image decoding apparatus 1 by inter prediction.More specifically, in inter prediction information, comprise appointed object CU be divided into each interframe PU parted pattern interframe PU carve information and with the corresponding inter prediction parameter of each interframe PU.
In the inter prediction parameter, comprise with reference to image index, estimated motion vector index and motion vector residual error.
On the other hand, comprise in intraframe prediction information and carry out the coding parameter be referenced when predicted picture in delta frame when moving image decoding apparatus 1 by infra-frame prediction.More specifically, in intraframe prediction information, comprise in the frame that appointed object CU is divided into the parted pattern of PU in each frame the PU carve information and with each frame in the corresponding infra-frame prediction parameter of PU.The infra-frame prediction parameter be used to specify with each frame in the parameter of the corresponding intra-frame prediction method of PU (predictive mode).
(moving image decoding apparatus)
Below, with reference to Fig. 1~Fig. 5, to present embodiment, the formation of related moving image decoding apparatus 1 describes.
(summary of moving image decoding apparatus)
Moving image decoding apparatus 1 generates predicted picture according to each PU, by the predicted picture by generated and the decoded prediction residual addition from coded data #1, generates thus decoded picture #2, and exports generated decoded picture #2 to outside.
At this, with reference to by the coding parameter that coded data #1 is decoded and obtains, carry out the generation of predicted picture.Coding parameter refers to, the parameter of reference for the generation forecast image.In coding parameter, except the Prediction Parameters of the motion vector be referenced, the predictive mode be referenced etc., also comprise size, shape, size, shape and the original image of piece and the residual error data between predicted picture etc. of PU in intra-frame prediction in inter-picture prediction.Below, the set of the full detail except above-mentioned residual error data in the middle of the information comprised in coding parameter is called to less important information.
In addition, below enumerating the example that above-mentioned PU and TU are the situation of the size identical with CU describes.Yet, be not limited thereto, also can be applied to PU and the TU situation for the less unit than CU.
Picture (frame), section, tree piece, piece and the PU that in addition, below will become the object of decoding are called respectively object picture, object slice, object tree piece, object piece and object PU.
In addition, the size of tree piece is for example 64 * 64 pixels, and the size of PU is such as being 64 * 64 pixels, 32 * 32 pixels, 16 * 16 pixels, 8 * 8 pixels, 4 * 4 pixels etc.Yet these sizes are simple illustration, the size of tree piece and PU can be also the size beyond the above size illustrated.
(formation of moving image decoding apparatus)
Referring again to Fig. 2, the concise and to the point formation of moving image decoding apparatus 1 is described, as described below.Fig. 2 is about the functional block diagram shown in the concise and to the point formation of moving image decoding apparatus 1.
As shown in Figure 2, moving image decoding apparatus 1 possesses: Variable Length Code demultiplexing section 11, TU information decoding section 12, re-quantization and inverse transformation section 13, predicted picture generating unit 14, adder 15 and frame memory 16.
[Variable Length Code demultiplexing section]
Variable Length Code demultiplexing section 11 carries out demultiplexing by the coded data #1 that is input to 1 frame in moving image decoding apparatus 1, is separated into thus in the hierarchical structure shown in Fig. 3 the various information that comprise.For example, Variable Length Code demultiplexing section 11, with reference to the information comprised in various headers, is separated into section, tree piece successively by coded data #1.
At this, comprise the corresponding information in position in size, shape and the object slice that (1) and object picture be divided into the corresponding information of dividing method of section and (2) and the tree piece that belongs to object slice in various headers.
And Variable Length Code demultiplexing section 11, with reference to the tree piece carve information SP_TBLK comprised in tree piece header TBLKH, is divided into CU by the object tree piece.In addition, Variable Length Code demultiplexing section 11 obtains TT information TTI and PT information PTI about object CU.
Order according to the rules of the TU information TUI comprised in the TT information TTI that Variable Length Code demultiplexing section 11 will obtain about object CU and be supplied to TU information decoding section 12.In addition, the PT information PTI that Variable Length Code demultiplexing section 11 will obtain about object CU is supplied to predicted picture generating unit 14.
[TU information decoding section]
12 couples of TU information TUI from 11 supplies of Variable Length Code demultiplexing section of TU information decoding section are decoded, and generate the complete TU information TUI ' of decoding.
For example, TU information decoding section 12 is about the object piece and from decoding quantitative prediction residual error among TU information TUI.At this, with the corresponding quantitative prediction residual error of object piece, the form that can be aligned to the conversion coefficient quantized two-dimentional matrix shows.Below, the performance of the matrix of the two dimension of quantized conversion coefficient is called to coefficient matrix.
The complete TU information TUI' that will decode of TU information decoding section 12 is supplied to re-quantization and inverse transformation section 13.In addition, about the detailed action of TU information decoding section 12, will narrate in the back.
[re-quantization and inverse transformation section]
The complete TU information of the complete decoding of the decoding TUI ' based on from 12 supplies of TU information decoding section of re-quantization and inverse transformation section 13, carry out re-quantization and the inverse transformation of quantitative prediction residual error about object CU according to each piece.Re-quantization and inverse transformation section 13 carry out re-quantization and inverse dct transform (Inverse Discrete Cosine Transform: inverse discrete cosine transform), recover thus the prediction residual D of each pixel about each object PU by the quantitative prediction residual error to comprising in the complete TU information TUI ' that decodes.Re-quantization and inverse transformation section 13 are supplied to adder 15 by the prediction residual D recovered.
[predicted picture generating unit]
Predicted picture generating unit 14 each PU about comprising in object CU, be local decoder image P ' with reference to the complete image of decoding of the periphery of this PU, by infra-frame prediction or inter prediction, generates predicted picture Pred.The predicted picture Pred that predicted picture generating unit 14 will generate about object CU is supplied to adder 15.
[adder]
Adder 15 is by, from the predicted picture Pred of predicted picture generating unit 14 supply with from the prediction residual D addition of re-quantization and 13 supplies of inverse transformation section, generating and the corresponding decoded picture P of object CU thus.
[frame memory]
Record successively decoded decoded picture P in frame memory 16.In frame memory 16, at the time point of decoder object tree piece, record the decoded picture for example, with the All Trees piece of first being decoded than this object tree piece (, according to raster scan order All Trees piece) formerly corresponding.
In addition, in moving image decoding apparatus 1, All Trees piece in being directed to image and the decoded picture of setting block unit generate to be processed the time point be through with, the decoded picture #2 corresponding with the coded data #1 of 1 frame that is input to moving image decoding apparatus 1 is output to outside.
(about the scanning of coefficient)
At this, before the detailed formation of explanation TU information decoding section 12, the scanning of coefficient is described.
In the coding of coefficient, in dynamic image encoding device 2, order according to the rules scans the coefficient matrix of the set of the coefficient in the indicated object piece.
Scanning refers to, the coordinate transform of the coefficient in coefficient matrix is become to the processing of the scanning sequency index of one dimension.Coefficient after conversion is arranged ground by one dimension and is preserved, keeps.In scanning, can adopt in the past known sawtooth scan order etc.
In coding is processed, at first in the object piece, according to the sawtooth scan order, scan DC coefficient from the being positioned at top-left position coefficient to the high fdrequency component that is positioned at the position, bottom right.
In addition, the value of the coefficient of high fdrequency component has zero or, close to zero tendency, on the other hand, the value of the coefficient of low frequency component has non-zero or is worth high this characteristic of large possibility.Thereby sawtooth scan becomes the scanning sequency that early scans the coefficient of low frequency component.
In addition, in dynamic image encoding device 2, carry out the coding of coefficient and process after scanning.Carry out the coding of coefficient processes according to the order contrary with scanning.
Below, the sawtooth scan by this processing sequence also referred to as reverse order.That is to say, with respect to coefficient matrix, according to the processing of encoding of the sawtooth scan of reverse order.In addition, in the situation that the coding processing sequence is described, for convenience's sake, coefficient matrix is described as basis.
Because the situation that the high fdrequency component intermediate value is " 0 " is more, so, in the coding of present embodiment is processed, omit the coding of this zero coefficient, from the DC coefficient, the coefficient of last non-zero of take carries out as basic point.
In addition, above, for convenience of explanation, show the scanning example of 8 * 8 coefficient matrix, can say that the situation about scanning 4 * 4 coefficient matrix is also same.In addition, also scanning sequency can be altered to adaptively to method beyond sawtooth scan, such as with the same order of raster scan etc.
(about the coding of coefficient)
Below, the coefficient coding data that are encoded in dynamic image encoding device 2 are described.As mentioned above, the last nonzero coefficient of at first encoding in coding is processed.Next, under defined terms, carry out the coding based on distance of swimming pattern.When distance of swimming pattern finishes when meeting distance of swimming pattern termination condition, remaining coefficient is carried out to the coding based on amplitude mode.Thus one, whole coefficients of encoding.
Below, the data that are encoded under last nonzero coefficient, distance of swimming pattern and amplitude mode are described.
[last nonzero coefficient]
In last nonzero coefficient, the sign symbol sign of the amplitude level of the position last_pos of last nonzero coefficient, last nonzero coefficient, last nonzero coefficient is encoded.At this, the amplitude of coefficient refers to, the absolute value of coefficient.Last_pos gets the value of the form of scanning sequency index.In the situation that being the 10th (the DC coefficient is made as to the 1st), its value, scanning sequency is " 1 " at last nonzero coefficient, as described below.
last_pos:9
level:1,sign:1
[distance of swimming pattern]
In the situation that carry out distance of swimming pattern under defined terms, carry out following coding.Distance of swimming pattern refers to, the pattern that the number (0 distance of swimming) of continuous zero coefficient is encoded.The coding carried out under which kind of condition based on distance of swimming pattern will be narrated in the back.
Under distance of swimming pattern, according to each coefficient encode the amplitude level of 0 distance of swimming run, coefficient and the sign symbol sign of coefficient.The nonzero coefficient that will encode secondarily in the situation that and between previous nonzero coefficient clamping one 0 and value be " 1 ", as described below.
run:1,level:1,sign:0
In addition, in non-patent literature 1,2, will occur that the situation that level is the coefficient more than 2 is used as one of termination condition of distance of swimming pattern.
[amplitude mode]
Amplitude mode refers to, even if be zero coefficient, and also each pattern of being encoded in ground.Under amplitude mode, according to each coefficient, come the amplitude level of code coefficient and the sign symbol sign of coefficient.In the situation that the value of the coefficient that will encode is zero coefficient (value is " 0 ") for " 6 " and the coefficient that will encode, distinguish as described below.
level:6,sign:1
level:0
[remarks]
In addition, the coded data shown in above is simple illustration, also can be by encoding from above-mentioned different title or definition as grammatical feature.For example, the grammatical feature (with reference to " level_magnitude_minus2 " of non-patent literature 1,2) that the amplitude level of the coefficient of distance of swimming pattern also can be encoded as isLevelOne (1evel is 1) and be appended to this level_magnitude_minus2 (coefficient value is the situation more than 2, and what encode is the amplitude deducted after 2).
In addition, level also can be encoded as the grammatical feature of last_pos_level about last nonzero coefficient.
In addition, the level of non-distance of swimming pattern also can be encoded as the grammatical feature of level_magnitude.
(TU information decoding section)
Secondly, use Fig. 1, the formation of TU information decoding section 12 is described in further detail.Fig. 1 is the functional block diagram that the configuration example about TU information decoding section 12 is meaned.
In addition, the formation of the data relevant to coefficient in the middle of the coded data that below explanation TU information decoding section 12 comprises for the TU information TUI that decodes.In other words, the quantitative prediction residual error that below explanation TU information decoding section 12 is encoded out for decoding, be the formation of coefficient coding data.
Yet, be not limited to this, data beyond the coefficient coding data that TT information decoding section 12 comprises in also can the decoding and coding data, such as less important information etc.
As shown in Figure 1, TU information decoding section 12 possesses VLC form TBL11, Region Segmentation section (change of scale cutting unit) 121 and regional decoding section (conversion coefficient decoding unit) 122.
In addition, following the decode example of TU information TUI the complete TU information TUI ' of output decoding of explanation TU information decoding section 12 pieces of the object about 16 * 16 sizes.Yet, being not limited to this, the size of the object piece that TU information decoding section 12 decodes can be also 64 * 64,32 * 32 etc.
VLC form TBL11 is that the Code Number that can mutually convert with bit string (code) and the parameter that should decode are established corresponding form.VLC form TBL11 is referenced in the decoding of regional decoding section 122 is processed.
VLC form TBL11 adopts illustratively about the coding units of 8 * 8 sizes and prescriber forms.That is, VLC form TBL11 adopts is the form of 8 * 8 sizes less than the change of scale that becomes input (or coding units) 16 * 16 sizes.
In addition, the context during VLC form TBL11 processes according to decoding has been stipulated a plurality of, in order to can carry out the adaptive decoding processing.The attribute of context such as the position that can adopt the coefficient in processing, object piece (kind of the pixel of brightness/aberration etc., Forecasting Methodology) etc.
Region Segmentation section 121 is divided into a plurality of zones by the object piece.Below, will be called by each zone obtained cutting apart of Region Segmentation section 121 the decoding zone.
In addition, following declare area cutting part 121 is divided into the object piece of 16 * 16 sizes the example in the decoding zone of 48 * 8 sizes.Yet, being not limited to this, the gimmick of cutting apart of Region Segmentation section 121 also can adopt various gimmicks.About its variation, will describe in detail in the back.
Regional decoding section 122 is about each of the decoding zone that obtains by Region Segmentation section 121 cutting object pieces, with reference to the VLC form TBL11 stipulated according to this regional size of the decoding processing of decoding.
The example that the VLC form TBL11 that based on context following declare area lsb decoder 122 is stipulated with reference to the coding units about 8 * 8 sizes decodes and processes.
Specifically being constructed as follows of regional decoding section 122 is described., regional decoding section 122 possesses last nonzero coefficient lsb decoder 101, distance of swimming mode decoding section 102 and amplitude mode lsb decoder 103.
Last nonzero coefficient lsb decoder 101 is about decoding zone (hereinafter referred to as doing object area decoder territory) last nonzero coefficient of decoding of the object that becomes decoding.More specifically, last_pos, the level and the sign that in the regional corresponding coefficient coding data of 101 decodings of last nonzero coefficient lsb decoder and object decoding, comprise.
Distance of swimming mode decoding section 102 is about object decoding zone, the coefficient be encoded with distance of swimming pattern from decoding among the coefficient coding data.That is, distance of swimming mode decoding section 102 with reference to and the corresponding VLC form of context TBL11, run, the level and the sign that from decoding among the coefficient coding data, with distance of swimming pattern, are encoded.Below, the decoding that distance of swimming mode decoding section 102 is carried out is processed and is called the processing of distance of swimming mode decoding.
In addition, distance of swimming mode decoding section 102 carries out the processing of distance of swimming mode decoding repeatedly, until meet distance of swimming pattern termination condition.As the condition of distance of swimming pattern termination condition, surpassed the situation etc. of the coefficient of the situation of threshold value, the defined amount of having decoded such as the value of enumerating the coefficient decoded.
And, in the situation that met distance of swimming pattern termination condition, the decoding that distance of swimming mode decoding section 102 allows amplitude mode lsb decoder 103 start based on amplitude mode.
Amplitude mode lsb decoder 103 is about object decoding zone, the coefficient be encoded with amplitude mode from decoding among the coefficient coding data.That is, amplitude mode lsb decoder 103 with reference to and the corresponding VLC form of context TBL11, the level and the sign that from decoding among the coefficient coding data, with amplitude mode, are encoded.
Below, the decoding that amplitude mode lsb decoder 103 is carried out is processed and is called amplitude mode decoding processing.In addition, amplitude mode lsb decoder 103 carries out the amplitude mode decoding repeatedly to be processed, until the coefficient of decoding DC component.
In addition, when regional decoding section 122 has completed the decoding processing in each object decoding zone about the object piece, output comprises the complete TU information of the decoding TUI ' that processes the coefficient obtained by this decoding.
(handling process)
Secondly, use Fig. 4, the decoding in TU information decoding section 12 is processed and described.Fig. 4 comes the flow process of the treatment S 10 of coding/decoding coefficient to carry out illustrative flow chart about the object piece being carried out to Region Segmentation.
In addition, so the coding of dynamic image encoding device 2 is processed difference between processing with the decoding of moving image decoding apparatus 1, process processings of still decode encode, flow process in addition is roughly the same in encode processing and the processing of decoding.Thus, in Fig. 4, the coding processing of dynamic image encoding device 2 and the decoding of moving image decoding apparatus 1 are shown in the lump and process.
As shown in Figure 4, Region Segmentation section 121 cutting object pieces (S11) in moving image decoding apparatus 1 at first.
Next, enter and the corresponding circulation LP1 (S12) in each decoding zone of having cut apart.In circulation LP1, the coefficient (S13) that regional decoding section 122 comprises in the desorption coefficient coded data about object decoding zone.
Particularly, the last nonzero coefficient at first last nonzero coefficient lsb decoder 101 decoder object decoding zones.
Next, distance of swimming mode decoding section 102 carries out the processing of distance of swimming mode decoding, until meet distance of swimming pattern termination condition.
After distance of swimming pattern is through with, amplitude mode lsb decoder 103 carries out the amplitude mode decoding to be processed.
Thus one, when finishing dealing with the corresponding decoding in object decoding zone, turn back to the front (from S14 to S12) of circulation LP1, about next object decoding zone and the processing of further decoding.
Decode regional decoding while finishing dealing with when each object of piece, and circulation LP1 finishes.Then, carrying out Region Segmentation comes the treatment S 10 of desorption coefficient to finish.
(concrete example)
Secondly, except Fig. 5 and Fig. 6, also with reference to the flow chart shown in Fig. 4, the concrete example that the decoding in TU information decoding section 12 is processed is shown.Fig. 5 means object piece (change of scale) BLK of 16 * 16 sizes.In addition, Fig. 6 means the example of decoding in situation about processing in the zone that object piece BLK is divided into to 48 * 8 sizes.
In S11, Region Segmentation section 121 is divided into the object piece BLK of 16 * 16 sizes decoding zone (subunit) R11~R14 of 48 * 8 sizes as shown in Figure 5.
Then, in the S13 of circulation LP1, regional decoding section 122, according to the order of decoding regional R11, R12, R13 and R14, is processed the zone of respectively decoding shown in Fig. 6.
In addition, in Fig. 6, the arrow shown in the regional R11~R14 that decodes is interior means scanning sequency.That is, the decode scanning sequency of regional R11~R14 is sawtooth scan.
In S13, at first, last nonzero coefficient lsb decoder 101 is according to the scanning sequency last nonzero coefficient of decoding.Then, by distance of swimming mode decoding, process and amplitude mode decoding processing, till playing the coefficient of DC component according to the sawtooth scan decoding of reverse order from last nonzero coefficient.
When the decoding of the regional R11 of decoding while finishing dealing with, below about decode regional R12, R13 and the R14 processing of similarly decoding.
Like this, in the regional R11~R14 of decoding, for the coding units of 8 * 8 sizes with the processing of decoding of existing decoding process.That is, in processing, the decoding of the regional R11~R14 of decoding can adopt the last nonzero coefficient of decoding to carry out the existing decoding process of the processing of distance of swimming mode decoding and amplitude mode decoding processing.
(variation)
Preferred several variation of following account for motion picture decoding apparatus 1.
The judgement had or not of variation 1-1:[nonzero coefficient]
In the coefficient coding data, in the situation of the nonzero coefficient mark had or not that means nonzero coefficient of having encoded about each decoding zone, regional decoding section 122 also can be according to the processing of decoding that has or not of nonzero coefficient.
In the situation that, according to the processing of decoding that has or not of nonzero coefficient, form like that as follows regional decoding section 122 and get final product.
That is, at first regional decoding section 122 decoding nonzero coefficient marks, judge having or not of nonzero coefficient.Then, nonzero coefficient is marked in object decoding zone and means in the situation without nonzero coefficient, and the decoding that regional decoding section 122 skips in this subject area is processed.
With Fig. 7, describe particularly.Fig. 7 be illustrated in decoding regional R11, R13 and R14 have nonzero coefficient and in the situation that in the regional R12 of decoding without the example of nonzero coefficient.That is, decode " 0 " expression shown in regional R12 without nonzero coefficient.
In the nonzero coefficient mark, for example, in the situation that be encoded " 0 " without nonzero coefficient, in the situation that nonzero coefficient be encoded " 1 " is arranged.
In the example depicted in fig. 7, in regional R11, R13 and R14, nonzero coefficient is arranged, in regional R12 without nonzero coefficient.Thereby, nonzero coefficient mark be encoded " 1011 ".In addition, can the model of nonzero coefficient mark " 1011 " etc. directly be fixed to long codes with 4 yet, and carry out and the corresponding Variable Length Code of the occurrence frequency of model.
The following decoding of carrying out like that the example shown in Fig. 7 is processed.At first, about the regional R11 that decodes, because encoded nonzero coefficient mark " 1 ", the processing so 122 couples of regional R11 of decoding of regional decoding section decode.
Next, about the regional R12 that decodes, because encoded nonzero coefficient mark " 0 ", so regional decoding section 122 skips, the decoding of the regional R12 that decodes is processed.
About remaining decode regional R13 and R14, because similarly encoded and meaned to have the nonzero coefficient mark " 1 " of nonzero coefficient with the regional R11 of decoding, so each processing of decoding of 122 couples of regional R13 of decoding of regional decoding section and R14.
According to above formation, because can omit the unnecessary decoding processing in the decoding area unit, so can seek the raising for the treatment of effeciency.
Variation 1-2:[carrys out changing process method according to the regional position of decoding]
[1] change of scan method
In the above description, although the scan mode of respectively decoding regional employing is sawtooth scan, also can change scan method according to the regional position of decoding.
For example, as shown in Figure 8, adopt horizontal sweep among also in object piece BLK, being positioned at the regional R12 of top-right decoding.In addition, as shown in Figure 8, adopt vertical scanning among also can in object piece BLK, being positioned at the decoding zone R13 of lower left.In addition, as shown in Figure 8, also can in the regional R11 of decoding and R14, adopt sawtooth scan.
[2] pattern limits
In the above description, although the decoding of decoding regional has carried out in processing that distance of swimming mode decoding is processed and the amplitude mode decoding is processed at each, also can be configured to and carry out the processing of distance of swimming mode decoding or amplitude mode decoding according to the regional position of decoding and process central any one.
For example, be positioned at the regional R14 of bottom-right decoding of object piece BLK, owing to being high fdrequency component, therefore there is the more trend of number of zero coefficient.Thereby, in the regional R14 of decoding, also can utilize the processing of processing to decode of distance of swimming mode decoding.
[3] computational methods of change VLC form, Code Number
In distance of swimming mode decoding is processed, also can change VLC form that will reference, the computational methods of Code Number according to the regional position of decoding.The bit string that for example, also can change for meaning Code Number according to each regional position of decoding is transformed into { run, the VLC form of the group of the parameter of level}.
For example, form like that as follows the VLC form.That is, for example,, among the VLC form be referenced in the decoding zone of high fdrequency component side (zone of the decoding in Fig. 6 R12~R14), will with shorter bit string, set up corresponding as the group of level=0 in advance.In addition, for example, among the VLC form be referenced in the decoding zone of low frequency component side (zone of the decoding in Fig. 6 R11), by run, shorter value is set up corresponding with shorter bit string in advance.
There is zero coefficient and become many trend in the decoding zone of high fdrequency component side, there is the trend that nonzero coefficient becomes many, run shortens in the decoding zone of low frequency component side.
Thus, according to forming as above, can carry out with each decoding zone in the good decoding of the corresponding efficiency of distribution situation of nonzero coefficient process.That is, can reduce the code amount that should decode.The specific definition of VLC form is described with Figure 12 and Figure 13, as described below.
Figure 12 means that { run, the group of the parameter of level} is transformed into the example of the VLC form TBL11 of Code Number for inciting somebody to action.In addition, the VLC form of 2 kinds that the maximum that the figure shows run is 4 o'clock, (a) the high VLC form T1 of priority of expression level=0, (b) the low VLC form T2 of priority of expression level=0.
In addition, in VLC form T1 and VLC form T2, level=0 when the absolute value of next nonzero coefficient is " 1 ", and when the absolute value of next nonzero coefficient is greater than " 1 " level=1.
Figure 13 means above-mentioned VLC form T1 and T2 are set up to the example in corresponding situation with each decoding zone.As shown in figure 13, the regional R11 that decodes is corresponding with VLC form T2 foundation, and the regional R12~R14 that decodes sets up corresponding with VLC form T1.
The decoding handling process of distance of swimming mode decoding section 102 is described, as described below.At first, distance of swimming mode decoding section 102 is according to the corresponding relation shown in Figure 13, in the distance of swimming mode decoding of the regional R11 of decoding is processed with reference to VLC form T2.
Use Figure 12 (b) to describe in more detail, as described below.That is,, in VLC form T2, give run short { run, the less Code Number (shorter code) of combination distribution of level}.
For example,, in VLC form T1, to { run, the combination of level}=(0,0) distributes minimum Code Number " 0 ".In addition, consider occurrence frequency, give { run, the combination distribution time little Code Number " 1 " of level}=(4,0).
About combination in addition, if level is identical, give the longer larger Code Number of distribution of run.
About becoming { run, the combination of level}, the distribution Code Number " 7 " when run=3 of level=0.In addition, about becoming { run, the combination of level}, the distribution Code Number " 8 " when run=3 of level=1.
In the R11 of the decoding of low frequency component side zone, due to the trend that has run and shorten, therefore by short to run, { run, the combination of level} distributes less Code Number, thereby can improve code efficiency.
In addition, distance of swimming mode decoding section 102 is according to the corresponding relation shown in Figure 13, in the distance of swimming mode decoding of decoding regional R12~14 is processed with reference to VLC form T1.
Use Figure 12 (a) to be illustrated in greater detail, as described below.That is,, in VLC form T1, give { run, the less Code Number (shorter code) of combination distribution of level} that become level=0.
In VLC form T1, to { run, the combination of level}=(0,0) distributes minimum Code Number " 0 ".In addition, consider occurrence frequency, give { run, the combination distribution time little Code Number " 1 " of level}=(4,0).
Give { run, the larger Code Number of combination distribution ratio combinations thereof of level} that become level=0.That is, according to the length of run, give { run, difference allocation of codes numbering " 5 "~" 8 ", level}=(0,1)~(0,3).
In the decoding of high fdrequency component side zone R12~14, due to the high trend of possibility with decoding level=0, therefore by give become level=0 { run, the combination of level} distributes less Code Number, thereby can improve code efficiency.
In addition, although show the example of carrying out the conversion of the parameter-code in the processing of distance of swimming mode decoding with VLC form T1 and T2, be not limited to this, also can realize the conversion process be equal to VLC form T1 and T2 by computing.
In addition, illustrated although above with the form of VLC form T1 and these 2 kinds of T2 and carried out the example that distance of swimming mode decoding is processed, be not limited to this, also can pre-define than the form of 2 more kinds of kind.For example, distance of swimming mode decoding section 102 also can be configured to, the mutual different VLC form of reference in the regional R11~R14 of decoding.
[4] dynamic optimization
Explanation performed by regional decoding section 122 with the example corresponding dynamic optimization of decoding situation.
The renewal of [4-1] VLC form
The occurrence frequency of the value of regional decoding section 122 also count enable parameters, and rewrite the Code Number of VLC form according to occurrence frequency.
That is, the Code Number of the value of the parameter that regional decoding section 122 also can be high by occurrence frequency is rewritten into the value of less (code is shorter), and by occurrence frequency, the Code Number of the value of low parameter is rewritten into the more value of large (code is longer).
With Figure 14, describe, as described below.Such as illustrated in Figure 14, in the VLC form T3 before optimization, distributed Code Number CN-1 to the value y of certain parameter, distributed Code Number CN to the value x of another parameter.
Suppose that regional decoding section 122 is with reference to x and the Code Number CN of correspondence with it of the VLC form T3 before optimization.
Now, regional decoding section 122 subtracts 1 by Code Number CN, thereby Code Number that will be corresponding with x advances to CN-1.On the other hand, the Code Number of the y that regional decoding section 122 will be originally corresponding with CN-1 increases 1 and sets up corresponding with CN.At this, the advanced processing of this Code Number is called to optimization.VLC form T4 shown in Figure 14 is the form obtained after above-mentioned optimization.
And then, at this, in the situation that regional decoding section 122 is with reference to x, the Code Number that regional decoding section 122 will be corresponding with x is made as CN-2.
Thus one, regional decoding section 122 distributes shorter Code Number ground according to the occurrence frequency of the value of the parameter that will decode, in the decoding processing, dynamically upgrades the VLC form.
[4-2] are about adaptive speed
The adaptive speed of dynamic optimization is described, as described below.In Figure 14, just Code Number is subtracted to 1 when the value of reference primary parameter, but be not limited to this, also Code Number can be subtracted more than 2.
The speed of adaptive speed for example can show as the size that reduces Code Number.That is to say, the situation that is " 1 " than the amount of the minimizing of Code Number, for the situation of " 2 " more can be called " adaptive speed is fast ".
In addition, making run, in the situation of the VLC form dynamic optimization of level}, also can be in the decoding zone of high fdrequency component side the amount of increase during by optimization be made as " 0 ", and be made as " 1 " in the low frequency component side.
That is, also can be configured to and not carry out optimization in the decoding zone of high fdrequency component side.Adopt the reasons are as follows of this formation described.That is, be easy in the high fdrequency component side distance of swimming that elongated, specific { run, the possibility that level} occurs with high frequency is low.Thereby, according to above-mentioned formation, can prevent because just carry out the increase of the caused amount of calculation of optimization when the value of parameter occurring.
In addition, also can in the regional decoding section 122 of moving image decoding apparatus 1, carry out above-mentioned optimization.
[5] change of distance of swimming pattern termination condition
Also can change distance of swimming pattern termination condition according to the regional position of decoding.For example, also can in the decoding zone of high fdrequency component side, by distance of swimming pattern termination condition, establish strictly, establish loosely by distance of swimming pattern termination condition in the decoding zone of low frequency component side.That is to say, as long as be positioned at upper left decoding zone in the object piece, more be easy to the end runs pattern.In addition, also can in the decoding zone of high fdrequency component side, not carrying out the amplitude mode decoding processes.
In the decoding zone of high fdrequency component side, there is the trend that the absolute value of coefficient diminishes.Thereby the possibility elongated in the middle reaches, decoding zone of high fdrequency component side journey is high.Thus, preferably carry out distance of swimming mode decoding processes as far as possible.
In addition, in distance of swimming mode decoding section 102, also can determine whether that starting distance of swimming mode decoding processes.Distance of swimming mode decoding section 102 for example also can be based on last the decoded time point of nonzero coefficient can reference data, in the situation that " the absolute value integral body that can judge coefficient is little ", determines and start distance of swimming mode decoding and process.
In the situation that " the absolute value integral body that can judge coefficient is little ", because the elongated possibility of the distance of swimming is high, so carrying out distance of swimming mode decodings, distance of swimming mode decoding section 102 processes.
In addition, the data that the decoded time point of last nonzero coefficient can reference, such as the information of forecasting of enumerating the object piece, with the last corresponding information of nonzero coefficient, in addition be encoded and decoded mark etc. in coder side.
Particularly, in the situation that the value of last coefficient is greater than 1, distance of swimming mode decoding section 102 also can skip distance of swimming mode decoding and process.
In addition, also can carry out according to the regional position of decoding the threshold value of the various judgements in coefficient of variation decoding processing.
Variation 1-3:[decides according to the predictive mode of object piece having or not of cutting apart]
Region Segmentation section 121 also can be configured to according to the predictive mode of object piece and decide having or not of cutting apart.
For example, Region Segmentation section 121 also can be in the situation that the predictive mode of object piece be infra-frame prediction is cut apart, in the situation that do not cut apart for inter prediction.
In addition, in the situation that the predictive mode of object piece is inter prediction and Region Segmentation section 121, cut apart, regional decoding section 122 also can only decode from the coefficient of the 1st to the 64th according to scanning sequency in the object piece.That is, in this case, also can only decode 64 coefficients of the upper left side that is positioned at the object piece of regional decoding section 122.
In this case, can obtain and can omit this effect of decoding that means to cut apart the mark had or not.
Number and the size in variation 1-4:[zone]
In the example shown in Fig. 5 and Fig. 6, illustrated that Region Segmentation section 121 is divided into 4 situations that decoding is regional by the object piece.Yet, being not limited to this, Region Segmentation section 121 also can be divided into the object piece zone more than 4.
In addition, the object piece is not limited to 16 * 16 sizes.For example, the object piece can be 32 * 32 sizes, can be also 64 * 64 sizes.
For example, in the situation that the object piece is 64 * 64 sizes, Region Segmentation section 121 also can be divided into the object piece decoding zone more than 4 as described below.
That is, Region Segmentation section 121 also can be divided into the object piece decoding zone of 64 8 * 8 sizes.In addition, Region Segmentation section 121 also can be divided into the object piece decoding zone of 16 16 * 16 sizes.In addition, Region Segmentation section 121 also can be divided into the object piece decoding zone of 4 32 * 32 sizes.
In addition, Region Segmentation section 121 cutting object pieces and the decoding zone that obtains is not limited to square.For example, the decoding zone can be also rectangle.In addition, also the object piece can be divided into to decoding zone (being equivalent to the decoding zone R11 in Fig. 5) and these 2 zones, the zone of decoding in addition (being equivalent to the decoding zone R12~R14 in Fig. 5) of 8 * 8 sizes of upper left.
In addition, each regional shape of decoding can be not identical yet.For example, the object piece one group Di is cut apart to obtain the decoding zone according to every 64 coefficients as shown in Figure 9 on the sawtooth scan order.
That is, as shown in Figure 9, also the object piece can be divided into to the regional R21~R24 of decoding.Numeral " 64 " shown in decoding in the zone of regional R21~R24 is illustrated in zone and comprises 64 coefficients.
The regional R21 that decodes is the zone that comprises the DC coefficient, and this shape is meaned with right-angled triangle in the figure.In addition, the decode shape of regional R22 and R23 is meaned with trapezoidal respectively.In addition, the regional R24 that decodes is the zone of the most close high fdrequency component side in the middle of each zone, and this shape is meaned with right-angled triangle.
In addition, in Fig. 9, for convenience of explanation, the shape of regional R21 and R24 meaned with right-angled triangle, and the shape of regional R22 and R23 is meaned with trapezoidal, but should note: in fact on stricti jurise, say, do not become right-angled triangle, trapezoidal.
The number of the coefficient comprised in each decode regional shape and the zone of respectively decoding in addition, can be not identical yet.
Like this, Region Segmentation section 121 can at random be divided into the object piece a plurality of decodings zone of the size less than the size of this object piece.
Variation 1-5:[regional processing sequence of decoding]
In the example shown in Fig. 5 and Fig. 6, illustrated that regional decoding section 122 is according to the processing of decoding of the order (so-called raster scan order) of decoding regional R11, R12, R13 and R14.Yet, being not limited to this, regional decoding section 122 also can be according to the processing of decoding of order in addition.
For example, regional decoding section 122 also can be according to the order of decoding regional R14, R13, R12 and R11 or as another example and according to the order processing of decoding of decoding regional R11, R13, R12 and R14.In addition, in the situation that the regional number of decoding more than 4, for example be 16, also can be processed according to the sawtooth scan order.
(dynamic image encoding device)
At first, referring to Figure 10~Figure 14, the related dynamic image encoding device of present embodiment 2 is described.
(summary of dynamic image encoding device)
Dynamic image encoding device 2 is to generate the go forward side by side device of line output of coded data #1 by encoding input image #10 in brief.
(formation of dynamic image encoding device)
At first, the configuration example of dynamic image encoding device 2 is described with Figure 10.Figure 10 is the functional block diagram that the formation about dynamic image encoding device 2 is meaned.As shown in figure 10, dynamic image encoding device 2 possesses coding configuration part 21, re-quantization and inverse transformation section 22, predicted picture generating unit 23, adder 24, frame memory 25, subtracter 26, transform and quantization section 27 and variable-length encoding part 28.
Coding configuration part 21 generates and relevant view data and the various set information of encoding based on input picture #10.
Particularly, coding configuration part 21 generates view data and the set information of next.
At first, coding configuration part 21 is divided into section unit, tree block unit successively by input picture #10, generates thus and the corresponding CU image of object CU #100.
In addition, the result of coding configuration part 21 based on dividing processing generates header information H '.Header information H ' comprises the position corresponding CU information CU ' in the corresponding information in position in (1) and size, shape and the object slice of tree piece that belongs to object slice and (2) and the size, shape and the object tree piece that belong to the CU that respectively sets piece.
And then coding configuration part 21 generates PT set information PTI ' with reference to CU image #100 and CU information CU '.Comprise the relevant information of all combinations that (1) object CU may be partitioned into parted pattern and the predictive mode that (2) can distribute to each PU of each PU in PT set information PTI '.
Coding configuration part 21 is supplied to subtracter 26 by CU image #100.In addition, coding configuration part 21 is supplied to variable-length encoding part 28 by header information H '.In addition, coding configuration part 21 is supplied to predicted picture generating unit 23 by PT set information PTI '.
The quantitative prediction residual error of each piece of 22 couples of transformation into itself of re-quantization and inverse transformation section and quantization unit 27 supplies is carried out re-quantization and inverse dct transform (Inverse Discrete Cosine Transform: inverse discrete cosine transform), recover thus the prediction residual of each piece.
In addition, re-quantization and inverse transformation section 22 unify the prediction residual of each piece according to the parted pattern by TT carve information (aftermentioned) appointment, generate and the corresponding prediction residual D of object CU.Re-quantization and inverse transformation section 22 will be supplied to adder 24 with the generated corresponding prediction residual D of object CU.
In predicted picture generating unit 23 reference frame memories 25, local decoder image P ' and the PT set information PTI ' of record, generate the corresponding predicted picture Pred with object CU.Predicted picture generating unit 23 will generate and process the Prediction Parameters obtains and set PT set information PTI ' for by predicted picture, and the PT set information PTI ' after setting is transferred to variable-length encoding part 28.In addition, because performed predicted picture generates that to process the performed processing of the predicted picture generating unit 14 that possesses with moving image decoding apparatus 1 identical by predicted picture generating unit 23, so in this description will be omitted.
Adder 24 will, from the predicted picture Pred of predicted picture generating unit 23 supply with from the prediction residual D addition of re-quantization and 22 supplies of inverse transformation section, generate the corresponding decoded picture P with object CU thus.
Record successively decoded decoded picture P in frame memory 25.In frame memory 25, record: at the time point of decoder object tree piece, the corresponding decoded picture of first decoded all tree pieces than this object tree piece (for example, according to raster scan order all tree pieces) formerly.
Subtracter 26 deducts predicted picture Pred among CU image #100, generates thus the corresponding prediction residual D with object CU.Subtracter 26 is supplied to transform and quantization section 27 by generated prediction residual D.
Discrete cosine transform) and quantize 27 couples of prediction residual D of transform and quantization section carry out dct transform (Discrete Cosine Transform:, generate thus the quantitative prediction residual error.
Particularly, transform and quantization section 27 is with reference to CU image #100 and CU information CU ', decides object CU to be divided into 1 or a plurality of parted pattern.In addition, according to determined parted pattern, prediction residual D is divided into and the corresponding prediction residual of each piece.
In addition, transform and quantization section 27 is by carrying out dct transform (Discrete Cosine Transform: discrete cosine transform) after having generated the prediction residual in the frequency field, generate the quantitative prediction residual error of each piece by quantizing prediction residual in this frequency field with the corresponding prediction residual of each piece.
In addition, transform and quantization section 27 generates TT set information TTI ', and this TT set information TTI ' comprises that TT carve information and the object CU of parted pattern of quantitative prediction residual error, the appointed object CU of each generated piece may be partitioned into the relevant information of whole parted patterns of each piece.Transform and quantization section 27 is supplied to re-quantization and inverse transformation section 22 by generated TT set information TTI'.
In addition, transform and quantization section 27 generates the TU set information TUI ' of the quantitative prediction residual error that comprises the object piece, and is supplied to variable-length encoding part 28.
Variable-length encoding part 28 generates coded data #1 based on TU set information TUI ', PT set information PTI ' and header information H ', the line output of going forward side by side.In addition, the detailed content about variable-length encoding part 28 will be described below.
(variable-length encoding part)
Secondly, use Figure 11, the formation of variable-length encoding part 28 is illustrated in further detail.Figure 11 is the block diagram that the configuration example about variable-length encoding part 28 is meaned.
As shown in figure 11, variable-length encoding part 28 possesses TU information coding section 280, header information coding section 40, PTI information coding section 41 and coded data multiplexing unit 42.
280 couples of TU set information TUI ' of TU information coding section are encoded and are supplied to coded data multiplexing unit 42.In addition, 40 couples of header information H ' of header information coding section are encoded and are supplied to coded data multiplexing unit 42.In addition, 41 couples of PTI information PTI ' of PTI information coding section are encoded and are supplied to coded data multiplexing unit 42.
The multiplexing TU set information of coded data multiplexing unit 42 TUI ', header information H ' and PTI information PTI ' generate coded data #1, the line output of going forward side by side.
At this, about the formation of TU information coding section 280, illustrate in further detail, as described below.In addition, below the explanation quantitative prediction residual error that TU set information TUI ' comprises for encoding, be the formation that coefficient matrix obtains the coefficient coding data.
In addition, below TU information coding section's 280 pieces of the object about 16 * 16 sizes are described and the example of encode TU information TUI the complete TU information of output encoder TUI '.Yet, being not limited to this, the size of the object piece that TU information coding section 280 is coded can be also 64 * 64,32 * 32 etc.
As shown in figure 11, TU information coding section 280 possesses VLC form TBL21, Region Segmentation section (change of scale cutting unit) 281 and regional code section (transform coefficients encoding unit) 282.
VLC form TBL21 be defined each parameter, and the bit string of coded data be the form of the corresponding relation between code.
VLC form TBL21 adopts illustratively about the coding units of 8 * 8 sizes and prescriber forms.That is, VLC form TBL21 adopts is the form of 8 * 8 sizes less than the change of scale that becomes input (or coding units) 16 * 16 sizes.
In addition, the context during VLC form TBL21 processes according to coding has been stipulated a plurality of, in order to can carry out the adaptive coding processing.
Region Segmentation section 281 is divided into a plurality of zones by the object piece.Below, will be called coding region by each zone obtained cutting apart of Region Segmentation section 281.
In addition, following declare area cutting part 281 is divided into the object piece of 16 * 16 sizes the example of 4 coding regions.That is, Region Segmentation section 281 is divided into the object piece of 16 * 16 sizes the coding region of 48 * 8 sizes.
Region Segmentation section 281 for example can be as used the such cutting object piece as shown in Fig. 5.In addition, in the following description, also decoding zone R11~R14 shown in Figure 5 can be called again to coding region R11~R14.
That is, as shown in Figure 5, Region Segmentation section 281 is divided into coding region R11~R14 by object piece BLK.
Yet, being not limited to this, the gimmick of cutting apart of Region Segmentation section 281 also can be applied various gimmicks.About its variation, will describe in detail in the back.
Regional code section 282 is about each of the coding region that obtains by Region Segmentation section 281 cutting object pieces, with reference to the VLC form TBL21 stipulated according to the size of this coding region processing of encoding.That is, following declare area coding section 282 is based on context with reference to the encode example of processing of the VLC form TBL21 stipulated about 8 * 8 sizes.
Specifically being constructed as follows of regional code section 282 is described., regional code section 282 possesses last nonzero coefficient coding section 201, distance of swimming pattern-coding section 202 and amplitude mode coding section 203.
Last nonzero coefficient coding section 201 is about coding region (hereinafter referred to as making the object coding region) the last nonzero coefficient of encoding of the object that becomes coding.More specifically, last nonzero coefficient coding section 201 is about the last nonzero coefficient comprised in the coefficient matrix corresponding with object coding zone encode last_pos, level and sign.
Distance of swimming pattern-coding section 202 implements distance of swimming pattern-coding for the coefficient matrix relevant to the object coding zone and processes.That is, distance of swimming pattern-coding section 202 with reference to and the corresponding VLC form of context TBL21, by distance of swimming pattern about the nonzero coefficient comprised in coefficient matrix encode successively run, level and sign.Below, the coding that distance of swimming pattern-coding section 202 is carried out is processed and is called the processing of distance of swimming pattern-coding.
In addition, distance of swimming pattern-coding section 202 carries out the processing of distance of swimming pattern-coding repeatedly, until meet distance of swimming pattern termination condition.As the condition of distance of swimming pattern termination condition, surpassed the situation etc. of the coefficient of the situation of threshold value, the defined amount of having encoded such as the value of enumerating the coefficient of encoding out.
And, in the situation that met distance of swimming pattern termination condition, the coding that distance of swimming pattern-coding section 202 allows amplitude mode coding section 203 start based on amplitude mode.
Amplitude mode coding section 203, about the object coding zone, carrys out the coefficient comprised in the code coefficient matrix by amplitude mode.That is, amplitude mode coding section 203 with reference to and the corresponding VLC form of context TBL21, by level and the sign of the coefficient that comprises in amplitude mode code coefficient matrix successively.
Below, encode coding that section 203 carries out of amplitude mode is processed and is called the amplitude mode coding and processes.In addition, amplitude mode coding section 203 repeatedly carries out amplitude mode coding and processes, until according to the scanning sequency in the object coding zone initial coefficient of encoding.
In addition, when regional code section 282 has completed the coding processing in each object coding zone about the object piece, output comprises the complete TU set information of the coding TUI ' that processes the coefficient obtained by this coding.
In addition, regional code section 282 processing of can encoding as shown in Figure 6.In the following description, also decoding zone R11~R14 shown in Figure 6 can be called again to coding region R11~R14.
That is, regional code section 282 is according to the processing of encoding of the order of coding region R11, R12, R13 and the R14 shown in Fig. 6.In addition, in each of coding region R11~R14, at first last nonzero coefficient lsb decoder 101 is according to the scanning sequency last nonzero coefficient of decoding.Then, by distance of swimming mode decoding, process and the amplitude mode decoding is processed, the scanning sequency of decoding from last nonzero coefficient by the object coding zone according to the reverse order of scanning sequency is until initial coefficient.
(handling process)
As mentioned above, because the coding handling process of dynamic image encoding device 2 is roughly the same with the decoding handling process of using moving image decoding apparatus 1 illustrated in fig. 4, so description is omitted at this.
(variation)
Preferred several variation of following account for motion picture coding device 2.
Variation 1-1 ': [judgement had or not of nonzero coefficient]
The regional code section 282 that dynamic image encoding device 2 possesses also can be configured to, and about each coding region, judges having or not of nonzero coefficient, and the nonzero coefficient mark of the result judged of coded representation being included in the coefficient coding data.In addition, in this formation, regional code section 282 can omit the coding with the corresponding coefficient of coding region that does not have nonzero coefficient.
About the concrete example of coding region and nonzero coefficient mark because with variation 1-1 at moving image decoding apparatus 1 in the content narrated roughly the same, so in this description will be omitted.Wherein, also " decoding zone " in the explanation of variation 1-1, " decoding is processed " and " regional decoding section 122 " can be called again respectively to " coding region ", " coding is processed " and " regional code section 282 ".
Variation 1-2 ': [according to the position of coding region, carrying out changing process method]
[1] change of scan method
In the above description, although the employing of the scan mode of each coding region is sawtooth scan, also can change scan method according to the position of coding region.About concrete scan method because for example with variation 1-2[1 at moving image decoding apparatus 1] in the content of explanation identical, so in this description will be omitted.Wherein, also can be by variation 1-2[1] explanation in " decoding zone ", " decoding is processed " and " regional decoding section 122 " be called again respectively " coding region ", " coding is processed " and " regional code section 282 ".
[2] pattern limits
In the above description, although carried out the processing of distance of swimming pattern-coding and the processing of amplitude mode coding in the coding of each coding region is processed, also can be configured to according to the position of coding region and carry out the processing of distance of swimming pattern-coding.Because concrete processing example for example with variation 1-2[2 at moving image decoding apparatus 1] in the content of explanation identical, so in this description will be omitted.Wherein, also can be by variation 1-2[2] explanation in " decoding zone ", " decoding is processed " and " regional decoding section 122 " be called again respectively " coding region ", " coding is processed " and " regional code section 282 ".
[3] computational methods of change VLC form, Code Number
In distance of swimming pattern-coding is processed, also can change VLC form that will reference, the computational methods of Code Number according to the position of coding region.The bit string that for example, also can change for meaning Code Number according to the position of each coding region is transformed into { run, the VLC form of the group of the parameter of level}.Because concrete processing example for example with variation 1-2[3 at moving image decoding apparatus 1] in the content of explanation identical, so in this description will be omitted.Wherein, also can be by variation 1-2[3] explanation in " decoding zone ", " decoding is processed " and " regional decoding section 122 " be called again respectively " coding region ", " coding is processed " and " regional code section 282 ".
[4] dynamic optimization
Explanation performed by regional code section 282 with the example corresponding dynamic optimization of coding situation.
The renewal of [4-1] VLC form
The occurrence frequency of the value of regional code section 282 also count enable parameters, and rewrite the Code Number of VLC form according to occurrence frequency.
That is, the Code Number of the value of the parameter that regional code section 282 also can be high by occurrence frequency is rewritten into the value of less (code is shorter), and by occurrence frequency, the Code Number of the value of low parameter is rewritten into the more value of large (code is longer).Because concrete processing example for example with variation 1-2[4-1 at moving image decoding apparatus 1] in the content of explanation identical, so in this description will be omitted.Wherein, also can be by variation 1-2[4-1] explanation in " decoding zone ", " decoding is processed " and " regional decoding section 122 " be called again respectively " coding region ", " coding is processed " and " regional code section 282 ".
[4-2] are about adaptive speed
About the adaptive speed of the dynamic optimization in dynamic image encoding device 2 because with variation 1-2[4-2 at moving image decoding apparatus 1] in the content that illustrated identical, so in this description will be omitted.
[5] change of distance of swimming pattern termination condition
Also can change distance of swimming pattern termination condition according to the position of coding region.In addition, also can judge that distance of swimming pattern starts condition according to the position of coding region.About its detailed content because with variation 1-2[5 at moving image decoding apparatus 1] explanation in illustrated, so description is omitted at this.
Variation 1-3 ': [according to the predictive mode of object piece, deciding having or not of cutting apart]
Region Segmentation section 281 also can be configured to according to the predictive mode of object piece and decide having or not of cutting apart.Because concrete processing is for example identical with the content of explanation in variation 1-3 at moving image decoding apparatus 1, so in this description will be omitted.
Variation 1-4 ': [number and the size in zone]
Region Segmentation section 281 also can be divided into the object piece zone more than 4.Because concrete processing is for example identical with the content of explanation in variation 1-4 at moving image decoding apparatus 1, so in this description will be omitted.
Variation 1-5 ': [processing sequence of coding region]
The scanning sequency that performed coding is processed by regional code section 282 is not limited to above-mentioned example.Regional code section 282 also can be configured to and for example carry out the processing identical with the processing illustrated in variation 1-5 at moving image decoding apparatus 1.
In addition, as above-mentioned, [variation] of moving image decoding apparatus 1 can be applied to dynamic image encoding device 2.On the contrary, even if about [variation] of the dynamic image encoding device 2 shown in this, process being altered to decoding and processing by encoding, also can be applicable to moving image decoding apparatus 1 thus.
(action effect)
As discussed above, moving image decoding apparatus 1 is configured to, this conversion coefficient of decoding among the TU information TUI of the coded data obtained from conversion coefficient encodes, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, moving image decoding apparatus 1 possesses: Region Segmentation section 121, and it will be divided into as the object piece of above-mentioned change of scale a plurality of decodings zone; With regional decoding section 122, the conversion coefficient that it is decoded in above-mentioned decoding zone and comprise with reference to VLC form TBL11, this VLC form TBL11 is for obtain the decoded information of above-mentioned conversion coefficient, the VLC form TBL11 be assigned with according to each above-mentioned decoding zone among TU information TUI.
In addition, dynamic image encoding device 2 is configured to, conversion coefficient is encoded, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, dynamic image encoding device 2 possesses: Region Segmentation section 281, and it will be divided into a plurality of coding regions as the object piece of above-mentioned change of scale; With regional code section 282, the conversion coefficient that it is encoded in above-mentioned change of scale and comprise with reference to VLC form TBL21, this VLC form TBL21 is the VLC form TBL21 for the above-mentioned conversion coefficient of encoding, the VLC form TBL21 be assigned with according to each above-mentioned coding region.
Above-mentioned formation according to moving image decoding apparatus 1, the VLC form 11 of the regulation processing of decoding because the decoding based on about 8 * 8 sizes zone, so than the size based on about the primary object piece (16 * 16) and the decode situation of processing of the VLC form of regulation can reduce the size of VLC form.In addition, about the form that means scanning sequency similarly, the processing because the decoding zone based on about 8 * 8 sizes and prescriber forms are decoded, so can reduce the size of form.
In addition, about dynamic image encoding device 2, also can obtain same action effect.
(2) execution mode 2
Based on Figure 15~Figure 22, another embodiment of the present invention is described, as described below.In addition, for convenience of explanation, give identical label about the parts with function identical with the accompanying drawing illustrated in described execution mode 1, and the description thereof will be omitted.
Coding and the decoding of the coefficient based on the relative position appointment below are shown.In addition, in the following description, the size of object piece is assumed to be 16 * 16 as an example.
(moving image decoding apparatus)
At first, the formation of moving image decoding apparatus 1 is described with reference to Figure 15, as described below.In related moving image decoding apparatus 1, TU information decoding section 12 is altered to the TU information decoding 12A of section shown in Figure 15 in present embodiment among the moving image decoding apparatus 1 shown in Fig. 2.
Below the TU information decoding 12A of section shown in explanation Figure 15, as described below.That is, as shown in figure 15, the TU information decoding 12A of section possesses VLC form TBL30, distance of swimming amplitude mode lsb decoder 310, relative position mode decoding section 320 and tupe control part 330.
VLC form TBL30 has set up corresponding form with the mutual disposable Code Number of bit string (code) and the parameter that should decode.VLC form TBL30 possesses: by the distance of swimming amplitude mode of 310 references of distance of swimming amplitude mode lsb decoder described later with form TBL31 with by the relative position pattern form TBL32 of 320 references of relative position mode decoding section.
Distance of swimming amplitude mode can adopt the form same with the VLC form TBL11 of the TU information decoding section 12 shown in Fig. 1 with form TBL31.Thus, at this, the description thereof will be omitted.In addition, about the relative position pattern, with the definition of form TBL32, will narrate in the back.
Distance of swimming amplitude mode lsb decoder 310 carries out the processing of distance of swimming mode decoding by the control of tupe control part 330 and the amplitude mode decoding is processed.Below, decoding that will be performed by distance of swimming amplitude mode lsb decoder 310 is processed and is called distance of swimming amplitude mode decoding processing.In addition, about distance of swimming mode decoding, process and amplitude mode decoding processing, because illustrated in described execution mode 1, at this, the description thereof will be omitted.
Distance of swimming amplitude mode lsb decoder (decoding unit) 310 possesses last nonzero coefficient lsb decoder 311, distance of swimming mode decoding section 312 and amplitude mode lsb decoder 313.
Last nonzero coefficient lsb decoder 311, distance of swimming mode decoding section 312 and amplitude mode lsb decoder 313 have respectively the function same with using last nonzero coefficient lsb decoder 101, distance of swimming mode decoding section 102 and amplitude mode lsb decoder 103 illustrated in fig. 1.Thus, about its function, carried out explanation, so the description thereof will be omitted at this.
In addition, distance of swimming mode decoding section 102 and amplitude mode lsb decoder 103 are configured to: in distance of swimming amplitude mode decoding is processed with reference to distance of swimming amplitude mode form TBL31.
The decoding of the coefficient coding data that the relative position that relative position mode decoding section 320 carries out nonzero coefficient is encoded is processed.Relative position mode decoding section 320 particularly possesses last nonzero coefficient lsb decoder 321, relative position lsb decoder (relative position decoding unit) 322 and coefficient positions determination section (position determination unit) 323.
Last nonzero coefficient in last nonzero coefficient lsb decoder 321 decoder object pieces.Last_pos, the level and the sign that in last nonzero coefficient lsb decoder 321 decodings coefficient coding data relevant to the object piece, comprise.In addition, last nonzero coefficient lsb decoder 321 is transformed into last_pos to take the coordinate that the DC coefficient is initial point (0,0) and show (1astx, lasty).Below will be by take that coordinate that this DC coefficient is initial point (0,0) shows and the position that means is called absolute position.
Relative position lsb decoder 322 is about the object piece, and among the coefficient coding data that are encoded from the relative position of nonzero coefficient, decoding becomes the relative position (dx, dy) of nonzero coefficient of decoder object and the value (1evel and sign) of nonzero coefficient.At this, the relative position of nonzero coefficient refers to, the relative position of, nonzero coefficient that become decoder object that look from the absolute position of the previous nonzero coefficient decoded.In addition, below the performance of the position of this nonzero coefficient based on relative position being called to relative position specifies.
Relative position (the dx of the nonzero coefficient that coefficient positions determination section 323 decodes according to relative position lsb decoder 322, dy) and the absolute position (x of the previous nonzero coefficient decoded, y), decide the absolute position of the nonzero coefficient that becomes decoder object.
From last nonzero coefficient C 0until the nonzero coefficient C of n+1 n+1till relative position, for example can show enough following relational expression (1-1)~(1-3).
C 0:x 0=lastx,y 0=lasty…(1—1)
C 1:x 1=x 0+dx,y 1=y 0+dy…(1—2)
C n+1:x n+1=x n+dx,y n+1=y n+dy…(1—3)
Coefficient positions determination section 323 is used above-mentioned relation formula (1-1)~(1-3) to decide the absolute position of the nonzero coefficient that becomes decoder object.
Tupe control part 330 judges whether the nonzero coefficient that becomes decoder object is positioned at the zone of regulation, and according to this result of determination, controlling is by decode processing or by relative position mode decoding section 320 processing of decoding of distance of swimming amplitude mode lsb decoder 310.
Whether the position that particularly, tupe control part 330 is judged the nonzero coefficient that becomes decoder object is in the zone of 8 * 8 sizes of low frequency component side.
That is,, according to the illustration of above-mentioned relation formula (1-1)~(1-3), judge the nonzero coefficient C that becomes decoder object nposition (x n, y n) whether be " x n<8& & y n<8 ".In addition, “ & & " be presentation logic " with " operator.
If become the position of nonzero coefficient of decoder object in the zone of 8 * 8 sizes of low frequency component side, tupe control part 330 allows distance of swimming amplitude mode lsb decoder 310 carry out from next nonzero coefficient C n+1the decoding risen is processed.
If become the position of nonzero coefficient of decoder object not in the zone of 8 * 8 sizes of low frequency component side, tupe control part 330 allows relative position mode decoding section 320 carry out decoding to process.
(handling process)
Secondly, use Figure 16, the decoding in the TU information decoding 12A of section is processed and described.Figure 16 is that the flow process of the treatment S 20 about specify the coding/decoding nonzero coefficient by relative position is carried out illustrative flow chart.
In addition, in Figure 16, the coding processing of dynamic image encoding device 2 and the decoding of moving image decoding apparatus 1 are shown in the lump and process.
As shown in figure 16, when the treatment S 20 of the nonzero coefficient of specifying by relative position to decode starts, at first in moving image decoding apparatus 1, tupe control part 330 allows relative position mode decoding section 320 carry out decoding to process.With respect to this, the last nonzero coefficient (S21) of last nonzero coefficient lsb decoder 321 decoder object pieces.
Whether the position that tupe control part 330 is judged the last nonzero coefficient decoded in S21 was in the zone of coefficient of relative position appointment was carried out in decoding (S22).
In the situation that be that decoding was carried out in the zone of coefficient of relative position appointment ("Yes" in S22), tupe control part 330 allows relative position mode decoding section 320 carry out decoding and processes.With respect to this, the position of the coefficient of relative position appointment was carried out in 322 decodings of relative position lsb decoder, and coefficient positions determination section 323 is decoded and become the coefficient of decoder object (S23) by the absolute position that determines this coefficient.
Below, the position of decoded coefficient, in the situation that decoding was carried out in the zone of coefficient of relative position appointment, is proceeded the decoding of the coefficient based on the relative position appointment and is processed (S22, S23).
The position of decoded coefficient was not in the situation that decoding was carried out in the zone of coefficient of relative position appointment ("No" in S22), and tupe control part 330 allows distance of swimming amplitude mode lsb decoder 310 carry out decoding and processes.With respect to this, distance of swimming mode decoding section 312 carries out distance of swimming mode decoding and processes, and then amplitude mode lsb decoder 313 is carried out the amplitude mode decoding and processed (S24).Thus one, all coefficients of object piece are decoded, by relative position, specify the processing of desorption coefficient to finish.
(concrete example)
Use Figure 17, the concrete example that the decoding in the TU information decoding 12A of section is processed describes.Figure 17 means the figure of the execution example that the decoding of the TU information decoding 12A of section is processed.
As shown in figure 17, object piece BLK consists of 2 regional R1 and R2.At this, object piece BLK and above explanation similarly are assumed to be the piece of 16 * 16 sizes, and regional R1 is assumed to be the zone of 8 * 8 sizes of low frequency component side.
In addition, the arrow shown in regional R1 means the scanning sequency in regional R1, and this scanning sequency is sawtooth scan.In addition, regional R2 is for specifying the zone of desorption coefficient by relative position.
When the processing of specifying desorption coefficient by relative position starts, the last nonzero coefficient C at first last nonzero coefficient lsb decoder 321 decoder object piece BLK n(S21).
Because last nonzero coefficient C nbe arranged in regional R2 (S22 "Yes"), process so tupe control part 330 allows relative position mode decoding section 320 carry out decoding.
Then, relative position lsb decoder 322 decoding nonzero coefficient C n-1relative position, coefficient positions determination section 323 is according to this decoded relative position and last nonzero coefficient C nabsolute position decide nonzero coefficient C n-1absolute position.And then, decoding level, sign, nonzero coefficient C thus decodes n-1(S23).
Because at nonzero coefficient C n-1below and until nonzero coefficient C 1be positioned at regional R2, so, with similarly above-mentioned, relative position mode decoding section 320 carries out decoding and processes (S22, S23).
And then, if until nonzero coefficient C 0till relative position mode decoding section 320 when having completed decoding and processing, because nonzero coefficient C 0be arranged in regional R1 (S22 "No"), process so tupe control part 330 allows distance of swimming amplitude mode lsb decoder 310 carry out decoding.
In distance of swimming amplitude mode lsb decoder 310, distance of swimming mode decoding section 312 is by nonzero coefficient C 0as basic point, carry out distance of swimming mode decoding and process among regional R1.In addition, when the processing of distance of swimming mode decoding finishes, amplitude mode lsb decoder 313 is carried out the amplitude mode decoding and is processed.
At this, the decoding of distance of swimming amplitude mode lsb decoder 310 is processed and can be adopted the processing same with the processing of the coding units of 8 * 8 sizes of decoding by (reverse order) sawtooth scan.In addition, initial decoded nonzero coefficient C in regional R1 0be preferably the last nonzero coefficient in the regional R1 of 8 * 8 sizes.
(embodiment)
The embodiment more specifically of the TU information decoding 12A of section below is described.Relative position pattern used in the decoding of relative position (dx, dy) can form as follows like that with form TBL32.
That is, nonzero coefficient and the relative distance between previous nonzero coefficient that also can process object are less, at more corresponding short code (bit string) in form TBL32 for the relative position pattern.Process the nonzero coefficient of object and the relative distance between previous nonzero coefficient, for example can derive according to the relative position (dx, dy) of the nonzero coefficient of processing object.
Perhaps, also can, in the relative position pattern with in form TBL32, by shorter code, with the establishment of (dx, the dy) that mean the relative distance that occurrence frequency is higher, stand corresponding.
According to above-mentioned formation, because can distribute adaptively shorter code according to relative distance, occurrence frequency, so the minimizing of the code amount that can seek to decode.
The length-1 on the limit that in addition, the maximum of the absolute value of dx and dy is the coded object piece.That is to say, in the situation that the maximum of the absolute value that the coded object piece is 16 * 16, dx and dy is 15.
(variation)
[change in the zone of regulation]
Tupe control part 330 also can be judged the size of object piece, and according to this judgement, to control be by decode processing or by relative position mode decoding section 320 processing of decoding of distance of swimming amplitude mode lsb decoder 310.
For example, if the object piece is of a size of below 8 * 8, tupe control part 330 allows distance of swimming amplitude mode lsb decoder 310 carry out the decoding processing.In addition, if the object piece is of a size of more than 16 * 16, tupe control part 330 allows relative position mode decoding section 320 carry out the decoding processing.
In addition, if the number of the nonzero coefficient that will decode in the object piece for example, for the number of regulation, below 64, tupe control part 330 allows distance of swimming amplitude mode lsb decoder 310 carry out decoding to process.
In addition, in the coefficient coding data, also can show by the appointment of relative position all nonzero coefficients.In this case, all nonzero coefficients of relative position mode decoding section 330 decoding.
In addition, tupe control part 330 also can change the zone of regulation according to the size of slice type, predictive mode, object piece.
For example, in the situation that the infra-frame prediction of having encoded in the object piece, tupe control part 330 is set as the zone of 8 * 8 sizes of low frequency component side in the zone of regulation.In addition, for example, in the situation that the inter prediction of having encoded in the object piece, tupe control part 330 is set as the zone of 4 * 4 sizes of low frequency component side in the zone of regulation.
[change of form for the relative position pattern]
Also can be according to the absolute position (x of nonzero coefficient n, y n) prepare a plurality of relative position pattern form TBL32.This relative position pattern with form TBL32 preferably in the absolute position of each nonzero coefficient in advance the scope based on dx, the retrievable value of dy carry out optimization.
And relative position mode decoding section 420 also can be according to the absolute position (x of nonzero coefficient n, y n) change will reference relative position pattern form TBL32.
According to above-mentioned formation because can switch adaptively will reference relative position pattern form TBL42, so can reduce the code amount that will decode.
In addition, about the detailed content of this variation, will after the explanation of dynamic image encoding device 2 in carry out.
[performance of the relative position of nonzero coefficient]
Above, although the relative position of nonzero coefficient shows with the form of (dx, dy), be not limited thereto.For example, relative position also can show according to direction and distance.
[about the VLC form]
Also can form as shown in Figure 20 relative position pattern form TBL32.In the relative position pattern shown in Figure 20, with in form TBL32, the value of dx or dy is less, more corresponding less Code Number.
For example, at dx or dy, be 0 o'clock, Code Number is " 0 ".That is to say, if (dx, dy)=(0,1), allocation of codes numbering " 0,1 ".
With further reference to Figure 20, " 1 " and dx, dy " 1 " sets up corresponding, " 2 " and dx, dy " 1 " sets up corresponding.Like this, in the relative position pattern, with in form TBL32, if absolute value is identical value, code becomes negative situation and is assigned with less Code Number.
In addition, in the relative position pattern shown in Figure 20, use in form TBL32, preferably the more little more corresponding short code of Code Number.
(variation)
[scope based on dx, the retrievable value of dy and make the optimized gimmick of VLC form]
Also can be according to the absolute position (x of coefficient n, y n) prepare a plurality of relative position pattern form TBL32.This relative position pattern with form TBL32 preferably in the absolute position of each coefficient in advance the scope based on dx, the retrievable value of dy carry out optimization.
Use Figure 21 and Figure 22, in following its concrete configuration example that illustrates.Figure 21 は illustrates the example that the regional R2 in the object piece BLK shown in Figure 17, Figure 19 (describing in the back) further is subdivided into to regional R2a, regional R2b and these 3 zones of regional R2c.
In addition, Figure 22 illustrates an example of setting up corresponding VLC form with regional R2a, regional R2b and regional R2c.
The value that this figure (a) is illustrated in x or y does not become the relative position pattern form Td1 of 323 references of relative position determination section of relative position mode decoding section 320 in the situation of the above positive value of regulation.
In addition, the value that this figure (b) is illustrated in x or y does not become the relative position pattern form Td2 of 323 references of relative position determination section of relative position mode decoding section 320 in the situation of the following negative value of regulation.
Object piece BLK shown in Figure 21 consists of regional R1, regional R2a~R2c.In Figure 21, the position of the DC coefficient of object piece BLK utilizes the coordinate demonstration of (0,0) to be meaned.In addition, in object piece BLK, sawtooth scan sequentially go up the position of last coefficient, the position use (15,15) of the component of high frequency is meaned.
In Figure 21, illustratively using regional R1 as the foursquare medial region of the Ru that is left upper apex with (0,0).In addition, regional R2a is as the foursquare medial region that is left upper apex with (8,0).In addition, regional R2b is as the foursquare medial region that is left upper apex with (0,8), the foursquare medial region that regional R2c is left upper apex as (8,8).
Relative position pattern form Td1 shown in Figure 22 (a), value " 0 " the allocation of codes numbering " 0 " of giving coefficient.Below, give coefficient value " 1 ", " 1 " ..., " 7 ", " 7 " respectively allocation of codes numbering " 1 ", " 2 " ..., " 13 ", " 14 ".
That is to say, the situation that the absolute value of coefficient is less is assigned with less Code Number.In addition, in the situation that the absolute value of coefficient is identical, positive and negative label is different, and the situation of negative value is assigned with the Code Number less than positive value.For example, give coefficient value " 1 ", " 1 " ..., " 7 ", " 7 " respectively allocation of codes numbering " 1 ", " 2 " ..., " 13 ", " 14 ".
In addition, in the scope that the absolute value of coefficient is " 8 "~" 15 ", only define negative value " 8 "~" 15 ".
On the other hand, the relative position pattern form Td2 shown in Figure 22 (b), in the scope that the absolute value of coefficient is " 0 "~" 7 ", become the definition identical with form Td1 with the relative position pattern.
In addition, in the scope that the absolute value of coefficient is " 8 "~" 15 ", only define positive value " 8 "~" 15 ".
Relative position determination section 323 switches VLC form that will reference as described below according to the retrievable scope of the position of (dx, dy).
About regional R2a, because the retrievable scope of dx is-15≤dx≤7, so relative position determination section 323 is with reference to relative position pattern form Td1.In addition, because the retrievable scope of dy is-7≤dy≤15, so relative position determination section 323 is with reference to relative position pattern form Td2.
In addition, about regional R2b, because the retrievable scope of dx is-7≤dx≤15, so relative position determination section 323 is with reference to relative position pattern form Td2.In addition, because the retrievable scope of dy is-15≤dy≤7, so relative position determination section 323 is with reference to relative position pattern form Td1.
In addition, about regional R2c, because the retrievable scope of dx is-15≤dx≤7, so relative position determination section 323 is with reference to relative position pattern form Td1.In addition, because the retrievable scope of dy is-15≤dy≤7, so relative position determination section 323 is with reference to relative position pattern form Td1.
As more than, based on dx, the retrievable scope of the value of dy and make VLC form optimization, can improve code efficiency thus.
(dynamic image encoding device)
At first, with reference to Figure 18, the formation of dynamic image encoding device 2 is described, as described below.In related dynamic image encoding device 2, TU information coding section 280 is altered to the TU information coding 280A of section shown in Figure 18 in present embodiment among the variable-length encoding part 11 of the dynamic image encoding device 2 shown in Figure 11.
Below, the TU information coding 280A of section shown in Figure 18 is described, as described below.That is, as shown in figure 18, the TU information coding 280A of section possesses VLC form TBL40, distance of swimming amplitude mode coding section 410, relative position pattern-coding section 420 and tupe control part 430.
VLC form TBL40 is that the bit string that has defined each parameter and coded data is the form of the corresponding relation between code.VLC form TBL40 possesses: by the distance of swimming amplitude mode of 410 references of distance of swimming amplitude mode coding section described later with form TBL41 with by the relative position pattern form TBL42 of 420 references of relative position pattern-coding section.
Distance of swimming amplitude mode can adopt the form same with the VLC form TBL21 of the TU information coding section 280 shown in Figure 11 with form TBL41.Thus, at this, the description thereof will be omitted.In addition, the definition about the relative position pattern with form TBL42, will narrate in the back.
Distance of swimming amplitude mode coding section 410 carries out the processing of distance of swimming pattern-coding and amplitude mode coding by the control of tupe control part 330 and processes (below be called distance of swimming amplitude mode coding process).In addition, about distance of swimming pattern-coding, process and the processing of amplitude mode coding, because carried out explanation in described execution mode 1, so the description thereof will be omitted at this.
Distance of swimming amplitude mode coding section 410 possesses last nonzero coefficient coding section 411, distance of swimming pattern-coding section 412 and amplitude mode coding section 413.
Last nonzero coefficient coding section 411, distance of swimming pattern-coding section 412 and amplitude mode coding section 413 have respectively the function same with using last nonzero coefficient coding section 201, distance of swimming pattern-coding section 202 and amplitude mode coding section 203 illustrated in fig. 11.Thus, about its function, carried out explanation, so the description thereof will be omitted at this.
In addition, distance of swimming pattern-coding section 202 and amplitude mode coding section 203 are configured to reference to distance of swimming amplitude mode form TBL41.
The relative position that relative position pattern-coding section 420 coding becomes in the object piece of nonzero coefficient of coded object generates the coefficient coding data.Particularly, relative position pattern-coding section 420 possesses last nonzero coefficient coding section 421, relative position calculating part (relative position coding unit) 422 and relative position coding section (relative position coding unit) 423.
Last nonzero coefficient in last nonzero coefficient coding section 421 coded object pieces.Last nonzero coefficient coding section 421 in the object piece according to the sawtooth scan of reverse order encode last_pos, level and the sign of last nonzero coefficient.In addition, last nonzero coefficient coding section 421 is transformed into last_pos to take the coordinate that the DC coefficient is initial point (0,0) and show (1astx, lasty).Below will utilize and take that coordinate that this DC coefficient is initial point (0,0) shows and the position that means is called absolute position.
Relative position calculating part 422, according to the absolute position of this coefficient and the absolute position of previous nonzero coefficient of encoding out, calculates the relative position of the coefficient that becomes coded object.
Relative position calculating part 422 can calculate based on aforesaid relational expression (1-1)~(1-3) relative position (dx, dy) of the nonzero coefficient that becomes coded object.
Relative position coding section 423 is with reference to relative position pattern form TBL42, relative position (the dx of the nonzero coefficient that the order encoding relative position calculating part 422 by according to the rules calculates, dy) and the level of nonzero coefficient and sign, generate thus the coefficient coding data.
Which type of order to specify by relative position the nonzero coefficient of encoding about relative position coding section 423 with in the object piece, will be described in detail in the embodiment of back.
Tupe control part 430 judges whether the nonzero coefficient that becomes coded object is positioned at the zone of regulation, and according to this result of determination, controlling is by encode processing or by relative position pattern-coding section 420 processing of encoding of distance of swimming amplitude mode coding section 410.
About the gimmick of the control of tupe control part 430, because the tupe control part 330 based on to the TU information decoding 12A of section carried out the gimmick of explanation, so the description thereof will be omitted at this.
(handling process)
As mentioned above, because the coding handling process of dynamic image encoding device 2 is roughly the same with the decoding handling process of using the moving image decoding apparatus 1 shown in Figure 16, so description is omitted at this.
(embodiment)
[about coded sequence]
Below, with Figure 19 illustrate about relative position calculating part 422 in the object piece with which type of order by the relative position embodiment of nonzero coefficient that specifies to encode.Figure 19 is the figure that the example about the coding of the nonzero coefficient based on the relative position appointment is meaned.
Object piece BLK shown in Figure 19 is assumed to be 16 * 16 sizes illustratively, and regional R1 is assumed to be the zone of 8 * 8 sizes of the low frequency component side in this object piece.In addition, regional R2 is the zone beyond the regional R1 in the object piece.
In addition, tupe control part 430, allow distance of swimming amplitude mode coding section 410 carry out coding and process about regional R1, about regional R2, allow relative position pattern-coding section 420 carry out coding and process.
In addition, hypothesis illustratively below: detect in advance N nonzero coefficient C (1) in regional R2~C (N), and detect the last nonzero coefficient C in regional R1 0.
Relative position coding section 423 carrys out relevance ground coding nonzero coefficient (or, also can show as and be unified into a relative position of ploughing the coding nonzero coefficient) through following operation.
Operation [1]
At first, relative position coding section 423 is by the last nonzero coefficient C in regional R1 0as basic point, by the selection reference of regulation, among the nonzero coefficient C (1) from regional R2~C (N), select and nonzero coefficient C 0associated next nonzero coefficient C 1.
As the selection reference of regulation, be for example by trying respectively the C calculated by relative position calculating part 422 0and the coding of the relative position (dx, dy) between nonzero coefficient C (1)~C (N) and the code amount that obtains.In this case, relative position coding section 423 measures little nonzero coefficient by code and is chosen as next nonzero coefficient C 1.
Operation [2]
Relative position coding section 423, using selected nonzero coefficient as basic point, carries out the selection operation based on above-mentioned selection reference in order repeatedly, until the unselected nonzero coefficient in regional R2 becomes nothing.
More usually, can be said to is C to above-mentioned selection reference nwith C n+1between the code amount of relative position (dx, dy).
In addition, the only simple illustration of above-mentioned selection reference, be not limited to this.For example, can be also C nwith C n+1between manhatton distance (| dx|+|dy|), C nwith C n+1between Euclidean distance (dx 2+ dy 2) etc.
By repeatedly carrying out the selection operation of above-mentioned operation [2], nonzero coefficient C thus 0~C nconnected.Result can obtain the illustrated such nonzero coefficient C of Figure 19 0~C ncontact.
Operation [3]
Next, position, absolute value (1evel), the sign symbol (sign) of the 423 coding nonzero coefficient CN of relative position coding section.Above-mentioned position for example can utilize the lower right relative position (dx apart with the coded object piece n, dy n) specify, also can utilize the LastPos based on scanning sequency to specify.In addition, nonzero coefficient C ncan not also to have scanned the last nonzero coefficient in the situation of coded object piece by some scanning sequency.
Operation [4]
Relative position coding section 423 according to above-mentioned operation [2] in the reverse order of selection operation, from nonzero coefficient C n-1to C 0ground each relative position (dx, dy) of coding, absolute value (1evel), sign symbol (sign).
When the coding to nonzero coefficient C0 completes, tupe control part 430 allows distance of swimming amplitude mode coding section 410 carry out coding and processes.Distance of swimming amplitude mode coding section 410 processes by the distance of swimming amplitude mode coding nonzero coefficient C that encodes 0the coefficient of regional R1 in addition.
[about the VLC form]
The relative position pattern also can form as shown in Figure 20 with form TBL42.About the relative position pattern form shown in Figure 20, owing to carrying out explanation, therefore in this description will be omitted.
(variation)
[scope based on dx, the retrievable value of dy and make the optimized gimmick of VLC form]
Also can be according to the absolute position (x of coefficient n, y n) prepare a plurality of relative position pattern form TBL42.This relative position pattern with form TBL42 preferably in the absolute position of each coefficient in advance the scope based on dx, the retrievable value of dy carry out optimization.
Because the concrete processing of this variation is identical with the processing of using Figure 21 and Figure 22 to illustrate in the explanation of moving image decoding apparatus, so the description thereof will be omitted at this.Wherein, suppose also " relative position determination section 323 " to be called again to " relative position coding section 423 ".
[other]
In addition, about present embodiment, [variation] of related moving image decoding apparatus 1, also can be applicable to dynamic image encoding device 2.
(action effect)
As discussed above, moving image decoding apparatus 1 is configured to, this conversion coefficient of decoding among the TU information TUI of the coded data be encoded from conversion coefficient, this conversion coefficient is to obtain by according to each change of scale, the pixel value of object images being carried out to frequency translation, moving image decoding apparatus 1 possesses: relative position lsb decoder 320, and its decoding becomes the relative position of the conversion coefficient of decoder object at a distance of the previous conversion coefficient decoded; With coefficient positions determination section 323, it is position and the above-mentioned relative position in above-mentioned change of scale according to the previous above-mentioned conversion coefficient decoded, and determines the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.
In addition, dynamic image encoding device 2 is configured to, conversion coefficient is encoded, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, dynamic image encoding device 2 possesses: relative position coding section 423, its coding becomes the position of above-mentioned conversion coefficient of coded object with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out.
In the zone of high fdrequency component side, there is the trend that coefficient becomes sparse.Thereby, in the situation that the distance of swimming of having encoded according to scanning sequency, there is the trend that the length of run becomes very long.Thereby, there is the trend that coding and decoding must adopt larger form or code amount to increase.In addition, the size of object piece is larger, and these trend are particularly outstanding.
In addition, mean run, the size of the VLC form of the combination of level}, basically with scanning sequency on run length maximum, be that the area of object piece is proportional.
According to the above-mentioned formation of moving image decoding apparatus 1, the decoding of carrying out based on relative position because do not carry out the decoding of the length based on run to process is processed, so also can not adopt expression { run, the VLC form of the combination of level}.
Its result, can reduce the size of VLC form.In addition, about dynamic image encoding device 2, also can obtain same action effect.The form of the scanning sequency of the coded object piece integral body that in addition, expression is larger does not need yet.
(3) execution mode 3
Based on Figure 23~Figure 25, another embodiment of the invention is described, as described below.In addition, for convenience of explanation, give identical label about the parts with function identical with the accompanying drawing illustrated in described execution mode 1, and the description thereof will be omitted.
Hand-off process A below is described: " the only processing of n coefficient of the low frequency component side in coding/decoding object piece (for example, in the situation that the size more than 16 * 16 refers to n=64) " and treatments B: the method that the processing of encoding in " carrying out the processing (with reference to Fig. 4, S10) that Region Segmentation is carried out the coding/decoding coefficient "/decoding is processed.
In addition, suppose the coding method identifier of having encoded for hand-off process A and treatments B in coded data.That is,, in following example, the coding method identifier has been specified any one that process in the middle of A and treatments B.
Carried out explanation although carry out the situation of above-mentioned treatments B about TU information decoding section 12, in following example, hypothesis TU information decoding section 12 also carries out above-mentioned processing A except above-mentioned treatments B.
In addition, in the following description, the size of object piece is assumed to be 16 * 16 as an example.
(handling process)
Carry out the handling process of coding/decoding when with Figure 23, hand-off process A and treatments B being described.Figure 23 carries out the flow chart that an example of the handling process of coding/decoding is meaned about hand-off process A and treatments B the time.
In Figure 23, the coding processing of dynamic image encoding device 2 and the decoding of moving image decoding apparatus 1 are shown in the lump and process.In addition, although below about the action of moving image decoding apparatus 1 side, be illustrated, the action of dynamic image encoding device 2 sides is also roughly the same.
In addition, in following example, the coding method identifiers are judged by the TU information decoding section 12 of hypothesis judgement moving image decoding apparatus 1.
When processing beginning, as shown in figure 23, at first coding method identifiers (S101) are judged by TU information decoding section 12.
In the situation that the coding method identifier means to process A (processing A in S101), TU information decoding section 12 carries out and processes A (S102).That is, TU information decoding section 12 64 of the upper lefts coefficient of the low frequency component side that is positioned at the object piece of only decoding.
At this, explain the coefficient that is made as decoder object in processing A with Figure 24, as described below.Also can carry out the piece of infra-frame prediction or carry out the piece of inter prediction according to the object piece, change the coefficient that is made as decoder object in processing A.
Concrete example is shown, as described below.At first, about carrying out the piece of inter prediction, as shown in Figure 24 (a), the coefficient of regional RInter that TU information decoding section 12 will be positioned at upper left 8 * 8 sizes of low frequency component side is made as decoder object.
In addition, about carrying out the piece of infra-frame prediction, the coefficient that TU information decoding section 12 will be positioned at the regional RIntra shown in Figure 24 (b) is made as decoder object.That is, TU information decoding section 12 will be made as decoder object from 64 coefficients of the 1st DC coefficient to the on the sawtooth scan order.The number of the coefficient comprised in numeral " 64 " the expression zone shown in the RIntra of zone.
In addition, in Figure 24 (b), for convenience of explanation, although the shape of regional RIntra with right-angled triangle, meaned, be noted that stricti jurise is said actually, regional RIntra does not become right-angled triangle.
On the other hand, in the situation that the coding method identifier means treatments B (treatments B in S101), TU information decoding section 12 carries out treatments B (with reference to S10, Fig. 4).
(data structure)
Carry out the data structure of illustration coefficient coding data with Figure 25.Figure 25 is the figure that the data structure about the coefficient coding data is meaned.
As mentioned above, coding method identifier FLG has specified the mark of processing A or treatments B.
Process A in the situation that coding method identifier FLG has specified, the coefficient coding data can adopt the data structure shown in DATA1 (below be labeled as coefficient coding data DATA1).64 coefficient datas in upper left (run, level, sign) that comprise the low frequency component side that is positioned at the object piece in encoded data D ATA1.
On the other hand, in the situation that coding method identifier FLG has specified treatments B, the coefficient coding data can adopt the data structure shown in DATA2 (below be labeled as coefficient coding data DATA2).
Coefficient coding data DATA2 comprises nonzero coefficient mark * n (n=number of regions), coefficient data [zone 1]~[regional n].
As mentioned above, the nonzero coefficient mark refers to having or not of the nonzero coefficient of decoding in zone.Due to the number in coding and decoding zone only, so the nonzero coefficient mark means with " * n ".
For example, if nonzero coefficient is labeled as " 1 (very) ", is illustrated in this decoding zone nonzero coefficient is arranged, if nonzero coefficient is labeled as " 0 (puppet) ", be illustrated in this decoding zone without nonzero coefficient.
Coefficient data [regional x] (x=1~n) comprises the coefficient data in the zone of respectively decoding.In addition, in the situation that the nonzero coefficient of nonzero coefficient mark in meaning regional x without, omit coefficient data [regional x].
(action effect)
In the situation that the size of object piece is large, because the number change of the coefficient that will encode is many, the difference of therefore processing the efficiency between A and treatments B is showed greatlyr.In other words, processing between A and treatments B the most proud image characteristics difference.Thus, if only utilize a kind of processing mode to be encoded, likely make code efficiency reduce greatlyr.
Carry out coding/decoding when switching above-mentioned processing A and treatments B, can suppress or improve the reduction because of the caused code efficiency of variation of processing mode thus.According to efficient activity, also may offset the code amount for the coded system identifier of hand-off process A and treatments B.
(variation)
The unit of in addition, the coded system identifier being encoded is arbitrarily.For example, can be encoded with LCU unit.
In addition, also can, according to the situation of coding, by condition criterion, come hand-off process A and treatments B.In this case, even if do not express and process A and treatments B by the coding method identifier, also can determine the processing mode that acquiescence is carried out, therefore also can omit the coding method identifier.
As the benchmark of condition criterion, can be made as attribute, the state of piece, the parameter of regulation.
More specifically, as described below.Also can the size based on object piece (change of scale) be judged.For example, also can be configured to, if 16 * 16 are carried out and process A, if 32 * 32 are carried out treatments B.
In addition, also can utilize the predictive mode of object piece to be judged.For example, also can be configured to, in the situation that be that intra prediction mode is carried out processing A, if inter-frame forecast mode is carried out treatments B.
The change of scale size of adjacent block that in addition, also can be based on the object piece is judged.For example, also can be configured to, for example, if the change of scale of the left adjacent block of object piece is more than given size (32 * 32 sizes), carries out in this object piece and process A, otherwise carry out treatments B.
In addition, except " more than given size ", also can apply " more than the size of object piece ", " being less than given size ", " equal'sing given size " etc. decision condition.
In the situation that the edge comprised in the little object piece of the size of change of scale change is many, the number that has thus the nonzero coefficient comprised in the object piece becomes many trend.Thus, more preferably be configured in the situation that the little treatments B of carrying out of the size of change of scale.
In addition, also can the judgement for the treatment of mode by the parameter of the regulation beyond the coding method identifier.Also can carry out the hand-off process mode based on additional profile identifiers such as headers.The rank of the ability of regulation decoder that in addition, also can be additional based on header etc., the complexity of bit stream is carried out the hand-off process mode.
In addition, although treatments B is made as the processing that Region Segmentation is carried out the coding/decoding coefficient of carrying out shown in Fig. 4, be not limited to this.Treatments B can be also to specify the treatment S 20 of coding/decoding coefficient by relative position shown in Figure 16.
In addition, for example dynamic image encoding device 2 also can try the coding for the treatment of S 10 and treatment S 20, or the estimated code amount, and by code efficiency, the coding method identifier is appointed as in good processing.
(4) execution mode 4
Based on Figure 26~Figure 36, an execution mode more of the present invention is described, as described below.In addition, for convenience of explanation, give identical label about the parts with function identical with the accompanying drawing illustrated in described execution mode 1, and the description thereof will be omitted.
(Region Segmentation of level)
Below explanation carries out by object piece level ground the method that Region Segmentation is carried out coding/decoding.
Dynamic image encoding device 2 also can be cut apart object piece BLK level ground in the coding of object piece BLK.In this case, in the coding of object piece BLK, the TU information coding section 280 of dynamic image encoding device 2 carries out following processing ENC and DIV repeatedly.
Process ENC: do not encoded in the zone of dividing processing object;
Process DIV: the zone of dividing processing object will be made as next zone of processing object by cutting apart each zone obtained.
In addition, the object piece becomes the zone that is made as at first the processing object.In addition, the good processing of code efficiency on the whole in the middle of the 280 selection processing P of TU information coding section and Q.
For example, TU information coding section 280 is about processing object, can only carry out the situation of processing ENC, with the tentative DIV of processing after carry out again between the situation of processing ENC, the comparison code amount, decision yard quantitative change is such treatment step less.
With Figure 26, the example that it is concrete is described.Figure 26 illustration is carried out the situation about cutting apart of 2 levels.
As shown in figure 26, object piece BLK comprises the 1st level coding region R10, R20, R30, the R40 in the situation about cutting apart of carrying out 1 level.
In addition, about the 1st level coding region R30 and R40, be divided into 2 levels.That is, the 1st level coding region R30 comprises the 2nd level coding region R31~R34, and the 1st level coding region R40 comprises the 2nd level coding region R41~R44 in addition.
(decision of cutting apart)
The object piece BLK shown in Figure 26 is cut apart by the operation of next in TU information coding section 280.
Operation [1]
TU information coding section 280, using object piece BLK as the zone of processing object, carries out and processes ENC and process DIV.That is, regional code section 282 carries out and processes ENC, and 282 storages of regional code section are because being directed to the caused code amount of the performed processing ENC of object piece BLK.In addition, Region Segmentation section 281 is directed to object piece BLK and carries out processing DIV, obtains the 1st level coding region R10~R40.
Operation [2]
Regional code section 282 carries out processing ENC about each of the 1st level coding region R10~R40, and storage is directed to the code amount of the performed processing ENC of the 1st level coding region R10~R40 integral body.
Operation [3]
TU information coding section 280 compares the code amount of operation [1] and the code amount of operation [2].At this, suppose that the code amount of operation [2] is little.
Operation [4]
Region Segmentation section 281 also carries out and processes DIV the 1st level coding region R10~R40, obtains the coding region of the 2nd level.
Operation [5]
TU information coding section 280 is relatively because of until the caused code amount of the coding of the 1st level and because of until the caused code amount of the coding of the 2nd level, adopts and yard measures a little treatment step.
Operation [6]
According to the comparative result of operation [5], suppose: the code amount be encoded to about the 1st level coding region R10, R20 in the situation of the 2nd level is many, and the code amount be encoded to about the 1st level coding region R30, R40 in addition in the situation of the 2nd level is few.
Thus, the 1st level coding region R10, R20, the 2nd level coding region R31~R34, R41~R44 have finally been determined.
(dividing mark and nonzero coefficient mark)
The dividing mark of the situation of cutting apart that 282 coded representations of regional code section are performed by Region Segmentation section 281.
Carry out particularly illustration, as described below.Regional code section 282 is about the cut apart fixed zone performed by Region Segmentation section 281, code division mark " 1 ".In addition, regional code section 282 is about the fixed zone of the situation of not cut apart by Region Segmentation section 281, code division mark " 0 ".
In addition, in Figure 26, with hatched region representation, be the zone that has a nonzero coefficient at least, with hatched region representation, be the zone of nonzero coefficient no one.
Therefore, regional code section 282 also codified mean to process the nonzero coefficient mark had or not of the nonzero coefficient in the zone of object.For example, in the situation that have the nonzero coefficient nonzero coefficient to be labeled as " 1 ", in the situation that be labeled as " 0 " without the nonzero coefficient nonzero coefficient.
(labelled tree)
With Figure 27, the example with form coding dividing mark and the nonzero coefficient mark of labelled tree FT is described.Figure 27 illustrates the performance example (quaternary tree performance) of the labelled tree FT of cutting apart situation and coefficient distribution situation that means the object piece BLK shown in Figure 26.
Hierarchical structure by ROOT amplitude, LEVEL1 and the LEVEL2 of labelled tree FT forms.The ROOT amplitude of labelled tree FT and LEVEL1 are corresponding to dividing mark.The LEVEL2 of labelled tree FT is corresponding to the zero coefficient mark.
In the ROOT of labelled tree FT amplitude, the dividing mark FRoot of object piece BLK is encoded.In addition, in the LEVEL1 of labelled tree FT, dividing mark F10, F20, F30, the F40 of the 1st level coding region R10, R20, R30 and R40 are encoded.
And, in the LEVEL2 of the leaf that becomes labelled tree FT (endpoint node), the nonzero coefficient mark is encoded.
In Figure 27, the leaf surrounded by circle illustrates the nonzero coefficient mark of the 2nd level coding region R41, R42, R43 and R44 according to order from left to right, respectively coding " 1 ", " 0 ", " 0 " and " 1 ".
In addition, about the 2nd level coding region R41, R42, R43 and R44, although encoded, amount to the nonzero coefficient mark of 4, also the model of these 4 marks can be made as to 1 group, carry out and the corresponding Variable Length Code of the occurrence frequency of this model.According to this Variable Length Code, can cut down fifty-fifty the code amount of quaternary tree.
The many zone about nonzero coefficient, because the length of run on average shortens, therefore undivided situation has the trend that code efficiency is improved.
With respect to this, in the situation that zero coefficient is many, be that the distribution density of coefficient is low, because the length of run is on average elongated, therefore, by adopting the coding based on Region Segmentation, can improve code efficiency thus.
(decoding in zone is processed)
Secondly, with Figure 28, the processing of decoding by the coefficient coding data after above-mentioned gimmick coding is described in moving image decoding apparatus 1.Figure 28 is the flow chart that the example about the flow process of the decoding treatment S 200 in the zone in moving image decoding apparatus 1 is meaned.
In addition, because regional decoding treatment S 200 is that recursion type is processed, therefore, in the calling of upper, the object piece is carried out to each step comprised in the decoding treatment S 200 of execution area as the zone of processing object.
As shown in figure 28, at first, in moving image decoding apparatus 1, Region Segmentation section 121 comes the zone whether divided (S201) of determination processing object by the reference dividing mark.
In the not divided situation in zone of processing object ("No" in S201), carry out coefficient decoding treatment S 210.
That is, at first regional decoding section 122 judges whether have nonzero coefficient (S211) in zone by reference nonzero coefficient mark.
In the situation that have nonzero coefficient ("Yes" in S211) in zone, regional decoding section 122 processes decoding in the lump by the decoding of distance of swimming amplitude mode and processes the coefficient whole (S212) comprised in the zone of object.
With respect to this, in the situation that do not have nonzero coefficient ("No" in S211) in zone, regional decoding section 122 skips the decoding in the zone of processing object and processes.
210 end of coefficient decoding treatment S, the decoding processing of processing the zone of object finishes, and carries out the decoding treatment S 200 in the zone relative with the zone of next processing object.
On the other hand, in the divided situation in zone of processing object ("Yes" in S201), carry out cut zone decoding treatment S 220.
That is, the dividing mark (S221) that at first 122 decodings of regional decoding section are encoded about each zone, enter into and the cut apart corresponding circulation LP200 in each zone.
In circulation LP200, the decoding treatment S 200 in zone is carried out (S223) by recursion ground, then turns back to the front (S224 to S223) of circulation LP200, carries out in order the processing in circulation about each cut apart zone later.
During end process, cross circulation LP200 when the Zone Full about cut apart, cut zone decoding treatment S 220 finishes.Then, the decoding treatment S 200 in executory zone finishes.In the situation that recursion call regional decoding treatment S 200, control the source of calling that turns back to.
In addition, the above coding owing to having avoided unnecessary mark, therefore, in the piece of the size of further not cutting apart, dividing mark also can be judged to be puppet all the time.In addition, in the situation that process the zone of object, be object piece itself, the nonzero coefficient mark also can be judged to be very all the time.
(data structure)
Below, be illustrated in the data structure of coefficient coding data decoded in regional decoding treatment S 200 with Figure 29~Figure 32.
(1) preserve dispersedly the example of mark
Data structure in the situation of preserving dispersedly mark is described with Figure 29.As shown in figure 29, preserve dividing mark FRoot in the front of coefficient coding data.If dividing mark FRoot is " 0 ", the coefficient coding data adopt the data structure shown in DATA11 (below be labeled as coefficient coding data DATA11) as an example.The coefficient data (run, level, sign) that comprises the amount of 16 * 16 of object piece in encoded data D ATA11.
On the other hand, if dividing mark Froot is " 1 ", the coefficient coding data adopt the data structure shown in DATA12 (below be labeled as coefficient coding data DATA12) as an example.
Coefficient coding data DATA12 comprises area information [zone 1] F1~area information [regional n] Fn.In addition, above-mentioned " regional n " refers to " zone " of the 1st level.If the object piece BLK shown in use Figure 26 carries out illustration, regional R10~R40 is equivalent to above-mentioned " zone ".
At this, the detailed data structure of area pellucida domain information according to Figure 29 [zone 1] F1.Area information [zone 1] F1 comprises dividing mark [zone 1] F10.
At this, nonzero coefficient mark and coefficient data, in the situation that dividing mark [zone 1] F10 comprises coefficient information F12 for " 0 ", comprise coefficient information F11 in the situation of this external dividing mark [zone 1] F10 for " 1 ".
In the situation that dividing mark [zone 1] F10 is " 0 ", because further do not carry out Region Segmentation, so just comprise nonzero coefficient mark [1] and coefficient data [1] in coefficient information.
In the situation that dividing mark [zone 1] F10 is " 1 ", suppose that zone 1 is divided into n.In this case, comprise nonzero coefficient mark [1-1], coefficient data [1-1]~nonzero coefficient mark [1-n], coefficient data [1-n] in coefficient information F11.
About other area information, because F1 is identical with area information [zone 1], so the description thereof will be omitted.
Secondly, the concrete example of coefficient coding data DATA12 shown in Figure 29 is described with Figure 30.Figure 30 is the figure that the example of coefficient coding data that mean to use the illustrated object piece BLK of Figure 26 is shown.
Because the object piece BLK shown in Figure 26 specifies 4 fork trees to cut apart with the ROOT amplitude, so dividing mark Froot is " 1 ".
In addition, coefficient coding data DATA12 comprises area information F1~F4 corresponding to decoding zone R10~R40 comprised with object piece BLK.In coefficient coding data DATA12, according to the order of area information F1, F2, F3 and F4, carry out save data.Below, the data about comprising in area information F1~F4, describe in order.
The data that comprise in declare area information F1, as described below.Because decode, regional R10 is not further cut apart, thus in area information F1 dividing mark [1]=0.In addition because the regional R10 that decodes comprises nonzero coefficient, so in area information F1 nonzero coefficient mark [1]=1.In addition, area information F1 comprises coefficient data [1].
Because decode, not divided and one of the nonzero coefficient of regional R20 does not comprise yet, so comprise dividing mark [2]=0, nonzero coefficient mark [2]=0 in area information F2.
The data that comprise in declare area information F3, as described below.The regional R30 that decodes is divided into the regional R31~R34 of decoding.Thus, area information F3 comprises dividing mark [3]=1.
In addition, in decoding regional R31, R33, comprise nonzero coefficient, in decoding regional R32, R34, one of nonzero coefficient does not comprise yet.
Thus, about the regional R31 that decodes, comprise nonzero coefficient mark [3-1]=1 and coefficient data [3-1].About decoding regional R33 too.
In addition, about regional R32 and the R34 of decoding, comprise respectively nonzero coefficient mark [3-2]=0 and nonzero coefficient mark [3-4]=0.
The data that comprise in declare area information F4, as described below.The regional R40 that decodes is divided into the regional R41~R44 of decoding.Thus, area information F4 comprises dividing mark [4]=1.
In addition, in decoding regional R41, R44, comprise nonzero coefficient, in decoding regional R42, R43, one of nonzero coefficient does not comprise yet.
Thus, about the regional R41 that decodes, comprise nonzero coefficient mark [4-1]=1 and coefficient data [4-1].About decoding regional R44 too.
In addition, about regional R42 and the R43 of decoding, comprise respectively nonzero coefficient mark [4-2]=0 and nonzero coefficient mark [4-3]=0.
(2) labelled tree is stored in the lump to the example of data front
Illustrate labelled tree is stored in to the data structure in the situation of data front in the lump with Figure 31.As shown in figure 31, preserve labelled tree FT in the front of coefficient coding data.
At this, in the situation that labelled tree FT means without the cutting apart of object piece, employing be the data structure shown in DATA21 (coefficient coding data DATA21).The coefficient data (run, level, sign) of 16 * 16 that comprises the object piece in encoded data D ATA21.
On the other hand, in the situation that labelled tree FT means the object piece is cut apart once is above, employing be the data structure shown in DATA22 (coefficient coding data DATA22).
Coefficient coding data DATA22 comprises coefficient data [zone 1]~coefficient data [regional n].In addition, above-mentioned " regional n " refers to " zone " further do not cut apart.Use the object piece BLK shown in Figure 26 to carry out illustration, regional R10, regional R31 etc. are equivalent to above-mentioned " zone ".
Secondly, the concrete example of coefficient coding data DATA22 shown in Figure 31 is described with Figure 32.Figure 32 is the figure that the example of coefficient coding data that mean to use the illustrated object piece BLK of Figure 26 is shown.
At first preserve labelled tree FT in coefficient coding data DATA22.At first preserve dividing mark FRoot in labelled tree FT.Next, preserve in order the mark relevant to the regional R10~R40 of decoding.
In coefficient coding data DATA22, preserve the regional coefficient data of respectively decoding after labelled tree FT.With coefficient data, there is coefficient data [1], coefficient data [3-1], coefficient data [3-3], coefficient data [4-1] and the coefficient data [4-4] that the regional R10 of more than one decoding, R31, R33, R41 and R44 are corresponding to preserve in order.
(variation)
[omission of the counting of run]
Below with Figure 33 and Figure 34, illustrate that zone that nonzero coefficient is labeled as " 0 (puppet) " is not contained in the example in the counting of run.Figure 33 and Figure 34 illustrate the 1st level coding region R30 of Figure 26.
In addition, in the following description, suppose object piece BLK is 16 * 16 sizes illustratively.Suppose that the 1st level coding region R30 is 8 * 8 sizes, suppose that the 2nd level coding region R31~R34 is 4 * 4 sizes.
Now, TU information coding section 280 for example, in the situation that the coding region of processing object is to carry out the judgement had or not of Region Segmentation and nonzero coefficient below given size (8 * 8 sizes), but scanning and coding carry out with 8 * 8 units.
Figure 33 is the figure that the 1st level coding region R30 about Figure 26 at length means.Such as shown in figure 33, Region Segmentation section 281 is divided into the 2nd level coding region R31~R34 by the 1st level coding region R30.
Regional code section 282 is about the 1st level coding region R30 and code division mark " 1 ", and then coding nonzero coefficient mark " 1001 ".
In addition, regional code section 282 is using the 1st level coding region R30 processing of encoding as the object coding zone.Spread all over the arrow shown in the 1st level coding region R30 integral body and mean scanning sequency.
At this, distance of swimming pattern-coding section 202 is labeled as at nonzero coefficient in the coded object zone R32 of " 0 (puppet) " and R34 and does not count run.
That is,, in the coding of coefficient is processed, distance of swimming pattern-coding section 202 carries out the processing of distance of swimming pattern-coding as follows like that.
At first, the coding of supposing the nonzero coefficient A1 shown in Figure 34 completes.Then, in the coding of next nonzero coefficient is processed, distance of swimming pattern-coding section 202 sequentially reads coefficient according to the sawtooth scan of the reverse order shown in arrow in Figure 34.At this, the next nonzero coefficient on the sawtooth scan of reverse order order be A3 (below, utilize nonzero coefficient A3 carry out with reference to).There are 9 zero coefficients between nonzero coefficient A1 and nonzero coefficient A3.
Distance of swimming pattern-coding section 202 is not contained in the zero coefficient in the 2nd level coding region R32 and R34 in the counting of run.Thereby, in the coding of next nonzero coefficient A3 is processed, only count zero coefficient A2 in the 2nd level coding region R33 as run.Thus, the 202 coding run=1 of distance of swimming pattern-coding section.
In addition, distance of swimming pattern-coding section 202 also can determine whether according to the two-dimensional coordinate value of each coefficient it is that nonzero coefficient is labeled as the zone of " 0 ".
In addition, decode like that as follows to process in moving image decoding apparatus 1 side and get final product.That is,, arrange the processing of the coefficient of preserving on ground in again being stored in coefficient matrix by scanning sequency after carrying out each coefficient of decode when, be labeled as at nonzero coefficient in the zone of " 0 " to skip and preserve processing and get final product.
[Region Segmentation based on regulation]
Below, use Figure 35 and Figure 36 Benq in the Region Segmentation of regulation.
Desorption coefficient also can be cut apart to come about the decoder object zone of regulation as predesignated in TU information decoding section 12.
Figure 35 and Figure 36 mean the partitioning scheme in the object piece BLK of 16 * 16 sizes.In addition, in Figure 35 and Figure 36, object piece BLK is divided into the 1st level regional R101~R104 that decodes.In addition, about divisible the 1st level decoding zone to the 2nd level decoding zone, enclose dotted line (for example, refer to the 1st level decode regional R102~R104) in Figure 35.
That is, the Region Segmentation section 121 of TU information decoding section 12 also can be set as cutting apart all the time the decoder object zone (divisible) of regulation, also can be set as not cutting apart all the time (indivisible).
For example, as shown in figure 35, the regional R101 that also the 1st level of upper left 8 * 8 can be decoded in object piece BLK is made as indivisible.Divisible the 1st level of the Region Segmentation section 121 regional R102~R104 that decodes.
In addition, in the zone that approaches the DC component, exist the possibility of nonzero coefficient high.Thus, in decoding regional R101, the 1st level there is the situation of not carrying out cutting apart than the situation of being cut apart and the trend that the code efficiency of dynamic image encoding device 2 sides is improved.
In addition, for example, as shown in figure 36, the regional R104 that also can only the 1st level of bottom right 8 * 8 be decoded in object piece BLK is made as divisible.Divisible the 1st level of the Region Segmentation section 121 regional R104 that decodes.
Because the zone of high fdrequency component side exists the possibility of zero coefficient high, the situation of therefore being cut apart has the trend that the code efficiency of dynamic image encoding device 2 sides is improved.
In addition, the Region Segmentation section 121 decoding zone that also ration of division given size is larger all the time.In addition, the decoding zone that Region Segmentation section 281 also can the size that ration of division given size is not little all the time.For example, form like that as described below.
The decoding zone of Region Segmentation section 121 size that also ration of division 8 * 8 sizes are larger all the time.In addition, Region Segmentation section 121 also can be set as " 1 " by upper (being the object piece) dividing mark all the time.
In addition, the zone of 4 * 4 sizes that comprise in the object piece of the above size of 4 * 4 sizes also can further not cut apart in Region Segmentation section 121.
In addition, Region Segmentation section 121 also can not carry out above the cutting apart of 3 levels.
In addition, this variation also can be applied the TU information coding section 280 of dynamic image encoding device 2.
(other variation)
Also the Region Segmentation based on regulation shown in the selection of the coded system shown in execution mode 3, execution mode 4 only can be applied to the object piece of specific dimensions, shape, specific slice type.
For example, there is the more trend of zero coefficient in the B section.Thus, in B section, also can not cut apart and 64 of the upper lefts coefficient of coded object piece only.
According to these distortion, can suppress the increase of flag code amount.
In addition, by shown in execution mode 4, cutting apart the zone that gimmick obtains, also can carry out the coding of the coefficient based on the relative position appointment shown in execution mode 2.
In addition, the variation of [number and the size in zone] shown in execution mode 1 also can be applicable to execution mode 2~4.
In addition, the present invention also can show as described below.
; in picture decoding apparatus involved in the present invention; the change of scale cutting unit is divided into a plurality of subunits by change of scale; the conversion coefficient decoding unit is with reference to the decoded information for the above-mentioned conversion coefficient of acquisition among coded data, the decoded information be assigned with according to each above-mentioned subunit, the conversion coefficient comprised in the above-mentioned subunit of decoding.
In addition, in the conversion coefficient obtained according to each change of scale, the pixel value of object images being carried out to frequency translation is encoded obtained coded data, the length of coding continuous zero coefficient along the scanning sequency of regulation, the above-mentioned decoded information defined in the zone about being arranged in lower frequency side, the length of above-mentioned continuous zero coefficient is shorter overstepping the bounds of proprietyly joins short code.
In the zone of low frequency component side, the frequency occurred due to nonzero coefficient is high, therefore has the trend that the length of continuous zero coefficient shortens.
According to above-mentioned formation, distribute shorter code because the length of continuous zero coefficient is shorter, so can be considered according to regional position the good decoding processing of efficiency of above-mentioned trend.Thus, can reduce the code amount that will decode.
In addition, the above-mentioned decoded information defined in the zone about being arranged in high frequency side, about the group of the parameter of the absolute value that comprises above-mentioned conversion coefficient, join shorter code to the component of the parameter that above-mentioned absolute value is 1.
There is the trend that the absolute value integral body of conversion coefficient diminishes in the zone of high fdrequency component side.Thereby, if the non-vanishing coefficient of conversion coefficient has absolute value and becomes 1 trend.
According to above-mentioned formation, can join shorter code to the component that uprises such parameter at the position of high fdrequency component side occurrence frequency.
In addition, the group of parameter for example refer under distance of swimming pattern { the absolute value here is corresponding to level for run, the group of level}.
Thus, can be considered according to regional position the good decoding processing of efficiency of above-mentioned trend.Thus, can reduce the code amount that will decode.
In addition, in above-mentioned decoded information, according to the length of code, carry out specified order, above-mentioned decoded information updating block carrys out said sequence in advance according to the position of the above-mentioned subunit in above-mentioned change of scale in above-mentioned renewal.
According to above-mentioned formation, the order of coming in advance the length according to the code to parametric distribution to determine according to the position of the above-mentioned subunit in above-mentioned change of scale.
The order determined according to the length of above-mentioned code for example refers to Code Number.That is, in the above-described configuration, by Code Number in advance, thus will give parametric distribution yard length be updated to shorter length.
According to above-mentioned formation, just can realize the renewal of the length of code by the comparatively simple process of Code Number in advance etc.
In addition, above-mentioned change of scale cutting unit carries out above-mentioned recursion according to the position in the zone of cutting object and at least one party in the middle of size and cuts apart.
According to above-mentioned formation, owing to also can not understanding the mark that representation is cut apart, therefore can reduce the code amount that will decode.
(information of remarks)
A side of the present invention is described, as described below.; picture decoding apparatus involved in the present invention is configured in order to solve above-mentioned problem; this conversion coefficient of decoding among the coded data obtained from conversion coefficient encodes; this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains; above-mentioned picture decoding apparatus possesses: the change of scale cutting unit, and it is divided into a plurality of subunits by above-mentioned change of scale; With the conversion coefficient decoding unit, the conversion coefficient that it is decoded in above-mentioned subunit and comprise with reference to decoded information, this decoded information is for obtain the decoded information of above-mentioned conversion coefficient, the decoded information be assigned with according to each above-mentioned subunit among above-mentioned coded data.
In addition, picture coding device involved in the present invention is configured in order to solve above-mentioned problem, conversion coefficient is encoded, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, above-mentioned picture coding device possesses: the change of scale cutting unit, and it is divided into a plurality of subunits by above-mentioned change of scale; With the transform coefficients encoding unit, the conversion coefficient that it is encoded in above-mentioned change of scale and comprise with reference to coded message, this coded message is for the coded message of the above-mentioned conversion coefficient of encoding, the coded message that is assigned with according to each above-mentioned subunit.
In addition, the data structure of coded data involved in the present invention is configured in order to solve above-mentioned problem, by conversion coefficient is encoded and is generated, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, in the data structure of above-mentioned coded data, comprise the relative position of the position of the above-mentioned conversion coefficient that becomes coded object with respect to the position of previous above-mentioned conversion coefficient of encoding out, the picture decoding apparatus of thus above-mentioned coded data being decoded, according to the previous position of above-mentioned conversion coefficient in above-mentioned change of scale decoded, with above-mentioned relative position, determine the position of the above-mentioned conversion coefficient that becomes decoder object.
According to above-mentioned formation, the change of scale that at first will become the object of decoding in decoding is processed is divided into a plurality of subunits.
Change of scale refers to, pixel value is transformed into to the unit of frequency field.As change of scale, such as size of enumerating 64 * 64 pixels, 32 * 32 pixels, 16 * 16 pixels etc.
In the situation that change of scale is 16 * 16 sizes, subunit can be for example the zone of 8 * 8 sizes.
In addition, according to above-mentioned formation, will be made as the processing object, the conversion coefficient comprised in this subunit of decoding by cutting apart each ground of a plurality of subunits obtained.Order to the decoding subunit is not particularly limited, also can be according to the random order processing of decoding.
In addition, in the above-described configuration, when the decoding conversion coefficient, reference is assigned to the decoded information of each unit of a plurality of subunits.
Decoded information refers to, for the information of the parameter value of the regulation of reproduction conversion coefficient among the code from coded data (bit string).For example, decoded information means the form for the foundation correspondence of the parameter value of the regulation of reproduction conversion coefficient among the code from coded data.Again for example, decoded information is the calculating formula for the parameter value of the regulation of induced transformation coefficient among the code from coded data.
That is to say, in the above-described configuration, use the less subunit about the size than original transform unit and the decoded information stipulated, the conversion coefficient of decoding.
Thereby the situation that the decoded information of stipulating than the size based on about original transform unit is decoded and processed, can bring into play amount of information, this effect of the amount of calculation based on decoded information that can reduce decoded information.
And then can say, because can reduce to become the number of the conversion coefficient of object in decoding is processed, so the size of the scanning form that the scanning sequency of conversion coefficient is defined also can be made as less.
In addition, furtherly, can suppress lowlyer by amount, the disposal ability of the memory that necessitates in decoding is processed.
In addition, subunit also can be consistent with any one of coding units in the technology of non-patent literature 1,2.In this case, can be in above-mentioned coding units general by predefined VLC form, be decoded information.
In addition, according to the picture coding device formed as described above or the data structure of coded data, can bring into play the effect same with picture decoding apparatus involved in the present invention.
In addition, preferred in picture decoding apparatus involved in the present invention, above-mentioned conversion coefficient decoding unit is with reference to nonzero information, when this nonzero information means the conversion coefficient without the non-zero in above-mentioned subunit, omit the decoding of above-mentioned subunit and process, this nonzero information means the having or not of conversion coefficient of the non-zero in above-mentioned subunit.
According to above-mentioned formation, can avoid carrying out unnecessary decoding according to having or not of nonzero coefficient in area unit and process.
In addition, in picture decoding apparatus involved in the present invention preferably, above-mentioned decoded information according to the position of the above-mentioned subunit in above-mentioned change of scale by adaptively defining.
In change of scale, in the low frequency component side that comprises the DC component and high fdrequency component side, the appearance trend difference of the value of conversion coefficient.For example, near the low frequency component side DC component occurs that the possibility of nonzero coefficient is high.In addition, occur that in the high fdrequency component side possibility of zero coefficient is high.
According to the position adaptively defining, for example referring to, is that low frequency component side or low frequency component side are carried out the adaptively defining decoded information according to the position of above-mentioned subunit.
In addition, self adaptation refers to, according to above-mentioned appearance trend, carrys out allocation of codes.For example in the high fdrequency component side, refer to, distribute shorter code to the little value of absolute value of zero coefficient or conversion coefficient.
In addition, for example in the low frequency component side, refer to, distribute shorter code to the large value of absolute value of nonzero coefficient or conversion coefficient.
According to above-mentioned formation, can carry out the decoding good with the corresponding efficiency in regional position and process.Can reduce the code amount that will decode.
In addition, in picture decoding apparatus involved in the present invention, preferably, possess: the decoded information updating block, it is according to the occurrence frequency of the parameter that means above-mentioned conversion coefficient, and the code that will in above-mentioned decoded information, be assigned to this parameter is updated to shorter code.
According to above-mentioned formation, the high parameter about occurrence frequency, can distribute shorter code.That is, can make the occurrence frequency of parameter dynamically be reflected into the length of code.
Thus, the high parameter about occurrence frequency, can reduce the code amount that will decode.
In addition, preferred in picture decoding apparatus involved in the present invention, above-mentioned conversion coefficient decoding unit carries out following decoding to be processed: carried out the 1st mode decoding process under rated condition after, carry out the 2nd mode decoding process, the decode length of continuous nonzero coefficient, the absolute value of conversion coefficient and the code of conversion coefficient of the 1st mode decoding process wherein, the absolute value of the 2nd mode decoding process decoding conversion coefficient and the code of conversion coefficient.
According to above-mentioned formation, in decoding is processed, carried out this 1st mode decoding process of symbol (sign) of the absolute value (1evel) of length (run), conversion coefficient of the continuous nonzero coefficient of decoding and conversion coefficient under rated condition after, carry out the absolute value (1evel) of decoding conversion coefficient and this 2nd mode decoding process of symbol (sign) of conversion coefficient.The 1st mode decoding process refers to so-called distance of swimming pattern, and the 2nd mode decoding process refers to so-called amplitude mode.
The absolute value of afore mentioned rules condition such as the number of enumerating the conversion coefficient decoded, conversion coefficient etc.In addition, rated condition can be also the position in the change of scale of subunit, the corresponding condition of appearance trend of coefficient.
The technology that distance of swimming pattern and amplitude mode are for example adopted by non-patent literature 1,2.Among decoding in each zone is processed, can general this existing formation.Thus, can realize high code efficiency.
In addition, in picture decoding apparatus involved in the present invention, preferably, above-mentioned conversion coefficient decoding unit changes the afore mentioned rules condition according to the position of the above-mentioned subunit in above-mentioned change of scale.
Change the afore mentioned rules condition according to the position of the above-mentioned subunit in above-mentioned change of scale and for example refer to, be difficult to finish the 1st mode decoding process in the zone of low frequency component side, be easy to finish the 1st mode decoding process in the zone of high fdrequency component side.
Length with nonzero coefficient continuous in the zone of the low frequency component side trend that length shorter, continuous nonzero coefficient in the zone of high fdrequency component side becomes longer that becomes.Thereby, in the situation that the length of continuous nonzero coefficient is elongated, preferentially adopt the 1st mode decoding process.
In addition, also comprise the change of only carrying out the 1st mode decoding process and not carrying out the 2nd mode decoding process in the change of rated condition.
According to above-mentioned formation, can realize that the efficient decoding based on distance of swimming amplitude mode is processed.
In addition, in picture decoding apparatus involved in the present invention, preferably, possess: localized area decoding unit, its regulation zone that is defined in the low frequency component side in above-mentioned change of scale conversion coefficient of decoding; And switch unit, its switching is processed, is processed with the decoding performed by above-mentioned localized area decoding unit by above-mentioned change of scale cutting unit and the performed decoding of above-mentioned conversion coefficient decoding unit.
According to above-mentioned formation, can suitably switch to any one the good decoding processing mode of code efficiency of being decoded in the middle of the decoding processing mode of conversion coefficient by above-mentioned change of scale cutting unit and the performed decoding processing mode of above-mentioned conversion coefficient decoding unit and the regulation zone that is defined in the low frequency component side in above-mentioned change of scale.
In addition, in picture decoding apparatus involved in the present invention, above-mentioned a plurality of subunits of having cut apart are cut apart on preferred above-mentioned change of scale cutting unit recursion ground.
Existence has carried out to the zone of less size the few situation of code amount that situation that decoding processes will be decoded.According to above-mentioned formation, in the situation that it is few that the code amount that situation that decoding processes will decode has been carried out in the zone of less size, the processing of can decoding efficiently.
Furtherly, when nonzero coefficient is many, there is the more efficient situation of situation of not cut apart.In addition, when coding during the having or not of nonzero coefficient, because can control meticulously the zone of less size, so more effective.
Another side of the present invention is described, as described below.; picture decoding apparatus involved in the present invention is configured in order to solve above-mentioned problem; this conversion coefficient of decoding among the coded data obtained from conversion coefficient encodes; this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains; above-mentioned picture decoding apparatus possesses: the relative position decoding unit, and its decoding becomes the relative position of the conversion coefficient of decoder object at a distance of the previous conversion coefficient decoded; And position determination unit, its according to the previous above-mentioned conversion coefficient decoded the position in above-mentioned change of scale and above-mentioned relative position determine the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.
In addition, picture coding device involved in the present invention is configured in order to solve above-mentioned problem, conversion coefficient is encoded, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, above-mentioned picture coding device possesses: the relative position coding unit, its coding becomes the position of above-mentioned conversion coefficient of coded object with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out.
In addition, the data structure of coded data involved in the present invention is configured in order to solve above-mentioned problem, by conversion coefficient is encoded and is generated, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains, in the data structure of above-mentioned coded data, comprise the relative position of the position of the above-mentioned conversion coefficient that becomes coded object with respect to the position of previous above-mentioned conversion coefficient of encoding out, thus, the picture decoding apparatus that above-mentioned coded data is decoded, according to the previous position of above-mentioned conversion coefficient in above-mentioned change of scale decoded, with above-mentioned relative position, determine the position of the above-mentioned conversion coefficient that becomes decoder object.
According to above-mentioned formation, the position according to the previous above-mentioned conversion coefficient decoded in above-mentioned change of scale and above-mentioned relative position are determined the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.Thus, can determine based on relative position relevance ground the position of conversion coefficient.In addition, change of scale refers to, the unit of regulation converted.
In the situation that above-mentioned run is encoded, because scanning sequency is according to the rules counted the length of run, even if therefore close as the relative position of the nonzero coefficient of benchmark and the two-dimensional coordinate of next nonzero coefficient in change of scale, when result also exists run elongated, the situation that exists thus the code amount to increase.
This trend easily becomes in the zone of sparse high fdrequency component particularly remarkable at conversion coefficient.In addition, run is elongated to be referred to, must prepare larger form correspondingly.
With respect to this, if utilize relative position to determine the position of conversion coefficient, can cut down in this case the code amount.
According to above-mentioned formation, because utilize relative position to determine the position of conversion coefficient, therefore can reduce the code amount that should decode.
Its result, can bring into play amount of information, this effect of the amount of calculation based on decoded information that can reduce decoded information.
In addition, furtherly, can suppress lowlyer by amount, the disposal ability of the memory that necessitates in decoding is processed.
In addition, according to the picture coding device formed as described above or the data structure of coded data, can bring into play the effect same with picture decoding apparatus involved in the present invention.
In addition, preferred in picture decoding apparatus involved in the present invention, possess: decoding unit, its zone about the low frequency component side in above-mentioned change of scale and carry out that the 1st mode decoding is processed and the 2nd mode decoding is processed, the 1st mode decoding is processed length, the absolute value of conversion coefficient and the code of conversion coefficient of the continuous nonzero coefficient of decoding, and the 2nd mode decoding is processed the absolute value of decoding conversion coefficient and the code of conversion coefficient.
Above-mentioned decoding unit is carried out the decoding of so-called distance of swimming amplitude mode.For example, if this change of scale is 16 * 16 sizes, the zone of upper left 8 * 8 sizes that comprise the DC component is enumerated in the zone of the low frequency component side in change of scale.
In the zone of low frequency component side, because conversion coefficient becomes so not sparse, so the length of run also becomes shorter.Thus, can carry out efficiently the decoding based on distance of swimming amplitude mode.
According to above-mentioned formation, become and do not have so sparse zone about conversion coefficient, can carry out efficiently the decoding based on distance of swimming amplitude mode.
In addition, in picture decoding apparatus involved in the present invention, preferably, above-mentioned decoding unit changes the size of above-mentioned zone according to the characteristic that becomes the change of scale of decoder object.
The characteristic of change of scale refers to, the slice type of change of scale, predictive mode, change of scale size etc.Both above-mentioned various characteristics can be adopted in combination, also above-mentioned various characteristics can be alternatively adopted.For example, also can change the size of above-mentioned zone according at least one of the slice type of change of scale, predictive mode, change of scale size etc.
Be of a size of at 16 * 16 o'clock at change of scale, for example can form like that as follows.That is,, if predictive mode is frame mode, the size in the zone of above-mentioned low frequency component side is made as to 8 * 8.In addition, if predictive mode is inter-frame mode, the size in the zone of above-mentioned low frequency component side is made as to 4 * 4.
Thus, can change according to the characteristic of change of scale the zone by the decoding of distance of swimming amplitude mode.
Its result, can carry out the decoding based on distance of swimming amplitude mode efficiently.
In addition, above-mentioned moving image decoding apparatus 1 and each piece of dynamic image encoding device 2 can be realized by being formed on the logical circuit on integrated circuit (IC chip), also can adopt CPU (Central Processing Unit: CPU) realize on software on hardware.
In the situation that be the latter, above-mentioned each device possess the order of carrying out the program that realizes each function CPU, read-only memory), launch RAM (the Random Access Memory: of said procedure random access memory), storage devices such as memory (recording medium) of preservation said procedure and various data etc. preserved ROM (the Read Only Memory: of said procedure.And, by the software that is recorded as embodied on computer readable and realizes above-mentioned functions, be that the recording medium supply of program code (execute form program, intermediate code program, source program) of control program of above-mentioned each device is to above-mentioned each device, the program code recorded in recording medium is read and carried out to this computer (or CPU, MPU), the purpose that also attainable cost is invented thus.
As aforementioned recording medium, such as can adopt the band classes such as disk, cassette tape, comprise floppy disk (registered trade mark)/disks such as hard disk, programmable logic device), FPGA (Field Programmable Gate Array: the logical circuit class etc. such as field programmable gate array) the semiconductor memory class such as card class, the quick ROM of mask rom/EPROM/EEPROM/ such as the dish class of CD-ROM/MO/MD/DVD/CD-CDs such as R/ Blu-ray Disc (registered trade mark), IC-card (comprising storage card)/light-card or PLD (Programmable logic device:.
In addition, above-mentioned each device can be configured to and can be connected with communication network, via communication network, supply the said procedure code.This communication network is as long as energy transmission procedure code is not particularly limited.For example, can utilize internet, internal network, external network, LAN, ISDN, VAN, CATV communication network, Virtual Private Network (Virtual Private Network), telephone wire road network, mobile radio communication, satellite communication network etc.In addition, form the transmission medium of this communication network also so long as the medium of energy transmission procedure code gets final product, be not defined as specific formation or kind.For example, both can be at IEEE1394, USB, power line is carried, the cable tv circuit, telephone wire, (Asymmetric Digital Subscriber Line: ADSL (Asymmetric Digital Subscriber Line)) circuit etc. is used in wired ADSL, also can be at IrDA, the infrared rays such as remote controller, Bluetooth (registered trade mark), IEEE802.11 is wireless, HDR (High Data Rate: high data rate), NFC (Near Field Communication: the near field wireless communication technique), DLNA (Digital Living Network Alliance: DLNA), the portable phone net, satellite circuit, during bottom surface ripple digital network etc. are wireless, be used.In addition, even if, in the form of computer data signal that be specific with electric transmission at the said procedure code, in being embedded into carrier wave, also can realize the present invention.
<<application examples > >
Above-mentioned dynamic image encoding device 2 and moving image decoding apparatus 1 also can be equipped in the various devices of the transmission of carrying out moving image, reception, record, playback and be used.In addition, moving image can be the proper motion image photographed by camera etc., can be also the artificial sport image (comprising CG and GUI) generated by computer etc.
At first, illustrate and above-mentioned dynamic image encoding device 2 and moving image decoding apparatus 1 can be used in to the situation of transmission and the reception of moving image with reference to Figure 37.
Figure 37 (a) is the block diagram that the formation of the dispensing device PROD_A that has carried dynamic image encoding device 2 is shown.As shown in Figure 37 (a), dispensing device PROD_A possesses: the PROD_A1 of coding section, and it obtains coded data by encoding moving pictures; Modulation portion PROD_A2, it,, by the coded data modulated carrier that utilizes the PROD_A1 of coding section to obtain, obtains modulation signal thus; With sending part PROD_A3, it sends the modulation signal that modulation portion PROD_A2 obtains.Above-mentioned dynamic image encoding device 2 is used as the PROD_A1 of this coding section.
Dispensing device PROD_A, as the source of supply of the moving image that inputs to the PROD_A1 of coding section, also can also possess: the camera PROD_A4 of taking moving image, recorded the recording medium PROD_A5 of moving image, for the image processing part PROD_A7 of the input terminal PROD_A6 from outside input motion image and generation or manuscript picture.In Figure 37 (a), although illustration dispensing device PROD_A possesses the formation of above-mentioned whole parts, also can omit a part.
In addition, recording medium PROD_A5 can be the medium that has recorded the moving image be not encoded, and can be also the medium that has recorded the moving image after the recording coded system different with the coded system from transmission use is encoded.In the situation that be the latter, also can make the lsb decoder (not shown) of the coded data of reading from recording medium PROD_A5 according to recording coded system decoding between recording medium PROD_A5 and the PROD_A1 of coding section.
Figure 37 (b) is the block diagram that the formation of the receiving system PROD_B that has carried moving image decoding apparatus 1 is shown.As shown in Figure 37 (b), receiving system PROD_B possesses: acceptance division PROD_B1, and it receives modulation signal; Demodulation section PROD_B2, its modulation signal received by demodulation acceptance division PROD_B1 obtains coded data; With lsb decoder PROD_B3, its coded data obtained by decoding demodulation section PROD_B2 obtains moving image.Above-mentioned moving image decoding apparatus 1 is used as this lsb decoder PROD_B3.
The supply destination of the moving image that receiving system PROD_B exports as lsb decoder PROD_B3, also can also possess the display PROD_B4 that shows moving image, for the recording medium PROD_B5 that records moving image and for the lead-out terminal PROD_B6 to outside output movement image.In Figure 37 (b), although illustration receiving system PROD_B possesses the formation of above-mentioned these parts, also can omit a part.
In addition, recording medium PROD_B5 can be the medium that has recorded the moving image be not encoded, and can be also the medium after the recording coded system different with the coded system from transmission use is encoded.In the situation that be the latter, also can make the coding section (not shown) of the moving image that obtains from lsb decoder PROD_B3 according to recording coded system coding between lsb decoder PROD_B3 and recording medium PROD_B5.
In addition, the transmission modulation signal transmission medium can be both wireless can be also wired.In addition, the transmission form of transmission modulation signal can be both that broadcast (at this, refer to and send the transmission form that destination is not determined in advance) can be also communication (at this, refer to and send the transmission form that destination is determined in advance).That is, also can realize by any mode of radio broadcasting, wired broadcasting, radio communication and wire communication the transmission of modulation signal.
For example, the broadcasting station of received terrestrial digital broadcasting (broadcasting equipment etc.)/receiving station (television receiver etc.) is the example with the dispensing device PROD_A/ receiving system PROD_B of radio broadcasting transmitting-receiving modulation signal.In addition, the broadcasting station of cable television broadcasting (broadcasting equipment etc.)/receiving station (television receiver etc.) is the example with the dispensing device PROD_A/ receiving system PROD_B of wired broadcasting transmitting-receiving modulation signal.
In addition, used internet VOD (Video On Demand: video request program) server (work station etc. the)/client computer (television receiver, personal computer, smart mobile phone etc.) such as service, moving image share service be the example of receiving and dispatching by correspondence the dispensing device PROD_A/ receiving system PROD_B of modulation signal (usually, what in LAN, as transmission medium, adopt is any one in the middle of wireless or wired, and what in WAN, as transmission medium, adopt is wired).At this, comprise Desktop PC, portable PC and tablet PC in personal computer.In addition, also comprise the multifunctional portable telephone terminal in smart mobile phone.
In addition, the client computer of moving image share service is decoded and is shown in the function of display except the coded data to downloading from server, also have to by camera to moving image encoded and uploaded to the function of server.That is, the client computer of moving image share service is brought into play function as dispensing device PROD_A and receiving system PROD_B both sides.
Secondly, illustrate and above-mentioned dynamic image encoding device 2 and moving image decoding apparatus 1 can be used in to the record of moving image and the situation in playback with reference to Figure 38.
Figure 38 (a) is the block diagram that the formation of the tape deck PROD_C that has carried above-mentioned dynamic image encoding device 2 is shown.As shown in Figure 38 (a), tape deck PROD_C possesses: the PROD_C1 of coding section, and it obtains coded data by encoding moving pictures; With write section PROD_C2, its coded data that PROD_C1 of coding section is obtained is written to recording medium PROD_M.Above-mentioned dynamic image encoding device 2 is used as the PROD_C1 of this coding section.
In addition, recording medium PROD_M can be both that (1) is as HDD (Hard Disk Drive: hard disk drive), the solid state driving machine) etc. SSD (Solid State Drive: the medium that is built in like that the type in tape deck PROD_C, can be also that (2) are as the SD storage card, USB (Universal Serial Bus: the medium of the type that USB) fast storage etc. is connected with tape deck PROD_C like that, can be also that (3) are as DVD (Digital Versatile Disc: Digital versatile disc), registered trade mark) etc. BD (Blu-ray Disc: be loaded onto like that the medium in the built-in drive assembly of tape deck PROD_C (not shown).
In addition, tape deck PROD_C, as the source of supply of the moving image that inputs to the PROD_C1 of coding section, also can also possess: the camera PROD_C3 of taking moving image, for the input terminal PROD_C4 from outside input motion image, for the acceptance division PROD_C5 of receiving moving pictures and the image processing part C6 of generation or manuscript picture.In Figure 38 (a), although illustration tape deck PROD_C possesses the formation of above-mentioned whole parts, also can omit a part.
In addition, acceptance division PROD_C5 can be both the parts that receive the moving image be not encoded, and can be also the parts of the coded data after receiving coded system with the transmission use different from recording coded system and being encoded.In the situation that be the latter, also can make the lsb decoder for transmission (not shown) of the coded data after decoding is encoded with the coded system of transmitting use between acceptance division PROD_C5 and the PROD_C1 of coding section.
As such tape deck PROD_C, such as enumerating (in this case, input terminal PROD_C4 or acceptance division PROD_C5 become the main source of supply of moving image) such as DVD register, BD register, HDD (Hard Disk Drive) registers.In addition, video camera (in this case, camera PROD_C3 becomes the main source of supply of moving image), personal computer (in this case, acceptance division PROD_C5 or image processing part C6 become the main source of supply of moving image), smart mobile phone (in this case, camera PROD_C3 or acceptance division PROD_C5 become the main source of supply of moving image) etc. is also the example of this tape deck PROD_C.
Figure 38 (b) is the piece that the formation of the replay device PROD_D that has carried above-mentioned moving image decoding apparatus 1 is shown.As shown in Figure 38 (b), replay device PROD_D possesses: read the PROD_D1 of section, it reads the coded data that is written to recording medium PROD_M; With lsb decoder PROD_D2, it is read by decoding the coded data that the PROD_D1 of section reads and obtains moving image.Above-mentioned moving image decoding apparatus 1 is used as this lsb decoder PROD_D2.
In addition, recording medium PROD_M can be both that (1) is built in the medium of the type in replay device PROD_D as HDD, SSD etc., can be also the medium of (2) type of being connected with replay device PROD_D as SD storage card, USB fast storage etc., can be also that (3) are loaded onto the medium of the built-in drive assembly of replay device PROD_D (not shown) as DVD, BD etc.
In addition, the supply destination of the moving image that replay device PROD_D exports as lsb decoder PROD_D2 also can also possess: show the display PROD_D3 of moving image, for the lead-out terminal PROD_D4 to outside output movement image and the sending part PROD_D5 that sends moving image.In Figure 38 (b), although illustration replay device PROD_D possesses the formation of above-mentioned whole parts, also can omit a part.
In addition, sending part PROD_D5 can be both the parts that send the moving image be not encoded, and can be also the parts of the coded data after sending coded system with the transmission use different from recording coded system and being encoded.In the situation that be the latter, also can make to transmit the coding section (not shown) of coded system encoding moving pictures of use between lsb decoder PROD_D2 and sending part PROD_D5.
As this replay device PROD_D, such as enumerating DVD player, BD player, HDD player etc. (the lead-out terminal PROD_D4 that in this case, television receiver etc. is connected becomes the main supply destination of moving image).In addition, television receiver (in this case, display PROD_D3 becomes the main supply destination of moving image), the numeral billboard is (also referred to as electronic board, BBS (Bulletin Board System) etc., display PROD_D3 or sending part PROD_D5 become the main supply destination of moving image), desktop type PC (in this case, lead-out terminal PROD_D4 or sending part PROD_D5 become the main supply destination of moving image), portable or tablet PC (in this case, display PROD_D3 or sending part PROD_D5 become the main supply destination of moving image), smart mobile phone (in this case, display PROD_D3 or sending part PROD_D5 become the main supply destination of moving image) etc. be also the example of this replay device PROD_D.
Industrial applicibility
The picture coding device of the coded data after the picture decoding apparatus that the coded data after the present invention can suitably be applied to view data is encoded is decoded and image data generating are encoded.In addition, can suitably be applied to be generated and by the data structure of the coded data of picture decoding apparatus reference by picture coding device.
Label declaration
1 moving image decoding apparatus (picture decoding apparatus)
2 dynamic image encoding devices (picture coding device)
12,12A TU information decoding section
121 Region Segmentation sections (change of scale cutting unit)
122 regional decoding sections (conversion coefficient decoding unit)
280,280A TU information coding section
281 Region Segmentation sections (change of scale cutting unit)
282 regional code sections (transform coefficients encoding unit)
320 relative position mode decoding sections
321 last nonzero coefficient lsb decoders
322 relative position lsb decoders (relative position decoding unit)
323 coefficient positions determination sections (position determination unit)
310 distance of swimming amplitude mode lsb decoders (decoding unit)
420 relative position pattern-coding sections
421 last nonzero coefficient coding sections
422 relative position calculating parts (relative position coding unit)
423 relative position coding sections (relative position coding unit)
BLK object piece (change of scale)
R11~R14 decoding zone (subunit)
TBL11, TBL30 VLC form (decoded information)
TBL21, TBL40 VLC form (coded message)
TUI TU information (coded data)

Claims (15)

1. a picture decoding apparatus, this conversion coefficient of decoding among the coded data obtained from conversion coefficient encodes, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains,
Above-mentioned picture decoding apparatus possesses:
The change of scale cutting unit, it is divided into a plurality of subunits by above-mentioned change of scale; With
The conversion coefficient decoding unit, the conversion coefficient that it is decoded in above-mentioned subunit and comprise with reference to decoded information, this decoded information is for obtain the decoded information of above-mentioned conversion coefficient, the decoded information be assigned with according to each above-mentioned subunit among above-mentioned coded data.
2. picture decoding apparatus according to claim 1, is characterized in that,
Above-mentioned conversion coefficient decoding unit is with reference to nonzero information, when this nonzero information means the conversion coefficient without the non-zero in above-mentioned subunit, omits the decoding of above-mentioned subunit and processes, and this nonzero information means the having or not of conversion coefficient of the non-zero in above-mentioned subunit.
3. picture decoding apparatus according to claim 1 and 2, is characterized in that,
Above-mentioned decoded information according to the position of the above-mentioned subunit in above-mentioned change of scale by adaptively defining.
4. according to the described picture decoding apparatus of any one in claims 1 to 3, it is characterized in that,
Above-mentioned picture decoding apparatus possesses: the decoded information updating block, and it is according to the occurrence frequency of the parameter that means above-mentioned conversion coefficient, and the code that will in above-mentioned decoded information, be assigned to this parameter is updated to shorter code.
5. according to the described picture decoding apparatus of any one in claim 1 to 4, it is characterized in that,
Above-mentioned conversion coefficient decoding unit carries out following decoding to be processed: carried out the 1st mode decoding process under rated condition after, carry out the 2nd mode decoding process, the decode length of continuous nonzero coefficient, the absolute value of conversion coefficient and the code of conversion coefficient of the 1st mode decoding process wherein, the absolute value of the 2nd mode decoding process decoding conversion coefficient and the code of conversion coefficient.
6. picture decoding apparatus according to claim 5, is characterized in that,
Above-mentioned conversion coefficient decoding unit changes the afore mentioned rules condition according to the position of the above-mentioned subunit in above-mentioned change of scale.
7. according to the described picture decoding apparatus of any one in claim 1 to 6, it is characterized in that,
Above-mentioned picture decoding apparatus possesses:
The localized area decoding unit, its regulation zone that is defined in the low frequency component side in above-mentioned change of scale conversion coefficient of decoding; With
Switch unit, its switching is processed, is processed with the decoding performed by above-mentioned localized area decoding unit by above-mentioned change of scale cutting unit and the performed decoding of above-mentioned conversion coefficient decoding unit.
8. according to the described picture decoding apparatus of any one in claim 1 to 7, it is characterized in that,
Above-mentioned a plurality of subunits of having cut apart are cut apart on above-mentioned change of scale cutting unit recursion ground.
9. a picture coding device, encoded to conversion coefficient, and this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains,
Above-mentioned picture coding device possesses:
The change of scale cutting unit, it is divided into a plurality of subunits by above-mentioned change of scale; With
The transform coefficients encoding unit, the conversion coefficient that it is encoded in above-mentioned change of scale and comprise with reference to coded message, this coded message is for the coded message of the above-mentioned conversion coefficient of encoding, the coded message that is assigned with according to each above-mentioned subunit.
10. the data structure of a coded data, by conversion coefficient is encoded and generated, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains,
In the data structure of above-mentioned coded data,
Comprise expression and whether make above-mentioned change of scale be divided into the dividing mark of a plurality of subunits,
The picture decoding apparatus that above-mentioned coded data is decoded, when above-mentioned dividing mark means to make above-mentioned change of scale to be divided into a plurality of subunit, be divided into a plurality of subunits by above-mentioned change of scale, and according to each subunit above-mentioned conversion coefficient of decoding.
11. a picture decoding apparatus, this conversion coefficient of decoding among the coded data obtained from conversion coefficient encodes, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains,
Above-mentioned picture decoding apparatus possesses:
The relative position decoding unit, its decoding becomes the relative position of the conversion coefficient of decoder object with respect to the previous conversion coefficient decoded; With
Position determination unit, its according to the previous above-mentioned conversion coefficient decoded the position in above-mentioned change of scale and above-mentioned relative position determine the position of the above-mentioned conversion coefficient that becomes above-mentioned decoder object.
12. picture decoding apparatus according to claim 11, is characterized in that,
Above-mentioned picture decoding apparatus possesses: decoding unit, its zone for the low frequency component side in above-mentioned change of scale and carry out that the 1st mode decoding is processed and the 2nd mode decoding is processed, the 1st mode decoding is processed length, the absolute value of conversion coefficient and the code of conversion coefficient of the continuous nonzero coefficient of decoding, and the 2nd mode decoding is processed the absolute value of decoding conversion coefficient and the code of conversion coefficient.
13. picture decoding apparatus according to claim 12, is characterized in that,
Above-mentioned decoding unit changes the size of above-mentioned zone according to the characteristic that becomes the change of scale of decoder object.
14. a picture coding device, encoded to conversion coefficient, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains,
Above-mentioned picture coding device possesses:
The relative position coding unit, its coding becomes the position of above-mentioned conversion coefficient of coded object with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out.
15. the data structure of a coded data, by conversion coefficient is encoded and generated, this conversion coefficient carries out frequency translation according to each change of scale by the pixel value of object images and obtains,
In the data structure of above-mentioned coded data,
The position that comprises the above-mentioned conversion coefficient that becomes coded object is with respect to the relative position of the position of previous above-mentioned conversion coefficient of encoding out,
The picture decoding apparatus that above-mentioned coded data is decoded, position and above-mentioned relative position according to the previous above-mentioned conversion coefficient decoded in above-mentioned change of scale, come definite position that becomes the above-mentioned conversion coefficient of decoder object.
CN201280020067.0A 2011-04-27 2012-04-27 Image decoding apparatus, image encoding apparatus, and data structure of encoded data Pending CN103493494A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2011100081 2011-04-27
JP2011-100081 2011-04-27
PCT/JP2012/061478 WO2012147966A1 (en) 2011-04-27 2012-04-27 Image decoding apparatus, image encoding apparatus, and data structure of encoded data

Publications (1)

Publication Number Publication Date
CN103493494A true CN103493494A (en) 2014-01-01

Family

ID=47072477

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280020067.0A Pending CN103493494A (en) 2011-04-27 2012-04-27 Image decoding apparatus, image encoding apparatus, and data structure of encoded data

Country Status (4)

Country Link
JP (2) JP6051156B2 (en)
CN (1) CN103493494A (en)
RU (2) RU2653319C1 (en)
WO (1) WO2012147966A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109840471A (en) * 2018-12-14 2019-06-04 天津大学 A kind of connecting way dividing method based on improvement Unet network model
CN112400323A (en) * 2018-07-06 2021-02-23 华为技术有限公司 Image encoder, image decoder, and corresponding methods

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2653319C1 (en) * 2011-04-27 2018-05-07 Шарп Кабусики Кайся Images decoding device, images encoding device and data structure of encoded data
ES2949998T3 (en) * 2018-06-03 2023-10-04 Lg Electronics Inc Method and device for processing a video signal using a reduced transform
CN110456983A (en) * 2019-04-17 2019-11-15 上海酷芯微电子有限公司 Data store organisation and method towards deep learning chip sparse calculation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1497503A (en) * 2002-10-08 2004-05-19 株式会社Ntt都科摩 Image coding method and device, decoding device and method, coding and decoding program
JP2006033508A (en) * 2004-07-16 2006-02-02 Olympus Corp Adaptive variable length coding device, adaptive variable length decoding device, adaptive variable length coding/decoding method, and adaptive variable length coding/decoding program
CN101822065A (en) * 2007-10-12 2010-09-01 高通股份有限公司 Entropy coding of interleaved sub-blocks of video block

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4931034B2 (en) * 2004-06-10 2012-05-16 株式会社ソニー・コンピュータエンタテインメント Decoding device, decoding method, program, and program recording medium
CN1779716A (en) * 2005-05-26 2006-05-31 智多微电子(上海)有限公司 Realization of rapid coding-decoding circuit with run-length
BRPI0922846A2 (en) * 2008-12-03 2018-01-30 Nokia Corp switching between dct coefficient coding modes
US8406546B2 (en) * 2009-06-09 2013-03-26 Sony Corporation Adaptive entropy coding for images and videos using set partitioning in generalized hierarchical trees
JP2011049816A (en) * 2009-08-27 2011-03-10 Kddi R & D Laboratories Inc Moving image encoding device, moving image decoding device, moving image encoding method, moving image decoding method, and program
KR101457894B1 (en) * 2009-10-28 2014-11-05 삼성전자주식회사 Method and apparatus for encoding image, and method and apparatus for decoding image
RU2653319C1 (en) * 2011-04-27 2018-05-07 Шарп Кабусики Кайся Images decoding device, images encoding device and data structure of encoded data

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1497503A (en) * 2002-10-08 2004-05-19 株式会社Ntt都科摩 Image coding method and device, decoding device and method, coding and decoding program
JP2006033508A (en) * 2004-07-16 2006-02-02 Olympus Corp Adaptive variable length coding device, adaptive variable length decoding device, adaptive variable length coding/decoding method, and adaptive variable length coding/decoding program
CN101822065A (en) * 2007-10-12 2010-09-01 高通股份有限公司 Entropy coding of interleaved sub-blocks of video block

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MARTA KARCZEWICZ 等: "《Variable Length Coding for Coded Block Flag and Large Transform》", 《JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11,2ND MEETING: GENEVA, CH, JCTVC-B098》, 28 July 2010 (2010-07-28) *
SUNIL LEE 等: "《Efficient coefficient coding method for large transform in VLC mode》", 《EFFICIENT COEFFICIENT CODING METHOD FOR LARGE TRANSFORM IN VLC MODE",SUNIL LEE等,JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11 3RD MEETING: GUANGZHOU, CN, JCTVC-C210》, 15 October 2010 (2010-10-15) *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112400323A (en) * 2018-07-06 2021-02-23 华为技术有限公司 Image encoder, image decoder, and corresponding methods
US11368696B2 (en) 2018-07-06 2022-06-21 Huawei Technologies Co., Ltd. Picture encoder, picture decoder and corresponding methods of encoding and decoding
CN112400323B (en) * 2018-07-06 2022-09-23 华为技术有限公司 Image encoder, image decoder, and corresponding methods
CN109840471A (en) * 2018-12-14 2019-06-04 天津大学 A kind of connecting way dividing method based on improvement Unet network model
CN109840471B (en) * 2018-12-14 2023-04-14 天津大学 Feasible road segmentation method based on improved Unet network model

Also Published As

Publication number Publication date
WO2012147966A1 (en) 2012-11-01
RU2609096C2 (en) 2017-01-30
JP6051156B2 (en) 2016-12-27
RU2653319C1 (en) 2018-05-07
JPWO2012147966A1 (en) 2014-07-28
RU2013152171A (en) 2015-06-10
JP6407944B2 (en) 2018-10-17
JP2017085586A (en) 2017-05-18

Similar Documents

Publication Publication Date Title
US10547861B2 (en) Image decoding device
CN104054343B (en) Picture decoding apparatus, picture coding device
CN103891293B (en) Method and apparatus for the adaptive loop filter of chromatic component
CN106604038B (en) Offset decoding device, offset encoding device, and image filter device
CN103858430A (en) Image decoding apparatus, image decoding method and image encoding apparatus
TWI513278B (en) Largest coding unit (lcu) or partition-based syntax for adaptive loop filter and sample adaptive offset in video coding
TWI520584B (en) Signaling syntax elements for transform coefficients for sub-sets of a leaf-level coding unit
CN103959778A (en) Intra pcm (IPCM) and lossless coding mode video deblocking
TW201342925A (en) Throughput improvement for CABAC coefficient level coding
CN104350753A (en) Arithmetic decoding device, image decoding device, arithmetic encoding device, and image encoding device
CN103947213A (en) Loop filtering control over tile boundaries
CN103636223A (en) Multiple zone scanning order for video coding
CN104221380A (en) Common spatial candidate blocks for parallel motion estimation
CN104012096A (en) Performing motion vector prediction for video coding
JP6407944B2 (en) Image decoding apparatus, image encoding apparatus, image decoding method, image encoding method, control program, and recording medium
JP2010193509A (en) Decoding system and decoding device
CN104067524A (en) Context adaptive entropy coding with a reduced initialization value set
CN104604233A (en) Image decoding device
WO2012077719A1 (en) Image decoding device and image coding device
WO2012043676A1 (en) Decoding device, encoding device, and data structure

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1191161

Country of ref document: HK

RJ01 Rejection of invention patent application after publication

Application publication date: 20140101

RJ01 Rejection of invention patent application after publication
REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1191161

Country of ref document: HK