WO2006025339A1 - Decoding device, encoding device, decoding method, encoding method - Google Patents
Decoding device, encoding device, decoding method, encoding method
- Publication number
- WO2006025339A1 (PCT/JP2005/015679)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video data
- image
- data
- additional information
- resolution
- Prior art date
Classifications
- H04N19/59—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
- H04N19/139—Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, the unit being an image region, e.g. an object, the region being a block, e.g. a macroblock
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
- H04N7/014—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level, involving interpolation processes involving the use of motion vectors
- H04N7/0125—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level, one of the standards being a high definition standard
Definitions
- Decoding device, encoding device, decoding method, encoding method
- the present invention relates to a video data encoding apparatus and decoding apparatus that involve resolution conversion.
- Patent Document 1 discloses, as a decoding device that involves resolution conversion, a technique for converting an interlaced image into a progressive image.
- In this technique, a progressive image is generated using a motion vector included in the coded bitstream of the interlaced image.
- In Non-Patent Document 1, progressive image pixels are estimated from an interlaced image using a filter or the like, and higher resolution is realized by motion prediction of an image region similar to those pixels.
- Patent Document 2 discloses a technique in which, in addition to scalable encoding of a base moving image and an extended moving image that extends the base moving image, the difference between the pixel values of the extended moving image and the pixel values of the base moving image is encoded.
- Patent Document 1: Japanese Patent Laid-Open No. 10-126749
- Non-Patent Document 1: Taichiro Kurita and Yukio Sugiura, “Sequential Scanning Conversion Method for Interlaced Scanning Images Using Motion Compensation,” IEICE Transactions (D-II), Vol. J78-D-II, No. 1, pp. 40-49, Jan. 1995
- Patent Document 2: International Publication No. WO 2004/073312 A1 Pamphlet
- In Non-Patent Document 1, if the accuracy of the filter used for high-resolution estimation is poor, an incorrect motion vector may be used, and there is a problem that the accuracy of the resulting high-resolution image is poor.
- In Patent Document 1, since the motion vector included in the bitstream is not necessarily equal to the motion of the image, the motion vector may be incorrect, and there is a problem that the accuracy of the high-resolution image is poor.
- In Non-Patent Document 1, since the motion vector is detected in the decoding device, it can follow the motion of the image more closely than a motion vector included in the bitstream, but there is a problem that the processing load on the decoding device increases remarkably.
- In Patent Document 2, there is a problem that it is difficult to achieve a low bit rate because not only the pixel values of the base moving image but also the difference pixel values of the extended moving image are encoded.
- An object of the present invention is to provide a decoding device and a coding device that can generate a high-resolution image from a low-resolution image with a low bit rate, a low processing amount, and high image quality.
- the decoding apparatus of the present invention includes: acquisition means for acquiring additional information, which includes a motion vector indicating the motion of an image in first video data, and stream data, which is encoded data of second video data having the same content as the first video data and a resolution lower than that of the first video data; decoding means for decoding the stream data into an image of the second video data; and conversion means for converting the decoded image of the second video data into third video data having the same resolution as the first video data by interpolating pixels using the additional information.
- With this configuration, the decoding apparatus can obtain the additional information indicating the motion of the image together with the stream data, so it is not necessary to detect the motion of the image in order to generate a high-resolution image, and it is not necessary to detect motion in real time at the time of decoding; the amount of processing for increasing the resolution can therefore be reduced. In addition, since the additional information does not include a code indicating the pixel values of the first video data, a low bit rate can be obtained.
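- As a minimal sketch of this decoding flow (the function and field names below are illustrative and not taken from the patent), the decoder receives the stream data and the additional information, decodes the low-resolution image, and up-converts it using only the motion information:

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

@dataclass
class BlockAdditionalInfo:
    """Per-block additional information: a conversion mode and a motion vector,
    but no pixel values of the original high-resolution video."""
    conversion_mode: str               # e.g. "A" (intra interpolation) or "B"/"C"/"D" (MC)
    motion_vector: Tuple[int, int]     # in high-resolution pixel units

def decode_and_upconvert(stream_data: bytes,
                         additional_info: List[BlockAdditionalInfo],
                         decode: Callable, convert: Callable):
    low_res_image = decode(stream_data)                        # decoding means
    high_res_image = convert(low_res_image, additional_info)   # conversion means
    return high_res_image
```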
- Here, the conversion means may include extraction means for extracting a motion vector from the additional information, and interpolation means for interpolating pixels in the image of the second video data using the extracted motion vector.
- a high-resolution image can be obtained by interpolating pixels according to the motion vector with respect to the image of the second video data, so that the processing amount can be reduced.
- Here, the conversion means may include: extraction means for extracting a motion vector from the additional information; first interpolation means for interpolating pixels using pixels within the image of the second video data decoded by the decoding means; second interpolation means for interpolating pixels in the image of the second video data decoded by the decoding means using the extracted motion vector; and generation means for generating the image of the third video data by selectively using the first interpolation means and the second interpolation means.
- With this configuration, the image of the third video data can be generated efficiently by selectively using intra-image interpolation by the first interpolation means and inter-image interpolation by the second interpolation means.
- Here, the second interpolation means may include: memory means for holding a converted image of the third video data and an image of the third video data to be converted; first specifying means for specifying a pixel position to be interpolated in the image of the third video data to be converted; second specifying means for specifying, according to the motion vector, the pixel corresponding to the pixel position specified by the first specifying means in the converted image of the third video data; reading means for reading the value of the pixel specified by the second specifying means; and writing means for writing an interpolation pixel value at the pixel position specified by the first specifying means according to the pixel value read by the reading means.
- With this configuration, the pixels to be interpolated in the image of the third video data to be converted can be interpolated using pixels obtained, according to the motion vector, from the converted image of the third video data.
- Here, the additional information may include a motion vector in units of blocks constituting the first video data, the first specifying means may specify a pixel position to be interpolated within the block, and the second specifying means may specify the pixel corresponding to the pixel position specified by the first specifying means in accordance with the block-unit motion vector.
- Here, the additional information may include two motion vectors in units of blocks constituting the first video data; the second specifying means may specify, in two converted images of the third video data, two pixels corresponding to the pixel position specified by the first specifying means according to the two motion vectors; the reading means may read the values of the two pixels specified by the second specifying means; and the writing means may calculate the value of the interpolation pixel based on the values of the two pixels read by the reading means.
- the third video data can have higher image quality.
- Here, when the image of the second video data is intra-coded, the generation means may generate the image of the third video data by using the first interpolation means for that image, and when the image of the second video data is inter-picture predictively coded, the generation means may generate the image of the third video data by using the second interpolation means for that image.
- the encoding apparatus of the present invention includes: conversion means for converting first video data into second video data having a resolution lower than that of the first video data; encoding means for encoding the second video data as stream data; generation means for generating additional information that indicates the motion of the first video data and is used for interpolating pixels in the image of the second video data; and output means for outputting the stream data and the additional information without outputting a code indicating the pixel values of the first video data.
- With this configuration, the decoding device can generate the high-resolution image with high image quality.
- the decoding method, the encoding method, and the program for realizing the decoding method of the present invention have the same configuration as described above.
- As described above, the decoding apparatus of the present invention can obtain the additional information indicating the motion of the image together with the stream data, so it is not necessary to detect the motion of the image in order to generate a high-resolution image, and it is not necessary to detect the motion in real time at the time of decoding; the amount of processing can therefore be reduced. In addition, since the additional information does not include a code indicating the pixel values of the first video data, a low bit rate can be achieved.
- FIG. 1 is a block diagram showing a schematic configuration of an encoding device and a decoding device according to the present invention.
- FIG. 2 is a block diagram showing a configuration of the encoding unit 101 and the additional information generation unit 102.
- FIG. 3 is an explanatory diagram of a differential motion vector.
- FIG. 4A is an explanatory diagram of high-resolution image generation by spatiotemporal pixel interpolation.
- FIG. 4B is an explanatory diagram of high-resolution image generation by spatiotemporal pixel interpolation.
- FIG. 5 is a diagram showing the relationship between a low resolution image and a high resolution image.
- FIG. 6A is an explanatory diagram showing codes of an interpolation image generation mode.
- FIG. 6B is an explanatory diagram showing codes in the interpolation image generation mode.
- FIG. 7 is a flowchart showing an image encoding process.
- FIG. 8 is a flowchart showing generation mode selection processing.
- FIG. 9A is an explanatory diagram showing a stream format of additional information associated with stream data.
- FIG. 9B is an explanatory diagram showing a stream format of additional information associated with stream data.
- FIG. 10 is a flowchart showing decoding processing.
- FIG. 11A is a flowchart showing the high-resolution image generation process in S103 of FIG. 10.
- FIG. 11B is a flowchart showing in more detail the high resolution processing shown in step S113 of FIG. 11A.
- FIG. 11C is an explanatory diagram of MC interpolation processing.
- FIG. 11D is a flowchart showing in more detail the MC interpolation process shown in step S122 of FIG. 11B.
- FIG. 11E is a flowchart showing in more detail the MC-BID interpolation process shown in step S123 of FIG. 11B.
- FIG. 11F is a flowchart showing in more detail the INTRA-MC mixed interpolation process shown in step S124 of FIG. 11B.
- FIG. 12 is a flowchart showing another example of image encoding processing.
- FIG. 13A shows an example of a physical format of a flexible disk which is a recording medium body.
- FIG. 13B shows the appearance of the flexible disk as seen from the front, its cross-sectional structure, and the flexible disk itself.
- FIG. 13C shows a configuration for recording and reproducing the above program on the flexible disk FD.
- FIG. 14 is a block diagram showing an overall configuration of a content supply system that realizes a content distribution service.
- FIG. 15 is a diagram showing a mobile phone ex115 using an image encoding method and an image decoding method.
- FIG. 16 is a diagram showing the appearance of a mobile phone.
- FIG. 17 is a diagram showing a digital broadcasting system.
- FIG. 1 is a block diagram showing a schematic configuration of an encoding device and a decoding device according to Embodiment 1 of the present invention.
- the encoding device 1 includes a resolution reduction unit 100, an encoding unit 101, and an additional information generation unit 102. More specific examples of the encoding device 1 are, for example, a computer ex111, an Internet service provider ex102, a streaming server ex103, and the like.
- the resolution reduction unit 100 converts the high-resolution video data HV1 into low-resolution video data LV1.
- the resolution of the low-resolution video data LV1 is lower than that of the high-resolution video data HV1; for example, the high-resolution video data HV1 is VGA (640 x 480 pixels) and the low-resolution video data LV1 is QVGA (320 x 240 pixels).
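- As a simple sketch of this resolution reduction (the patent does not specify the downsampling filter; a 2x2 averaging filter is assumed here for illustration):

```python
import numpy as np

def downsample_2x(frame: np.ndarray) -> np.ndarray:
    """Halve the resolution in each direction by averaging 2x2 pixel blocks,
    e.g. a 480x640 (VGA) luma plane becomes 240x320 (QVGA)."""
    h, w = frame.shape
    frame = frame[:h - h % 2, :w - w % 2]                # crop to even dimensions
    return frame.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

hv1 = np.zeros((480, 640))   # high-resolution video data HV1 (VGA)
lv1 = downsample_2x(hv1)     # low-resolution video data LV1 (QVGA)
```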
- the encoding unit 101 compresses and encodes the low resolution video data LV1.
- This compression encoding is, for example, MPEG1, 2, 4, 4AVC or the like.
- the encoded low-resolution video data LV1 is output as a low-resolution video stream LVS.
- the additional information generation unit 102 generates additional information AI for increasing the resolution of the low-resolution video data.
- Additional information AI includes motion information indicating the motion of an image in the high-resolution video data HV1, and conversion mode information for generating high-resolution video data from the low-resolution video data.
- the conversion mode information indicates one of the following: (A) a first mode indicating that pixels should be interpolated using temporally and spatially surrounding pixels; (B) a second mode in which the additional information includes a forward motion vector and the resolution should be increased by acquiring a partial image from an already resolution-enhanced image in the forward direction according to the motion vector; (C) a third mode that uses a backward motion vector in the same way; and (D) a fourth mode that uses two motion vectors. These four modes are also sketched in the example below.
- the conversion mode is selected in units of macroblocks in order to realize a low processing amount and high image quality in the decoding device 2.
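- The four conversion modes might be represented, for instance, by the following enumeration (the names are illustrative, not from the patent):

```python
from enum import Enum

class ConversionMode(Enum):
    INTRA_INTERP = "A"  # (A) interpolate from temporally/spatially surrounding pixels
    MC_FWD = "B"        # (B) fetch a partial image from a forward resolution-enhanced image
    MC_BWD = "C"        # (C) fetch a partial image from a backward resolution-enhanced image
    MC_BID = "D"        # (D) use two motion vectors and combine the two partial images
```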
- the decoding device 2 includes a decoding device 200 and a high-resolution unit 202.
- Specific examples of the decoding device 2 are a computer ex111, a television ex401, an STB ex407, and the like, which are devices capable of displaying a high-resolution image.
- the decoding device 200 includes a decoding unit 201, and decodes the low-resolution video stream LVS.
- the decoding unit 201 corresponds to the encoding unit 101 and performs decoding using, for example, MPEG1, 2, 4, 4AVC or the like.
- the low-resolution video stream LVS after decoding is output as low-resolution video data LV2.
- Specific examples of the decoding device 200 are, for example, a computer ex111, a PDA ex112, a mobile phone ex114, a mobile phone ex115, a digital camera ex116, a DVD recorder ex420, and the like, that is, devices that display a low-resolution image or selectively display a low-resolution image.
- the high-resolution unit 202 converts the low-resolution video data LV2 into high-resolution video data HV2 based on the additional information AI.
- FIG. 2 is a block diagram showing a detailed configuration of the encoding unit 101 and the additional information generation unit 102.
- the encoding unit 101 includes a subtractor 110, an orthogonal transform unit 111, a quantization unit 112, a variable length encoding unit 113, an inverse quantization unit 114, an inverse orthogonal transform unit 115, an adder 116, a predicted image generation unit 117, and a motion vector detection unit 118.
- the configuration of the encoding unit 101 may be configured by a conventional technique such as MPEG1, 2, 4, 4AVC, etc., and the detailed description thereof is omitted.
- the additional information generation unit 102 includes a high-resolution image generation unit 121, a motion vector detection unit 122, a spatio-temporal interpolation image generation unit 123, a generation mode selection unit 124, and a variable length encoding unit 125.
- the high-resolution image generation unit 121 has an internal memory that stores already resolution-enhanced images, and increases the resolution of the low-resolution video data locally decoded in the encoding unit 101 by acquiring a partial image from an already resolution-enhanced image according to the motion vector detected by the motion vector detection unit 122. This resolution enhancement is executed in the second to fourth modes (B) to (D). The resolution enhancement performed in the additional information generation unit 102 is used by the generation mode selection unit 124 to evaluate the accuracy of the resolution increase and the amount of generated code and thereby select a generation mode.
- the motion vector detection unit 122 detects a motion vector from the high-resolution video data HV1. For example, when the generation mode selected by the generation mode selection unit 124 is the second mode, the motion vector detection unit 122 detects a motion vector by searching a forward image from among the already resolution-enhanced images. Similarly, in the third mode, a motion vector is detected by searching a backward image, and in the fourth mode, a forward motion vector and a backward motion vector, or a plurality of motion vectors in the same direction, are detected.
- FIG. 3 is an explanatory diagram of the difference motion vector.
- the right side of the figure represents the current input image included in the high-resolution video data.
- the left side shows an image with already high resolution.
- the hatched part on the right represents the block that is the target of motion vector detection in the input image.
- the broken line portion on the left represents a region of a similar (or the same) partial image searched from an already resolution-enhanced image.
- the high-resolution MV in the figure indicates the motion vector detected by the motion vector detection unit 122.
- the hatched portion on the left represents a region of the partial image detected by the motion vector detection unit 118 in the corresponding low-resolution image.
- a stream MV in the figure shows a motion vector detected from the low-resolution image by the motion vector detection unit 118.
- the high-resolution MV and the stream MV are scaled to the same size.
- the variable-length encoding unit 125 encodes the differential motion vector between the high-resolution MV and the stream MV, whereby the code amount of the motion information can be reduced. As shown in FIG. 3, the high-resolution MV and the stream MV are expected to have almost the same value, while the high-resolution MV can express the motion more accurately.
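- A minimal sketch of this differential coding, assuming the VGA/QVGA case so that the stream MV is scaled by a factor of 2 into high-resolution units (names and the scale parameter are illustrative):

```python
def scale_stream_mv(stream_mv, scale=2):
    """Bring a motion vector detected on the low-resolution image into
    high-resolution pixel units (factor 2 for the VGA/QVGA example)."""
    return (stream_mv[0] * scale, stream_mv[1] * scale)

def differential_mv(high_res_mv, stream_mv, scale=2):
    """Encoder side: only this difference is placed in the additional information."""
    sx, sy = scale_stream_mv(stream_mv, scale)
    return (high_res_mv[0] - sx, high_res_mv[1] - sy)

def reconstruct_high_res_mv(diff_mv, stream_mv, scale=2):
    """Decoder side: stream MV from the bitstream plus the decoded difference."""
    sx, sy = scale_stream_mv(stream_mv, scale)
    return (diff_mv[0] + sx, diff_mv[1] + sy)
```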
- the spatiotemporal interpolation image generation unit 123 generates a high-resolution image by interpolating the pixels using pixels existing in the temporal and spatial surroundings. This higher resolution is executed in the first mode (A).
- FIGS. 4A and 4B are explanatory diagrams of high-resolution image generation that realizes double resolution in the horizontal and vertical directions by spatiotemporal pixel interpolation.
- In FIG. 4A, circles aligned in the vertical direction represent pixels in the same image.
- Figure 4A shows the pixels of three images at different times.
- the hatched circles indicate the pixels of the low-resolution image, and the open circles indicate the pixels that should be interpolated in the high-resolution image.
- interpolation is performed using information on surrounding pixels as shown in the figure. At this time, pixels with already high resolution of images having different times may be used.
- the interpolated pixel is generated by weighting and averaging each of a plurality of surrounding pixels.
- FIG. 4B shows two images. For example, when the pixel indicated by b is generated by spatio-temporal pixel interpolation, the pixel is similarly interpolated using temporally and spatially adjacent pixels.
- the spatiotemporal interpolation image generation unit 123 interpolates pixels by filtering a plurality of pixels existing in the temporal direction and the spatial direction.
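- A minimal sketch of the spatial part of this interpolation for the 2x horizontal/vertical case (the exact filter taps and weights are not specified in the text and are assumed here; temporally adjacent, already resolution-enhanced pixels could be added to the averages in the same way):

```python
import numpy as np

def intra_interpolate_2x(low: np.ndarray) -> np.ndarray:
    """Place the low-resolution pixels on the even grid positions of a 2x image and
    fill the remaining positions with simple averages of neighbouring known pixels
    (edges wrap around for brevity)."""
    h, w = low.shape
    high = np.zeros((2 * h, 2 * w), dtype=np.float64)
    high[0::2, 0::2] = low                                        # known pixels
    high[0::2, 1::2] = (low + np.roll(low, -1, axis=1)) / 2.0     # horizontal neighbours
    high[1::2, 0::2] = (low + np.roll(low, -1, axis=0)) / 2.0     # vertical neighbours
    high[1::2, 1::2] = (low + np.roll(low, -1, axis=0)
                        + np.roll(low, -1, axis=1)
                        + np.roll(low, -1, axis=(0, 1))) / 4.0    # diagonal neighbours
    return high
```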
- the generation mode selection unit 124 selects a high-resolution image generation mode (the above-described conversion mode) for each block.
- As a selection criterion, for example, the modes (A) to (D) above may be selected so as to correspond to the four encoding modes (a) to (d) in the encoding unit 101, or the mode may be selected by evaluating the accuracy of the high-resolution image and the amount of generated code.
- the four coding modes in the encoding unit 101 are: (a) intra coding mode, (b) forward predictive coding mode, (c) backward predictive coding mode, and (d) bi-predictive coding mode.
- FIG. 5 is a diagram showing a relationship between a low resolution image and a high resolution image.
- I1, B2, B3, ... indicate low-resolution images in display order.
- I indicates the intra-coded picture in (a) above.
- P indicates a unidirectionally predictive coded picture of (b) or (c) above.
- B represents the bi-predictive coded picture of (d) above.
- the numbers next to I, B, and P indicate the display order.
- the numbers in () indicate the encoding order.
- H1, H2, H3, ... in the lower part of the figure indicate high-resolution images corresponding to the respective low-resolution images.
- the H1, H5, and H6 pictures are increased in resolution by spatio-temporal pixel interpolation in the first mode (A).
- the H2, H3, H8, and H9 pictures are increased in resolution either by spatio-temporal interpolation as in mode (A) above, or by acquiring partial images from already resolution-enhanced pictures according to motion vectors as in modes (B) to (D) above.
- the H4, H7, and H10 pictures are increased in resolution either by spatio-temporal interpolation in mode (A) above, or by acquiring a partial image from an already resolution-enhanced picture in the forward direction according to the motion vector as in mode (B) above.
- different modes can be used for each block in the picture.
- when the selected generation mode is the first mode, the variable length encoding unit 125 variable-length-codes the conversion mode information representing the first mode as additional information; when the selected generation mode is one of the second to fourth modes, it variable-length-codes the conversion mode information and the motion information as additional information. At that time, the variable length encoding unit 125 variable-length-codes the motion information in the form of a differential motion vector.
- FIGS. 6A and 6B are explanatory views showing generation mode codes.
- the encoding mode column indicates the encoding mode of the low resolution image.
- the interpolation generation mode column indicates the corresponding high-resolution image generation mode (conversion mode). That is, “INTRA interpolation” indicates (A), “MC FWD” indicates (B), “MC BWD” indicates (C), and “MC BID” indicates (D).
- “MC Weight” indicates that, in the case of (D) above, a high-resolution image is generated by weighted linear prediction using multiple high-resolution images.
- “INTRA Interpolation Weight” indicates that, in the case of (A) above, a high-resolution image is generated by weighted filtering using a plurality of high-resolution images.
- “INTRA-MC mixing” indicates that the above (A) and any of the above (B) to (D) are mixed to generate a high-resolution image.
- the code of the generation mode is assigned in association with the encoding mode of the block of the low-resolution image corresponding to the block of the high-resolution image. That is, the code of the generation mode is assigned so as to be short (to be “0”) when the encoding mode and the generation mode are similar. For example, pay attention to the generation mode “MC BID” column.
- when the generation mode of a block of the high-resolution image is “MC BID” and the encoding mode of the corresponding block of the low-resolution image is “INTER-BID”, “INTER-BWD”, “INTER-FWD”, or intra, the codes assigned to that block of the high-resolution image are “0”, “3”, “3”, and “6”, respectively.
- FIG. 6B is a diagram showing a variable-length code table in the generation mode more specifically.
- Table T1 shows the variable length code table when the encoding mode is (a) above.
- Tables T2, T3, and T4 show the variable length code tables when the encoding mode is (b), (c), or (d) above, respectively.
- when the encoding mode and the generation mode are similar, the code of the generation mode is “0”; when they are not similar, a longer code is assigned. That is, the codes are assigned so as to be short when the generation mode is similar to the encoding mode.
- the encoding of the generation mode is not limited to this.
- the coding of the generation mode may also be a coding method that reduces the amount of generated code by using occurrence probabilities (so-called arithmetic coding).
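- The mode-dependent code assignment could be sketched as a lookup keyed by the encoding mode of the co-located low-resolution block; the “MC BID” row mirrors the example codes given above, while the interpretation of the code values and the missing rows are hypothetical:

```python
# code assigned to generation mode "MC BID", indexed by the low-resolution block's
# encoding mode; similar modes get the shortest code ("0")
MC_BID_CODE = {
    "INTER-BID": "0",
    "INTER-BWD": "3",
    "INTER-FWD": "3",
    "INTRA":     "6",
}

def generation_mode_code(enc_mode: str, gen_mode: str) -> str:
    if gen_mode == "MC BID":
        return MC_BID_CODE[enc_mode]
    raise NotImplementedError("tables for the other generation modes are not shown here")
```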
- FIG. 7 is a flowchart showing the image encoding process and the additional information generation process in the encoding unit 101 and the additional information generation unit 102.
- First, the encoding unit 101 performs encoding in block units (more precisely, macroblock units) (S71), and the generation mode selection unit 124 acquires the coding residual of the macroblock from the variable length encoding unit 113 (S72).
- the locally decoded picture of the low-resolution image is stored in the reference memory in the predicted image generation unit 117 in units of blocks.
- the high-resolution image generation unit 121 and the spatio-temporal interpolation image generation unit 123 generate a high-resolution image corresponding to the encoded low-resolution image (S73); the motion vector detection unit 122 detects a motion vector of the newly input high-resolution video data HV1 using the generated high-resolution image as the search target (S74), and the differential motion vector between the motion vector detected by the motion vector detection unit 118 and the motion vector of the high-resolution image is calculated (S75).
- the spatiotemporal interpolation image generation unit 123 generates a high resolution image from the corresponding low resolution image by pixel interpolation by spatiotemporal interpolation (S76).
- the generation mode selection unit 124 selects an optimal generation mode based on the encoding residual of the low resolution image and the motion vector (S77).
- the variable length coding unit 125 performs variable length coding on the additional information (S78). That is, the variable length encoding unit 125 encodes the selected generation mode, and also encodes the difference motion vector if the selected generation mode is the second to fourth modes.
- FIG. 8 is a flowchart showing the generation mode selection process in S77 of FIG.
- if the information amount of the coding residual acquired in S72 is smaller than the threshold Th1, and the motion vector detected in S74 or the motion vector detected by the motion vector detection unit 118 is smaller than the threshold Th2, the generation mode selection unit 124 selects, from the first to fourth modes, the generation mode corresponding to the encoding mode (resolution enhancement using the motion vectors of (B) to (D) above) (S83).
- if the information amount of the coding residual acquired in S72 is larger than the threshold Th1, or the motion vector detected in S74 or the motion vector detected by the motion vector detection unit 118 is larger than the threshold Th2, the generation mode selection unit 124 selects the first mode (resolution enhancement by spatio-temporal pixel interpolation in (A) above) as the generation mode (S84). In this generation mode selection process, when the motion in the high-resolution image and the low-resolution image is severe, the first mode is set as the generation mode to suppress an increase in code amount.
- in addition, when the motion vector deviates greatly from the surrounding motion vectors, the generation mode selection unit 124 selects the first mode (resolution enhancement by spatio-temporal pixel interpolation in (A) above) as the generation mode. Specifically, the generation mode selection unit 124 calculates the variance with respect to the surrounding motion vectors (S82a), and if the variance is equal to or larger than a threshold (S82b), selects the first mode (S84).
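- A minimal sketch of this selection logic under the assumptions above (threshold values and the exact variance formula are illustrative):

```python
import math

def select_generation_mode(residual_bits, stream_mv, high_res_mv, neighbour_mvs,
                           enc_mode, enc_mode_to_gen_mode,
                           th1=1000, th2=8.0, th_var=4.0):
    """Return "A" (spatio-temporal interpolation) or the generation mode that
    corresponds to the block's encoding mode (mapping supplied by the caller)."""
    def magnitude(mv):
        return math.hypot(mv[0], mv[1])

    # large residual or large motion -> fall back to mode (A)  (S84)
    if residual_bits > th1 or magnitude(high_res_mv) > th2 or magnitude(stream_mv) > th2:
        return "A"
    # motion vector inconsistent with its neighbourhood -> mode (A)  (S82a/S82b, S84)
    mvs = list(neighbour_mvs) + [high_res_mv]
    mx = sum(mv[0] for mv in mvs) / len(mvs)
    my = sum(mv[1] for mv in mvs) / len(mvs)
    variance = sum((mv[0] - mx) ** 2 + (mv[1] - my) ** 2 for mv in mvs) / len(mvs)
    if variance >= th_var:
        return "A"
    # otherwise use the mode corresponding to the encoding mode  (S83)
    return enc_mode_to_gen_mode[enc_mode]
```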
- FIGS. 9A and 9B are explanatory diagrams illustrating a stream format example of additional information associated with the stream data by the variable length coding unit 125.
- FIG. 9A shows a format example in which the additional information is added as user data in units of pictures. That is, the additional information in units of macroblocks is added as user data to the stream data consisting of a picture header and picture data.
- This user data is data that the user may arbitrarily set in the stream.
- FIG. 9B shows a format example in which the output means embeds additional information in the stream data.
- additional information for each macroblock is embedded in the macroblock data.
- the format of FIG. 9B can reduce the amount of data in that a macroblock address is not required.
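- As an illustration of the picture-level user-data format of FIG. 9A, the per-macroblock additional information could be serialized as follows (the byte layout is hypothetical; the per-macroblock embedding of FIG. 9B would simply omit the macroblock address):

```python
import struct

def pack_macroblock_info(mb_address: int, gen_mode: int, diff_mv=(0, 0)) -> bytes:
    """One macroblock's additional information: address, generation mode,
    differential motion vector (little-endian, illustrative field widths)."""
    return struct.pack("<HBhh", mb_address, gen_mode, diff_mv[0], diff_mv[1])

def pack_picture_user_data(mb_infos) -> bytes:
    """mb_infos: iterable of (mb_address, gen_mode, diff_mv) tuples for one picture."""
    return b"".join(pack_macroblock_info(*info) for info in mb_infos)
```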
- the additional information may be substantially embedded in the stream data by an information embedding technique such as digital watermarking, and the resulting stream data may then be transmitted.
- for example, the encoding unit 101 may acquire the additional information from the additional information generation unit 102 and, using digital watermarking or the like, embed the additional information in the image data to be encoded within a range that does not impair the image quality of the decoded image.
- digital watermarking techniques include the time-axis difference embedding method, the space-axis difference embedding method, the layer-structure embedding method, wavelet transform, and spread spectrum.
- FIG. 10 is a flowchart showing the decoding process in the decoding device 2.
- the decoding device 2 determines whether or not the high-resolution image size can be displayed on the connected display (S102).
- if it cannot be displayed, the low-resolution video data LV2 decoded by the decoding unit 201 is output for display (S104). If the size can be displayed, a high-resolution image is generated from the low-resolution video data LV2 decoded by the decoding unit 201 (S103) and is output for display (S104).
- FIG. 11A is a flowchart showing the high-resolution image generation process in S103 of FIG. 10.
- the resolution increasing unit 202 performs variable length decoding on the additional information (S111) and determines whether or not the additional information includes generation mode information (that is, conversion mode information) (S112); if it does, a high-resolution image is generated according to the generation mode information (S113), and if it does not, a high-resolution image is generated by spatio-temporal pixel interpolation (S114); the generated high-resolution image is then output (S115).
- this high-resolution image generation processing is performed in units of macroblocks when the additional information is provided in units of macroblocks, and in units of pictures when the additional information is provided in units of pictures.
- FIG. 11B is a flowchart showing an outline of the high resolution processing shown in step S113 of FIG. 11A.
- the high resolution unit 202 determines the generation mode information (that is, the conversion mode information) in the additional information (S120). If the generation mode information indicates (A) INTRA interpolation, INTRA interpolation processing is performed; if it indicates (B) MC FWD or (C) MC BWD, MC interpolation processing is performed (S122); if it indicates (D) MC BID, MC-BID interpolation processing is performed (S123); and if it indicates INTRA-MC mixing, INTRA-MC mixed interpolation processing is performed (S124).
- note that which interpolation process to use may also be selected according to a certain rule. For example, the interpolation process may be selected in accordance with the encoding mode of the low-resolution image corresponding to the image to be processed by the interpolation ((a) intra coding mode, (b) forward predictive coding mode, (c) backward predictive coding mode, or (d) bi-predictive coding mode).
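- The mode-driven dispatch of FIG. 11B can be sketched as follows (the interpolation routines themselves are supplied by the caller in this illustration):

```python
def generate_high_res_block(gen_mode: str, block,
                            intra_interp, mc_interp, mc_bid_interp, intra_mc_interp):
    """Choose the interpolation routine for one block from the decoded
    generation-mode (conversion-mode) information."""
    if gen_mode == "INTRA":                   # (A) INTRA interpolation
        return intra_interp(block)
    if gen_mode in ("MC FWD", "MC BWD"):      # (B)/(C) MC interpolation
        return mc_interp(block)
    if gen_mode == "MC BID":                  # (D) bidirectional MC interpolation
        return mc_bid_interp(block)
    if gen_mode == "INTRA-MC":                # mixed interpolation
        return intra_mc_interp(block)
    raise ValueError(f"unknown generation mode: {gen_mode}")
```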
- FIG. 11C is an explanatory diagram of the MC interpolation process in step S122 of FIG. 11B.
- the left side of the figure shows the pixels in one horizontal row or one vertical column of an already resolution-enhanced reference image.
- White circles indicate pixels included in the low-resolution image, and black circles indicate interpolated pixels.
- the right side of the figure shows one horizontal row or one vertical column image in the high resolution processing target image.
- White circles indicate the pixels included in the low-resolution image, and broken circles indicate the pixels to be interpolated. It is assumed that the motion vector of block B1 in the image to be processed points to a region R1 in the image with a high resolution.
- the resolution increasing unit 202 interpolates the pixel position a1 to be interpolated in the block B1 using the pixel value of the pixel p1 in the region R1, and interpolates the pixel position a2 to be interpolated in the block B1 using the pixel value of the pixel p2 in the region R1.
- similarly, the high resolution unit 202 interpolates the pixel position a3 to be interpolated in the block B2 using the pixel value of the pixel p3 in the region R2, and interpolates the pixel position a4 to be interpolated in the block B2 using the pixel value of the pixel p4 in the region R2.
- this figure shows the case where the interpolation generation mode is (B) MC-FWD or (C) MC-BWD.
- when the interpolation generation mode is (D) MC-BID, the high resolution unit 202 calculates the pixel value of each pixel to be interpolated as a weighted average of two pixel values obtained from two already resolution-enhanced images.
- FIG. 11D is a flowchart showing the MC interpolation process shown in step S122 of FIG. 11B in more detail. This figure shows the processing for one block when the resolution of the processing target image is increased in units of blocks.
- the decoding device 2 has a memory for holding the image whose resolution has been completed and the image to be processed. The image whose resolution has been completed is referred to when interpolation is performed using motion vectors.
- the image to be processed is composed of pixels constituting a low resolution image and pixels to be interpolated.
- first, the resolution enhancement unit 202 performs variable length decoding on the differential motion vector included in the additional information (S130), converts it, together with the corresponding motion vector of the low-resolution image, into the motion vector H-MV for the high-resolution image (S131), and specifies the rectangular region pointed to by H-MV in the already resolution-enhanced reference image (S132).
- the resolution increasing unit 202 interpolates all the pixels to be interpolated in the block in the loop 1 process (S133 to S137).
- specifically, the high resolution unit 202 specifies the pixel corresponding to the pixel to be interpolated in the specified rectangular region (S134), reads the value of the specified pixel from the memory (S135), and writes the read pixel value into the memory as the value of the pixel to be interpolated in the block (S136).
- in this way, all the pixels to be interpolated in the processing target image are interpolated using pixel values read from the reference image according to the motion vector.
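- A minimal sketch of this per-block MC interpolation (here the positions still to be interpolated are assumed to be marked with NaN in a float image; that marking and the array layout are illustrative):

```python
import numpy as np

def mc_interpolate_block(target: np.ndarray, reference: np.ndarray,
                         block_y: int, block_x: int, block_size: int,
                         mv: tuple) -> None:
    """Fill every not-yet-interpolated pixel of one block of the processing target
    image with the co-located pixel of the rectangular region of the already
    resolution-enhanced reference image pointed to by the motion vector mv=(dy, dx)."""
    dy, dx = mv
    for y in range(block_y, block_y + block_size):
        for x in range(block_x, block_x + block_size):
            if np.isnan(target[y, x]):                    # pixel position to be interpolated
                target[y, x] = reference[y + dy, x + dx]  # read from the region pointed to by mv
```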
- FIG. 11E is a flowchart showing the MC-BID interpolation process shown in step S123 of FIG. 11B in more detail.
- FIG. 11E is different from FIG. 11D in that steps S130a to S135a and S137a are provided instead of steps S130 to S135 and S137, and in that S140 has been added. The description of the common points is omitted, and the differences are mainly described below.
- the resolution enhancement unit 202 performs variable length decoding on the two differential motion vectors included in the additional information (S130a), converts them, together with the corresponding two motion vectors of the low-resolution image, into the two motion vectors H-MV1 and H-MV2 for the high-resolution image (S131a), and specifies the two rectangular regions pointed to by H-MV1 and H-MV2 in the two already resolution-enhanced reference images (S132a).
- the resolution increasing unit 202 interpolates all the pixels to be interpolated in the block in the loop 1 process (S133a to S137a).
- specifically, the high resolution unit 202 specifies the two pixels corresponding to the pixel to be interpolated in the two specified rectangular regions (S134a), reads the values of the two specified pixels from the memory (S135a), and calculates the weighted average of the two read pixel values.
- the weight of each pixel value may be determined according to the distance from the processing target image to each reference image, for example. Further, the weight may be changed according to the magnitude of the motion vector corresponding to the two pixel values.
- the weight of the pixel value corresponding to the smaller of the two motion vectors may be greater than the weight of the other pixel value.
- the result of the weighted average calculation is written into the memory as the value of the pixel to be interpolated (S136).
- in this way, all the pixels to be interpolated in the processing target image are interpolated based on two pixel values read from two reference images according to two motion vectors. Note that although the MC-BID interpolation process of FIG. 11E uses two motion vectors and two reference images, three or more motion vectors and three or more reference images may be used.
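- A small sketch of the weighting, under the assumption that the weights are either inversely proportional to the temporal distance to each reference picture or biased toward the pixel reached by the smaller motion vector (both options are mentioned above; the exact formula and the 2:1 bias are illustrative):

```python
def weighted_bid_average(p1: float, p2: float,
                         dist1: int = 1, dist2: int = 1,
                         mv1_mag: float = None, mv2_mag: float = None) -> float:
    """Weighted average of the two reference pixel values used in MC-BID interpolation."""
    if mv1_mag is not None and mv2_mag is not None:
        w1, w2 = (2.0, 1.0) if mv1_mag < mv2_mag else (1.0, 2.0)  # favour the smaller MV
    else:
        w1, w2 = 1.0 / dist1, 1.0 / dist2                          # favour the nearer picture
    return (w1 * p1 + w2 * p2) / (w1 + w2)
```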
- FIG. 11F is a flowchart showing in more detail the INTRA-MC mixed interpolation process shown in step S124 of FIG. 11B.
- FIG. 11F is different from FIG. 11E in that S150 and S151 have been added. The description of the common points is omitted, and the differences are mainly described below.
- in S150, the high resolution unit 202 determines whether the pixel to be interpolated should be INTRA-interpolated or MC-interpolated. This determination can be based on the position of the pixel to be interpolated within the block and on whether the pixel adjacent to it is a pixel of the low-resolution image or an interpolated pixel. For example, if the adjacent pixel is a pixel of the low-resolution image, INTRA interpolation is chosen, and if the adjacent pixel is an interpolated pixel, MC interpolation is chosen. If INTRA interpolation is chosen, the high resolution unit 202 performs INTRA interpolation on the pixel to be interpolated in S151.
- note that the determination of whether to perform INTRA interpolation or MC interpolation may be made not only for each pixel to be interpolated but also for each block or each slice.
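- A sketch of this mixed interpolation under the assumptions above (the adjacency test is reduced to a single left-neighbour check for brevity; the interpolation routines are supplied by the caller):

```python
def intra_mc_mixed_block(positions, is_low_res_pixel, interpolate_intra, interpolate_mc):
    """For each pixel position to be interpolated, use INTRA interpolation when an
    adjacent pixel comes from the low-resolution image, and MC interpolation when the
    adjacent pixel is itself an interpolated pixel."""
    for (y, x) in positions:
        if is_low_res_pixel(y, x - 1):      # adjacent pixel belongs to the low-res image
            interpolate_intra(y, x)
        else:
            interpolate_mc(y, x)
```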
- the variable length encoding unit 125 may temporarily receive the low-resolution image bitstream LVS from the variable length encoding unit 113 and output it in association with the additional information.
- as described above, the generation of a high-resolution image from a low-resolution image can be realized with a low processing amount and high image quality.
- the first embodiment may be modified as follows within a practical range.
- the additional information generation unit 102 in FIG. 2 may be configured without the high-resolution image generation unit 121; instead, the high-resolution image signal HV1 at the same time as the decoded high-resolution image may be input to the motion vector detection unit 122 as the search target.
- in this case, the motion vector detection unit 122 detects a motion vector of the high-resolution image from the high-resolution image signal HV1 and generates a differential motion vector for the high-resolution image. In this way, the configuration of the encoding device 1 can be simplified, and a low processing amount can be realized.
- alternatively, the high-resolution image generation unit 121 may be removed from the additional information generation unit 102 in FIG. 2, and instead the high-resolution image signal HV2 at the same time as the decoded high-resolution image may be input from the high resolution section 202 to the motion vector detection section 122 as the search target.
- in this case, the motion vector detection unit 122 detects a motion vector of the high-resolution image from the high-resolution image signal HV1 and the high-resolution image signal HV2, and generates a differential motion vector for the high-resolution image.
- alternatively, the high-resolution image generation unit 121 may be retained and used only for selecting a generation mode. In this way, it is possible to achieve high image quality while reducing the processing for increasing the resolution of the decoded image one frame before.
- FIG. 12 is a flowchart showing another example of the image encoding process according to the second embodiment. The processing of this figure is executed instead of that of FIG. 7 and FIG. 8 in the first embodiment.
- first, the motion vector detection unit 122 detects a motion vector from the original image (high-resolution image HV1) with reference to an already resolution-enhanced image in the high-resolution image generation unit 121, and the differential motion vector between the detected motion vector and the motion vector detected by the motion vector detection unit 118 is calculated (S122).
- the generation mode selection unit 124 calculates a difference value D between the high-resolution image generated by the high-resolution image generation unit 121 according to the differential motion vector and the original image (high-resolution image HV1) (S123).
- next, the amount of generated code when the differential motion vector is encoded as additional information is calculated (S124), and the COST shown in the following equation is calculated (S125): COST = D + (generated code amount), where D = Σ|high-resolution image − interpolation-generated image| is the sum of the difference values calculated in S123.
- D means the block-by-block sum of the absolute pixel value differences between the original image (high-resolution image HV1) and the high-resolution image generated according to the differential motion vector. If this value is 0, the interpolation-generated image is exactly the same as the original image (the image quality of the interpolated image is the best); the larger this value, the farther the interpolation-generated image is from the original image and the worse its image quality.
- the generated code amount is calculated in S124; if the generated code amount is large, it means that the coding efficiency of the low-resolution image bitstream LVS with the additional information attached is degraded.
- accordingly, a large COST value means that at least one of the image quality of the high-resolution image and the coding efficiency is poor, and a small COST value means that both good image quality of the high-resolution image and good coding efficiency are achieved.
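- A sketch of this cost under the assumptions above (the weighting factor lam between the distortion term and the code-amount term is an assumption; the text only states that both terms contribute):

```python
import numpy as np

def cost(original_block: np.ndarray, interpolated_block: np.ndarray,
         generated_bits: int, lam: float = 1.0) -> float:
    """D = sum of absolute pixel differences between the original high-resolution block
    and the interpolation-generated block (S123); the generated code amount of the
    additional information (S124) is added so that poor image quality and poor coding
    efficiency both raise the COST."""
    d = float(np.abs(original_block.astype(np.int64)
                     - interpolated_block.astype(np.int64)).sum())
    return d + lam * generated_bits
```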
- the generation mode selection unit 124 compares the calculated COST and COST1 (S126), and if COST is small, updates the value of COST1 to the value of COST (S127).
- the initial value of COST1 is the minimum threshold value of COST that should be secured.
- COST1 is thus updated to the minimum COST value found in the loop processing that continues until the search range of the motion vector has been covered (S128).
- in S128, the generation mode selection unit 124 determines whether or not the search range has been exhausted; alternatively, it may determine whether or not several generation modes selected from the generation modes similar to the encoding mode (the second to fourth modes) have been tried.
- the generation mode selection unit 124 can obtain the motion vector or the generation mode that becomes the minimum COST1 for the high-resolution image generated according to the difference motion vector.
- next, the spatio-temporal interpolation image generation unit 123 generates an interpolation image by spatio-temporal interpolation (S129), and the generation mode selection unit 124 calculates the difference value D between the generated interpolation image and the original image (high-resolution image HV1) (S130) and then calculates COST (S131).
- the generation mode selection unit 124 compares the calculated COST and COST2 (S132), and updates the value of COST2 to the value of COST if COST is small (S133).
- the initial value of COST2 is the minimum threshold value of COST to be secured, and may be the same value as the initial value of COST1.
- then, the generation mode selection unit 124 determines whether or not the generation methods by interpolation have been exhausted; it is sufficient to determine whether or not the trials have been completed while changing the type and strength of the filter used for interpolation. The type and strength of the filter used for interpolation may be selected according to the downsampling information DSI.
- the generation mode selection unit 124 can obtain a generation mode that is the minimum COST2 for a high-resolution image generated by space-time interpolation.
- then, the generation mode selection unit 124 selects the generation mode corresponding to the smaller of COST1 and COST2 (S135).
- the variable length encoding unit 125 encodes the generation mode information indicating the selected generation mode (S136).
- COST is a measure for evaluating both the degradation of the image quality of the high-resolution image and the deterioration of the coding efficiency of the low-resolution image caused by adding the additional information.
- the generation mode selection unit 124 in the present embodiment is configured to calculate COST for various generation modes and to select the generation mode that minimizes COST. As a result, the image quality of the high-resolution image can be improved, and the deterioration of coding efficiency due to the addition of the additional information can be minimized.
- an encoding and decoding program and a code string (data stream) for realizing the configuration of the encoding process and the decoding process shown in the above embodiments can be recorded on a recording medium such as a flexible disk.
- FIGS. 13A to 13C are diagrams for explaining the case where the encoding process or the decoding process of Embodiments 1 and 2 described above is implemented by a computer system using a flexible disk storing the encoding and decoding programs.
- FIG. 13B shows the appearance of the flexible disk as seen from the front, its cross-sectional structure, and the flexible disk itself, and FIG. 13A shows an example of the physical format of the flexible disk that is the recording medium body.
- the flexible disk FD is built into the case F; on the surface of the disk, a plurality of tracks Tr are formed concentrically from the outer periphery toward the inner periphery, and each track is divided into 16 sectors Se in the angular direction. Therefore, in the flexible disk storing the above program, the data of the program is recorded in an area allocated on the flexible disk.
- FIG. 13C shows a configuration for recording and reproducing the above program on the flexible disk FD.
- when recording the above program, the data of the program is written from the computer system Cs via a flexible disk drive.
- when the above encoding or decoding program is built into the computer system from the flexible disk, the flexible disk drive reads the program from the flexible disk and transfers it to the computer system.
- the flexible disk is used as the recording medium, but the same can be done using an optical disk.
- the recording medium is not limited to this, and any recording medium such as an IC card or a ROM cassette that can record a program can be used.
- the encoding method and decoding method described in the above embodiments can also be implemented in a semiconductor such as an LSI and mounted in mobile communication devices such as mobile phones and car navigation systems, and in imaging devices such as digital video cameras and digital still cameras.
- as implementation formats, three types are possible: a transmission/reception type terminal having both an encoder and a decoder, a transmitting terminal having only an encoder, and a receiving terminal having only a decoder. Specific application examples will be described with reference to FIGS. 14 to 17.
- FIG. 14 is a block diagram showing the overall configuration of a content supply system ex100 that implements a content distribution service.
- the communication service provision area is divided into cells of a desired size, and base stations ex107 to ex110, which are fixed radio stations, are installed in the respective cells.
- in this content supply system ex100, for example, a computer ex111, a PDA (personal digital assistant) ex112, a camera ex113, a mobile phone ex114, a camera-equipped mobile phone ex115, and other devices are connected via an Internet service provider ex102, a telephone network ex104, and the base stations ex107 to ex110.
- however, the content supply system ex100 is not limited to the combination shown in FIG. 14, and any combination of these devices may be connected. Further, each device may be connected directly to the telephone network ex104 without going through the base stations ex107 to ex110, which are fixed radio stations.
- the camera exl 13 is a device capable of shooting a moving image such as a digital video camera.
- the mobile phone ex114 may be a mobile phone of the PDC (Personal Digital Communications) system, the CDMA (Code Division Multiple Access) system, the W-CDMA (Wideband-Code Division Multiple Access) system, or the GSM (Global System for Mobile Communications) system, or a PHS (Personal Handyphone System); any of these may be used.
- the streaming server ex103 is connected to the camera ex113 via the base station ex109 and the telephone network ex104, which makes it possible to perform live distribution or the like based on the encoded data transmitted by a user using the camera ex113.
- the encoding of the captured data may be performed by the camera ex113, or by a server or the like that performs the data transmission processing.
- the moving image data shot by the camera ex116 may be transmitted to the streaming server ex103 via the computer ex111.
- the camera ex116 is a device capable of shooting still images and moving images, such as a digital camera. In this case, the moving image data may be encoded by either the camera ex116 or the computer ex111.
- the encoding process is performed by the LSI ex117 included in the computer ex111 or the camera ex116.
- the image encoding/decoding software may be incorporated in some kind of storage medium (a CD-ROM, a flexible disk, a hard disk, or the like) readable by the computer ex111 or the like. Furthermore, moving image data may be transmitted by the camera-equipped mobile phone ex115; the moving image data in this case is data encoded by the LSI included in the mobile phone ex115.
- in this content supply system ex100, content shot by a user with the camera ex113, the camera ex116, or the like (for example, video of a live music performance) is encoded as in the above embodiments and transmitted to the streaming server ex103, and the streaming server ex103 streams the content data to clients that have made requests. The clients include the computer ex111, the PDA ex112, the camera ex113, and the mobile phone ex114, all of which are capable of decoding the encoded data. In this way, the content supply system ex100 is a system in which the encoded data can be received and reproduced by the clients, and can also be received, decoded, and reproduced in real time.
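- The flow described above (capture devices encode content, upload it to the streaming server ex103, and clients pull, decode, and play it) can be pictured with the following highly simplified sketch; the encode, decode, and transport functions are hypothetical stand-ins and do not represent the embodiment's actual codec or network interface.

```python
# Toy sketch of the content-supply flow: encode at the capture side, queue at
# the server, decode and play at the client. All hooks are hypothetical stubs.
from typing import Callable, Iterable

def upload(frames: Iterable[bytes], encode: Callable[[bytes], bytes], server_queue: list) -> None:
    """Camera/computer side: encode each captured frame and push it to the server."""
    for frame in frames:
        server_queue.append(encode(frame))

def stream_to_client(server_queue: list, decode: Callable[[bytes], bytes], play: Callable[[bytes], None]) -> None:
    """Client side: pull encoded data from the server, decode it, and play it in order."""
    for chunk in server_queue:
        play(decode(chunk))

# Usage with trivial stand-in codec functions (byte reversal as a dummy codec).
queue: list = []
upload([b"frame1", b"frame2"], encode=lambda f: f[::-1], server_queue=queue)
stream_to_client(queue, decode=lambda c: c[::-1], play=print)
```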
- the image encoding device or the image decoding device described in each of the above embodiments may be used for encoding and decoding of each device constituting the system.
- a mobile phone will be described as an example.
- FIG. 15 is a diagram showing the mobile phone ex115 that uses the image encoding method and the image decoding method described in the above embodiments.
- the mobile phone ex115 includes an antenna ex201 for transmitting and receiving radio waves to and from the base station ex110, a camera unit ex203 such as a CCD camera capable of capturing video and still images, a display unit ex202 such as a liquid crystal display for displaying decoded data such as video captured by the camera unit ex203 and video received by the antenna ex201, a main body provided with a group of operation keys ex204, an audio output unit ex208 such as a speaker for outputting audio, an audio input unit ex205 such as a microphone for inputting audio, a recording medium ex207 for storing encoded or decoded data such as captured moving image or still image data, received e-mail data, and moving image or still image data, and a slot unit ex206 that allows the recording medium ex207 to be attached to the mobile phone ex115.
- the recording medium ex207 is a device in which a flash memory element, a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory), which is a nonvolatile memory that can be electrically rewritten and erased, is housed in a plastic case such as that of an SD card.
- in the mobile phone ex115, a power supply circuit unit ex310, an operation input control unit ex304, an image encoding unit ex312, a camera interface unit ex303, an LCD (Liquid Crystal Display) control unit ex302, an image decoding unit ex309, a demultiplexing unit ex308, a recording/reproducing unit ex307, a modulation/demodulation circuit unit ex306, and an audio processing unit ex305 are connected via a synchronization bus ex313 to one another and to a main control unit ex311, which performs overall control of the respective units of the main body including the display unit ex202 and the operation keys ex204.
- when a call-end key or a power key is turned on by a user operation, the power supply circuit unit ex310 supplies power to each unit from a battery pack, thereby starting up the camera-equipped digital mobile phone ex115 into an operable state.
- in the voice call mode, based on the control of the main control unit ex311, which is composed of a CPU, a ROM, a RAM, and the like, the mobile phone ex115 converts the audio signal collected by the audio input unit ex205 into digital audio data by means of the audio processing unit ex305, subjects this to spread spectrum processing in the modulation/demodulation circuit unit ex306, and, after digital-to-analog conversion processing and frequency conversion processing in the transmission/reception circuit unit ex301, transmits it via the antenna ex201.
- also in the voice call mode, the mobile phone ex115 amplifies the received signal picked up by the antenna ex201, performs frequency conversion processing and analog-to-digital conversion processing on it, performs spectrum despreading processing in the modulation/demodulation circuit unit ex306, converts the result into analog audio data in the audio processing unit ex305, and outputs this via the audio output unit ex208.
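- The transmit and receive chains just described can be summarized as ordered processing stages, as in the following conceptual sketch; only the ordering of the stages follows the description, and the stage functions themselves are hypothetical placeholders supplied by the caller.

```python
# Conceptual sketch of the voice-call processing chains described above.
from functools import reduce

def run_chain(signal, stages):
    """Pass a signal through the listed processing stages in order."""
    return reduce(lambda s, stage: stage(s), stages, signal)

# Transmit direction: ex205 -> ex305 (A/D) -> ex306 (spread spectrum)
#                     -> ex301 (D/A + frequency conversion) -> ex201 (antenna)
def transmit_voice(analog_in, a_to_d, spread, upconvert, radiate):
    return run_chain(analog_in, [a_to_d, spread, upconvert, radiate])

# Receive direction:  ex201 -> ex301 (amplify, frequency conversion, A/D)
#                     -> ex306 (despread) -> ex305 (to analog audio) -> ex208
def receive_voice(rf_in, downconvert, despread, to_analog, play):
    return run_chain(rf_in, [downconvert, despread, to_analog, play])
```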
- the text data of an e-mail input by operating the operation keys ex204 of the main body is sent to the main control unit ex311 via the operation input control unit ex304. The main control unit ex311 subjects the text data to spread spectrum processing in the modulation/demodulation circuit unit ex306 and to digital-to-analog conversion processing and frequency conversion processing in the transmission/reception circuit unit ex301, and then transmits it to the base station ex110 via the antenna ex201.
- the image data captured by the camera unit ex203 is supplied to the image encoding unit ex312 via the camera interface unit ex303.
- the image data captured by the camera unit ex203 can be directly displayed on the display unit ex202 via the camera interface unit ex303 and the LCD control unit ex302.
- the image encoding unit ex312 has a configuration that includes the image encoding device described in the present invention; it converts the image data supplied from the camera unit ex203 into encoded image data by compression-encoding it with the encoding method used in the image encoding device shown in the above embodiments, and sends the result to the demultiplexing unit ex308.
- at the same time, the mobile phone ex115 sends the audio collected by the audio input unit ex205 during capture by the camera unit ex203 to the demultiplexing unit ex308 via the audio processing unit ex305 as digital audio data.
- the demultiplexing unit ex308 multiplexes the encoded image data supplied from the image encoding unit ex312 and the audio data supplied from the audio processing unit ex305 by a predetermined method; the resulting multiplexed data is subjected to spread spectrum processing in the modulation/demodulation circuit unit ex306 and to digital-to-analog conversion processing and frequency conversion processing in the transmission/reception circuit unit ex301, and is then transmitted via the antenna ex201.
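- The embodiment only states that the encoded image data and the audio data are multiplexed "by a predetermined method"; as one hypothetical stand-in, the following sketch interleaves tagged, length-prefixed video and audio chunks into a single stream. This is not the actual format used by the demultiplexing unit ex308.

```python
# Illustrative multiplexing sketch using a hypothetical [tag][length][payload]
# record format; the real "predetermined method" is not specified in the text.
import struct
from itertools import zip_longest

VIDEO_TAG, AUDIO_TAG = 0x01, 0x02

def multiplex(video_chunks, audio_chunks):
    """Interleave video and audio chunks into one stream of tagged records."""
    stream = bytearray()
    for video, audio in zip_longest(video_chunks, audio_chunks):
        if video is not None:
            stream += struct.pack(">BI", VIDEO_TAG, len(video)) + video
        if audio is not None:
            stream += struct.pack(">BI", AUDIO_TAG, len(audio)) + audio
    return bytes(stream)
```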
- the data received from the base station ex110 via the antenna ex201 is subjected to spectrum despreading processing by the modulation/demodulation circuit unit ex306, and the resulting multiplexed data is sent to the demultiplexing unit ex308. The demultiplexing unit ex308 separates the multiplexed data into a bit stream of image data and a bit stream of audio data; the encoded image data is supplied to the image decoding unit ex309 via the synchronization bus ex313, and the audio data is supplied to the audio processing unit ex305.
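- A companion sketch for the receiving side, using the same hypothetical tagged, length-prefixed record format as the multiplexing sketch above, recovers the image bit stream and the audio bit stream as follows.

```python
# Demultiplexing counterpart to the hypothetical format sketched above.
import struct

VIDEO_TAG, AUDIO_TAG = 0x01, 0x02  # must match the multiplexing sketch

def demultiplex(stream: bytes):
    """Split the multiplexed stream into an image bit stream and an audio bit stream."""
    video, audio, pos = bytearray(), bytearray(), 0
    while pos < len(stream):
        tag, length = struct.unpack_from(">BI", stream, pos)
        pos += 5  # 1-byte tag + 4-byte length
        payload = stream[pos:pos + length]
        pos += length
        if tag == VIDEO_TAG:
            video.extend(payload)
        elif tag == AUDIO_TAG:
            audio.extend(payload)
    return bytes(video), bytes(audio)
```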
- the image decoding unit ex309 has a configuration including the image decoding device described in the present invention; it generates reproduced moving image data by decoding the bit stream of image data using the decoding method that corresponds to the encoding method shown in the above embodiments, and supplies this data to the display unit ex202 via the LCD control unit ex302, whereby, for example, the moving image data included in a moving image file linked to a homepage is displayed.
- the audio processing unit ex305 converts the audio data into analog audio data, and then supplies the analog audio data to the audio output unit ex208.
- thereby, the audio data included in the moving image file linked to the homepage is reproduced.
- at least either the image encoding device or the image decoding device of the above embodiments can also be incorporated into a digital broadcasting system.
- a bit stream of video information is transmitted to a communication or broadcasting satellite ex410 via radio waves.
- the broadcasting satellite ex410 that receives this bit stream transmits a radio wave for broadcasting, a home antenna ex406 equipped with a satellite broadcast receiving facility receives the radio wave, and a device such as a television (receiver) ex401 or a set-top box (STB) ex407 decodes the bit stream and reproduces it.
- it is also possible to implement the image decoding device described in the above embodiments in a reproduction device ex403 that reads and decodes a bit stream recorded on a storage medium ex402, such as a CD or a DVD, serving as a recording medium. In this case, the reproduced video signal is displayed on a monitor ex404.
- a configuration in which the image decoding device is installed in a set-top box ex407 connected to a cable ex405 for cable television or to the antenna ex406 for satellite/terrestrial broadcasting, and the output is reproduced on a television monitor ex408, is also conceivable.
- the image decoding apparatus may be incorporated in the television, not the set top box.
- it is also possible for a car ex412 having an antenna ex411 to receive a signal from the satellite ex410 or from the base station ex107 or the like, and to reproduce a moving image on a display device such as the car navigation system ex413 mounted in the car ex412.
- the image signal can be encoded by the image encoding device shown in the above embodiment and recorded on a recording medium.
- as a specific example, there is a recorder ex420 such as a DVD recorder that records image signals on a DVD disc ex421 or a disk recorder that records them on a hard disk. The image signals can also be recorded on an SD card ex422. If the recorder ex420 includes the image decoding device shown in the above embodiments, the image signals recorded on the DVD disc ex421 or the SD card ex422 can be reproduced and displayed on the monitor ex408.
- the configuration of the car navigation system ex413 can be, for example, the configuration shown in FIG. 15 without the camera unit ex203, the camera interface unit ex303, and the image encoding unit ex312; the same can be said of the computer ex111, the television (receiver) ex401, and the like.
- terminals such as the mobile phone ex114 may be implemented in three formats: a transmission/reception type terminal having both an encoder and a decoder, a transmitting terminal having only an encoder, and a receiving terminal having only a decoder.
- each functional block in the block diagrams shown in FIGS. 1 and 2 is typically realized as an LSI which is an integrated circuit device.
- This LSI may be integrated into a single chip or multiple chips.
- functional blocks other than memory may be integrated into a single chip.
- although referred to here as an LSI, it may also be called an IC, a system LSI, a super LSI, or an ultra LSI depending on the degree of integration.
- the method of circuit integration is not limited to the LSI; it may be realized by a dedicated circuit or a general-purpose processor. An FPGA (Field Programmable Gate Array) that can be programmed after LSI manufacture, or a reconfigurable processor in which the connections and settings of the circuit cells inside the LSI can be reconfigured, may also be used.
- the central processing part can also be realized by a processor and a program.
- the image encoding method or the image decoding method shown in the above embodiments can be used in any of the devices and systems described above, whereby the effects described in the above embodiments can be obtained.
- the present invention is suitable for an encoding device that encodes images, a decoding device that decodes them, a web server that distributes moving images, a network terminal that receives them, a digital camera capable of recording and reproducing moving images, a camera-equipped mobile phone, a DVD recorder/player, a PDA, a personal computer, and the like.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Television Systems (AREA)
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP05775151A EP1788817A4 (en) | 2004-08-30 | 2005-08-29 | DECODER, ENCODER, DECODING METHOD AND CODING METHOD |
US11/661,277 US8208549B2 (en) | 2004-08-30 | 2005-08-29 | Decoder, encoder, decoding method and encoding method |
JP2006532683A JP4949028B2 (ja) | 2004-08-30 | 2005-08-29 | 復号化装置、符号化装置、復号化方法、符号化方法 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP04020569.2 | 2004-08-30 | ||
EP04020569A EP1631089A1 (en) | 2004-08-30 | 2004-08-30 | Video coding apparatus and decoding apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006025339A1 true WO2006025339A1 (ja) | 2006-03-09 |
Family
ID=34926354
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/015679 WO2006025339A1 (ja) | 2004-08-30 | 2005-08-29 | 復号化装置、符号化装置、復号化方法、符号化方法 |
Country Status (4)
Country | Link |
---|---|
US (1) | US8208549B2 (ja) |
EP (2) | EP1631089A1 (ja) |
JP (1) | JP4949028B2 (ja) |
WO (1) | WO2006025339A1 (ja) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007129647A (ja) * | 2005-11-07 | 2007-05-24 | Sony Corp | 記録再生装置および記録再生方法、記録装置および記録方法、再生装置および再生方法、並びにプログラム |
JP2009253764A (ja) * | 2008-04-08 | 2009-10-29 | Fujifilm Corp | 画像処理システム、画像処理方法、およびプログラム |
JP2009253586A (ja) * | 2008-04-04 | 2009-10-29 | Fujifilm Corp | 画像処理システム、画像処理方法、およびプログラム |
JP2013518463A (ja) * | 2010-01-22 | 2013-05-20 | トムソン ライセンシング | サンプリングベースの超解像度ビデオ符号化および復号化方法並びに装置 |
US8447128B2 (en) | 2008-04-07 | 2013-05-21 | Fujifilm Corporation | Image processing system |
US9338477B2 (en) | 2010-09-10 | 2016-05-10 | Thomson Licensing | Recovering a pruned version of a picture in a video sequence for example-based data pruning using intra-frame patch similarity |
JP2017005687A (ja) * | 2015-04-23 | 2017-01-05 | アクシス アーベー | ビデオカメラでビデオストリームを処理する方法及び装置 |
US9544598B2 (en) | 2010-09-10 | 2017-01-10 | Thomson Licensing | Methods and apparatus for pruning decision optimization in example-based data pruning compression |
US9813707B2 (en) | 2010-01-22 | 2017-11-07 | Thomson Licensing Dtv | Data pruning for video compression using example-based super-resolution |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2449631B (en) * | 2007-05-21 | 2012-02-15 | Doo Technologies Fze | Method and system for processing of images |
JP4518111B2 (ja) * | 2007-07-13 | 2010-08-04 | ソニー株式会社 | 映像処理装置、映像処理方法、及びプログラム |
AU2007237313A1 (en) * | 2007-12-03 | 2009-06-18 | Canon Kabushiki Kaisha | Improvement for error correction in distributed vdeo coding |
AU2007242924A1 (en) * | 2007-12-12 | 2009-07-02 | Canon Kabushiki Kaisha | Improvement for error correction in distributed video coding |
KR100939917B1 (ko) | 2008-03-07 | 2010-02-03 | 에스케이 텔레콤주식회사 | 움직임 예측을 통한 부호화 시스템 및 움직임 예측을 통한부호화 방법 |
US8274603B2 (en) * | 2008-03-28 | 2012-09-25 | Microsoft Corporation | Choosing video deinterlacing interpolant based on cost |
US20090304293A1 (en) * | 2008-06-08 | 2009-12-10 | Te-Hao Chang | Motion estimation method and related apparatus for efficiently selecting motion vector |
US8755515B1 (en) | 2008-09-29 | 2014-06-17 | Wai Wu | Parallel signal processing system and method |
US8359205B2 (en) | 2008-10-24 | 2013-01-22 | The Nielsen Company (Us), Llc | Methods and apparatus to perform audio watermarking and watermark detection and extraction |
US8121830B2 (en) * | 2008-10-24 | 2012-02-21 | The Nielsen Company (Us), Llc | Methods and apparatus to extract data encoded in media content |
US9667365B2 (en) | 2008-10-24 | 2017-05-30 | The Nielsen Company (Us), Llc | Methods and apparatus to perform audio watermarking and watermark detection and extraction |
US8667162B2 (en) * | 2008-12-31 | 2014-03-04 | Industrial Technology Research Institute | Method, apparatus and computer program product for providing a mobile streaming adaptor |
US8520736B2 (en) * | 2009-04-14 | 2013-08-27 | Fastvdo, Llc | Real-time superresolution and video transmission |
CA2760677C (en) | 2009-05-01 | 2018-07-24 | David Henry Harkness | Methods, apparatus and articles of manufacture to provide secondary content in association with primary broadcast media content |
JP5184447B2 (ja) * | 2009-06-22 | 2013-04-17 | 株式会社Kddi研究所 | 動画像符号化装置および復号装置 |
US8548062B2 (en) * | 2010-07-16 | 2013-10-01 | Sharp Laboratories Of America, Inc. | System for low resolution power reduction with deblocking flag |
TWI606418B (zh) * | 2012-09-28 | 2017-11-21 | 輝達公司 | 圖形處理單元驅動程式產生內插的圖框之電腦系統及方法 |
US20150350565A1 (en) * | 2014-05-29 | 2015-12-03 | Opentv, Inc. | Techniques for magnifying a high resolution image |
WO2020012556A1 (ja) * | 2018-07-10 | 2020-01-16 | オリンパス株式会社 | 撮像装置、画像補正方法および画像補正プログラム |
EP3648059B1 (en) * | 2018-10-29 | 2021-02-24 | Axis AB | Video processing device and method for determining motion metadata for an encoded video |
US11381867B2 (en) * | 2019-01-08 | 2022-07-05 | Qualcomm Incorporated | Multiple decoder interface for streamed media data |
CN115361582B (zh) * | 2022-07-19 | 2023-04-25 | 鹏城实验室 | 一种视频实时超分辨率处理方法、装置、终端及存储介质 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69426584T2 (de) * | 1993-08-06 | 2001-06-13 | Lg Electronics Inc., Seoul/Soul | Einrichtung zur Umsetzung der Vollbildfrequenz |
US5569520A (en) * | 1994-01-12 | 1996-10-29 | Martin Marietta Energy Systems, Inc. | Rechargeable lithium battery for use in applications requiring a low to high power output |
US5621467A (en) * | 1995-02-16 | 1997-04-15 | Thomson Multimedia S.A. | Temporal-spatial error concealment apparatus and method for video signal processors |
EP0961991B1 (en) * | 1997-12-22 | 2004-06-16 | Koninklijke Philips Electronics N.V. | Method and arrangement for creating a high-resolution still picture |
WO1999052281A2 (en) | 1998-04-03 | 1999-10-14 | Miranda Technologies Inc. | Hdtv up converter |
US6192079B1 (en) * | 1998-05-07 | 2001-02-20 | Intel Corporation | Method and apparatus for increasing video frame rate |
US6300973B1 (en) * | 2000-01-13 | 2001-10-09 | Meir Feder | Method and system for multimedia communication control |
US6510177B1 (en) * | 2000-03-24 | 2003-01-21 | Microsoft Corporation | System and method for layered video coding enhancement |
JP4765194B2 (ja) * | 2001-05-10 | 2011-09-07 | ソニー株式会社 | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム格納媒体及び動画像符号化プログラム |
US7088780B2 (en) * | 2001-05-11 | 2006-08-08 | Mitsubishi Electric Research Labs, Inc. | Video transcoder with drift compensation |
US6612153B2 (en) * | 2001-06-05 | 2003-09-02 | Agilent Technologies, Inc. | Planar manifold with integrated heated injector inlet and unheated pneumatics |
WO2003036978A1 (en) * | 2001-10-26 | 2003-05-01 | Koninklijke Philips Electronics N.V. | Method and apparatus for spatial scalable compression |
JP4015934B2 (ja) * | 2002-04-18 | 2007-11-28 | 株式会社東芝 | 動画像符号化方法及び装置 |
US20040131122A1 (en) * | 2002-12-09 | 2004-07-08 | Kei Kudo | Encoding device and encoding method |
KR20050105222A (ko) * | 2003-02-17 | 2005-11-03 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 비디오 부호화 |
- 2004
  - 2004-08-30 EP EP04020569A patent/EP1631089A1/en not_active Withdrawn
- 2005
  - 2005-08-29 US US11/661,277 patent/US8208549B2/en active Active
  - 2005-08-29 WO PCT/JP2005/015679 patent/WO2006025339A1/ja active Application Filing
  - 2005-08-29 EP EP05775151A patent/EP1788817A4/en not_active Withdrawn
  - 2005-08-29 JP JP2006532683A patent/JP4949028B2/ja active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6335093A (ja) * | 1986-07-30 | 1988-02-15 | Sony Corp | 高能率符号化装置 |
JPH06209468A (ja) * | 1993-01-11 | 1994-07-26 | Sony Corp | 画像信号符号化方法および画像信号符号化装置、並びに画像信号復号化方法および画像信号復号化装置 |
JPH10126749A (ja) * | 1996-10-14 | 1998-05-15 | Toshiba Corp | 順次走査変換装置 |
JP2000036963A (ja) * | 1998-07-17 | 2000-02-02 | Sony Corp | 画像符号化装置、画像符号化方法および画像復号化装置 |
JP2003134476A (ja) * | 2001-10-24 | 2003-05-09 | Hitachi Ltd | 走査変換処理装置 |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7983530B2 (en) | 2005-11-07 | 2011-07-19 | Sony Corporation | Recording and playback apparatus and recording and playback method, recording apparatus and recording method, playback apparatus and playback method, and program |
JP2007129647A (ja) * | 2005-11-07 | 2007-05-24 | Sony Corp | 記録再生装置および記録再生方法、記録装置および記録方法、再生装置および再生方法、並びにプログラム |
JP2009253586A (ja) * | 2008-04-04 | 2009-10-29 | Fujifilm Corp | 画像処理システム、画像処理方法、およびプログラム |
US8447128B2 (en) | 2008-04-07 | 2013-05-21 | Fujifilm Corporation | Image processing system |
JP2009253764A (ja) * | 2008-04-08 | 2009-10-29 | Fujifilm Corp | 画像処理システム、画像処理方法、およびプログラム |
US9602814B2 (en) | 2010-01-22 | 2017-03-21 | Thomson Licensing | Methods and apparatus for sampling-based super resolution video encoding and decoding |
JP2013518463A (ja) * | 2010-01-22 | 2013-05-20 | トムソン ライセンシング | サンプリングベースの超解像度ビデオ符号化および復号化方法並びに装置 |
US9813707B2 (en) | 2010-01-22 | 2017-11-07 | Thomson Licensing Dtv | Data pruning for video compression using example-based super-resolution |
KR101789845B1 (ko) * | 2010-01-22 | 2017-11-20 | 톰슨 라이센싱 | 샘플링 기반 초 해상도 비디오 인코딩 및 디코딩을 위한 방법 및 장치 |
US9338477B2 (en) | 2010-09-10 | 2016-05-10 | Thomson Licensing | Recovering a pruned version of a picture in a video sequence for example-based data pruning using intra-frame patch similarity |
US9544598B2 (en) | 2010-09-10 | 2017-01-10 | Thomson Licensing | Methods and apparatus for pruning decision optimization in example-based data pruning compression |
JP2017005687A (ja) * | 2015-04-23 | 2017-01-05 | アクシス アーベー | ビデオカメラでビデオストリームを処理する方法及び装置 |
US10057591B2 (en) | 2015-04-23 | 2018-08-21 | Axis Ab | Method and device for processing a video stream in a video camera |
Also Published As
Publication number | Publication date |
---|---|
EP1788817A1 (en) | 2007-05-23 |
US8208549B2 (en) | 2012-06-26 |
EP1788817A4 (en) | 2009-07-01 |
US20080117975A1 (en) | 2008-05-22 |
JP4949028B2 (ja) | 2012-06-06 |
JPWO2006025339A1 (ja) | 2008-05-08 |
EP1631089A1 (en) | 2006-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2006025339A1 (ja) | 復号化装置、符号化装置、復号化方法、符号化方法 | |
JP4594201B2 (ja) | 画像符号化方法、画像符号化装置、プログラムおよび集積回路 | |
KR101075270B1 (ko) | 움직임 검출 방법 및 동화상 부호화 방법 | |
TWI356595B (en) | Picture decoding apparatus and the methods | |
KR100948714B1 (ko) | 동화상 부호화 방법 및 동화상 복호화 방법 | |
KR100976672B1 (ko) | 동화상 부호화 방법 및 동화상 복호화 방법 | |
KR100967237B1 (ko) | 동화상 부호화 방법 및 동화상 복호화 방법 | |
KR100985236B1 (ko) | 움직임 보상 방법, 화상 부호화 방법 및 화상 복호화 방법 | |
JP4130783B2 (ja) | 動きベクトル符号化方法および動きベクトル復号化方法 | |
WO2004008773A1 (ja) | フィルタリング強度の決定方法、動画像符号化方法、および動画像復号化方法 | |
JP2008199587A (ja) | 画像符号化装置、画像復号化装置および方法 | |
JP4313710B2 (ja) | 画像符号化方法および画像復号化方法 | |
JP4641995B2 (ja) | 画像符号化方法および画像符号化装置 | |
JP4495013B2 (ja) | 動画符号化装置 | |
JP4519676B2 (ja) | 動き検出方法および動画像符号化方法 | |
CN101431679B (zh) | 图像编码方法及图像编码装置 | |
JP2004215215A (ja) | 動きベクトル検出方法 | |
JP2005142986A (ja) | 動画像符号化方法、動画像符号化装置および動画像符号化プログラム | |
JP2005176337A (ja) | 画像信号処理方法、画像信号処理装置、画像信号処理プログラムおよび集積回路装置 | |
JP2004040512A (ja) | 画像符号化方法および画像復号方法 | |
JP2004364064A (ja) | 動き推定方法および動画像符号化方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| AK | Designated states | Kind code of ref document: A1. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A1. Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
| WWE | Wipo information: entry into national phase | Ref document number: 2006532683. Country of ref document: JP |
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
| WWE | Wipo information: entry into national phase | Ref document number: 11661277. Country of ref document: US |
| WWE | Wipo information: entry into national phase | Ref document number: 2005775151. Country of ref document: EP |
| NENP | Non-entry into the national phase | Ref country code: DE |
| WWP | Wipo information: published in national office | Ref document number: 2005775151. Country of ref document: EP |
| WWP | Wipo information: published in national office | Ref document number: 11661277. Country of ref document: US |