WO2014112076A1 - Dispositif de codage vidéo - Google Patents
Dispositif de codage vidéo Download PDFInfo
- Publication number
- WO2014112076A1 WO2014112076A1 PCT/JP2013/050814 JP2013050814W WO2014112076A1 WO 2014112076 A1 WO2014112076 A1 WO 2014112076A1 JP 2013050814 W JP2013050814 W JP 2013050814W WO 2014112076 A1 WO2014112076 A1 WO 2014112076A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- previous
- referenceable
- flash
- difference
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/87—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving scene cut or scene change detection in combination with video compression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/142—Detection of scene cut or scene change
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
Definitions
- the present invention relates to a moving picture coding apparatus that performs coding using inter-frame prediction, and more particularly to a moving picture coding apparatus that can prevent image disturbance immediately after flashing.
- Inter-frame prediction is used in video compression.
- a flash image has a low degree of similarity with an image adjacent thereto, it is often inappropriate to use the flash image as a reference frame in inter-frame prediction. Therefore, it has been proposed to use the frame before the flash as the reference frame of the frame after the flash (see, for example, Patent Document 1).
- a flash image is detected by using two of the inter-frame difference value of the accumulated luminance value and the difference for each pixel of the line in the first direction / first direction.
- the difference for each pixel of the line in the first direction / first direction is too sensitive to movement, the difference increases only by moving one pixel from the previous frame, and false detection increases.
- the present invention has been made to solve the above-described problems, and an object of the present invention is to obtain a moving picture coding apparatus capable of preventing image disturbance immediately after flashing.
- the present invention is a moving image encoding apparatus that performs encoding using inter-frame prediction, and includes an encoding unit that encodes a difference value between an encoding target frame and a prediction frame, and a current picture of the encoding target frame , A scene change detection unit for detecting the presence or absence of a scene change for each of the previous and second previous referenceable pictures, and the luminance of a block corresponding between the current picture and the previous referenceable picture A luminance block accumulating unit that calculates X as the difference of each accumulated value accumulated by one picture, and the difference between the accumulated luminance values of the previous referenceable picture and the current picture as Y1 ,
- the difference between the accumulated values of the luminances of the previous referenceable picture and the current picture is Y2, and the previous referenceable picture and the current picture are (1)
- the current picture and the 1 are the difference between the accumulated luminance values of Y3, Y3, the luminance accumulation unit for calculating the difference between them, the first threshold value Xth, and
- the previous referenceable picture is determined to be a flash image, and (2) the one When a scene change is detected between the previous referenceable picture and the previous two referenceable pictures, and
- a flash determination unit that determines that the previous and second previous reference pictures are flash images, and refers to the first previous reference picture when the first previous reference picture is not determined to be a flash image;
- the prediction frame is calculated, and when the previous referenceable picture is determined to be a flash image, the prediction frame is calculated with reference to the two previous referenceable pictures, and the previous and second Prediction frame calculation for calculating the prediction frame with reference to the three previous referenceable pictures when the previous referenceable picture is determined to be a flash image And a section.
- the block size may be considered arbitrary. The larger the block size, the more moderate the increase / decrease of the difference value with respect to the motion.
- FIG. 1 is a block diagram showing a moving picture coding apparatus according to an embodiment of the present invention. This moving image encoding apparatus performs encoding using inter-frame prediction.
- the input moving image signal is a moving image signal (video signal) reproduced by a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times.
- a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times.
- An I picture is a picture that is encoded only with its own picture information and is the most beautiful but requires the most code amount.
- a P picture is a picture that is encoded only with its own picture information and is encoded with reference to a picture earlier than itself, and is more beautiful than a B picture, but has a larger amount of code than a B picture.
- a B picture is a picture that is encoded with reference to a past / future picture from itself, and the amount of code can be reduced most among three pictures.
- an I picture and a P picture are referenceable pictures that can be referred to when other pictures are encoded, while a B picture is a non-referenceable picture.
- Video interface 1 inputs an input video signal and outputs a frame to be encoded.
- the subtracter 2 calculates the difference between the encoding target frame and the predicted frame.
- the discrete cosine transformer 3 performs a discrete cosine transform (DCT) on the output of the subtracter 2 in units of a fixed block.
- the quantization unit 4 quantizes the DCT coefficient data obtained by the discrete cosine transform.
- the inverse quantization unit 5 performs inverse quantization on the quantized DCT coefficient data.
- the inverse discrete cosine transformer 6 performs inverse discrete cosine transform (inverse DCT) on the inversely quantized DCT coefficient data.
- the adder 7 adds the output of the inverse discrete cosine transformer 6 and the prediction frame, and stores the addition result in the video memory 8 as a referenceable picture.
- the motion compensation prediction unit 9 does not perform motion vector detection when the encoding target frame is an I picture, and outputs 0 as a prediction frame.
- the motion compensation prediction unit 9 starts from the encoding target frame. A motion vector is detected for each macroblock, and a prediction frame is generated.
- Motion vector detection is performed in units of macroblocks between the encoding target frame and the referenceable picture stored in the video memory 8. Specifically, by detecting the macroblock having the highest correlation of the referenceable picture with respect to the macroblock of the encoding target frame, the macroblock of the encoding target frame has moved from which macroblock of the referenceable picture. Is detected as a motion vector.
- the motion compensation prediction unit 9 performs motion compensation on the referenceable picture using this motion vector to calculate a prediction frame.
- the motion compensation prediction unit 9 also outputs a prediction mode of motion compensation prediction and prediction mode / motion vector information indicating a motion vector.
- the entropy encoding unit 10 encodes the quantized DCT coefficient data and the prediction mode / motion vector information. In this way, the entropy encoding unit 10 encodes the difference value between the encoding target frame and the prediction frame.
- the header / syntax adding unit 11 adds syntax information to the encoded result.
- a number ref-idx is assigned as syntax information to an image to be a predicted frame.
- Ref-idx is assigned in order from 0 in order of time closest to the current picture being processed.
- the scene change detection unit 12 detects the presence / absence of a scene change (SC) for each of the current picture of the encoding target frame, the previous one picture, and the second previous reference picture.
- FIG. 2 is a diagram showing the relationship between a flash image and a scene change.
- a scene change is detected between the current picture and the previous referenceable picture (pattern 1).
- the entire front surface of 2 pictures is a flash image, and 1/3 of the screen before 1 picture is a flash image (1/3 of the screen is white)
- the previous referenceable picture and the previous 2 pictures A scene change is detected in the referenceable picture (pattern 2).
- FIG. 3 is a diagram showing a calculation method of the luminance block accumulation unit.
- the luminance block accumulating unit 13 calculates, as X, a value obtained by accumulating the difference for each accumulated value of luminance of the corresponding block between the current picture and the previous referenceable picture for one picture.
- the block is a macroblock of 16 pixels ⁇ 16 pixels here, but is not limited thereto, and may be a block of 4 pixels ⁇ 4 pixels, 8 pixels ⁇ 8 pixels, 16 pixels ⁇ 32 pixels, or the like.
- FIG. 4 is a diagram illustrating a calculation method of the luminance accumulation unit.
- the luminance accumulation unit 14 sets Y1 as the difference between the accumulated luminance values of the previous referenceable picture and the current picture, and determines the difference between the accumulated luminance values of the previous referenceable picture and the current picture.
- Y2 is set as Y2
- the difference between the accumulated values of the luminance of the three previous referenceable pictures and the current picture is set as Y3, and the difference between them is calculated.
- the flash determination unit 15 determines the flash image as follows by setting the first threshold value to Xth and the second threshold value to Yth. (1) When a scene change is detected between the current picture and the previous referenceable picture, and
- the flash determination unit 15 detects the flash image
- the flash determination unit 15 instructs to change the predicted frame to the frame immediately before the flash image.
- the motion compensated prediction unit 9 calculates a prediction frame by referring to the previous referenceable picture when the previous referenceable picture is not determined to be a flash image, and calculates the previous frame.
- a prediction frame is calculated with reference to the previous referenceable picture, and when the previous and second previous referenceable pictures are determined to be flash images, 3 is calculated.
- a prediction frame is calculated with reference to the previous referenceable picture.
- FIG. 5 and 6 are diagrams showing examples of selecting pictures to be referred to when calculating a prediction frame.
- FIG. 5 shows a case where the interval M to the referenceable picture (I or P picture) is 1.
- the previous picture is referred to, and in the case of 2 picture flash, the previous picture is referred to so that the flash image is not referred to.
- FIG. 6 shows a case where M is 3, the interval N to the next I picture is 15, and the fourth P picture is a flash image.
- the 1st I picture is referred to without referring to the 4th P picture.
- the I picture is a flash picture
- the picture immediately after that is changed to a P picture and encoded. After that, encoding is performed with a GOP configuration adapted to the change. Although the immediately following picture may be an I picture, a large amount of codes are generated.
- the B picture is a flash image, the B picture is not referred to in the first place, so that encoding is performed as usual.
- the flash determination unit 15 detects the flash image, the flash determination unit 15 instructs the header / syntax adding unit 11 to change ref-idx of the frame immediately before the flash image to 0.
- the calculation result X of the luminance block accumulating unit 13 is more responsive to the motion of the picture (increase / decrease in the difference value) than the accumulated luminance difference for each pixel.
- the difference between Y1, Y2, and Y3 calculated by the luminance accumulation unit 14 is more responsive to the motion of the picture (increase / decrease in the difference value) than the calculation result X of the luminance block accumulation unit 13. Accordingly, since the flash image is determined for each scene change calculation pattern based on these calculation results, erroneous detection of the flash image is reduced. Further, it is possible to cope with a two-picture flash in which a flash is generated over two frames. For this reason, the disturbance of the image immediately after the flash can be prevented.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
La présente invention concerne une unité de codage (10) qui code des valeurs de différence entre une trame à coder et une trame prédite. Selon l'invention, une unité de détection de changements de scène (12) détecte la présence ou l'absence de changements de scène. Un accumulateur de blocs de luminance (13) calcule la différence de chaque valeur cumulée de la luminance de blocs correspondants entre l'image courante et la première image précédente pouvant servir de référence, en désignant la valeur cumulée pour une image en tant que X. Un accumulateur de luminance (14) calcule la différence entre chaque élément parmi : la différence (Y1) entre des valeurs cumulées de la luminance entre la première image précédente pouvant servir de référence et l'image courante ; la différence (Y2) entre des valeurs cumulées de la luminance de la deuxième image précédente pouvant servir de référence et de l'image courante ; et la différence (Y3) entre des valeurs cumulées de luminance entre la troisième image précédente pouvant servir de référence et l'image courante. Une unité d'évaluation flash (15) évalue une image flash sur la base des différences entre un motif de détection de changements de scène, X, et Y1 à Y3. Une unité de calcul de trame prédite (9) calcule une trame prédite par référencement à l'image pouvant servir de référence juste avant l'image flash.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014557244A JP5880737B2 (ja) | 2013-01-17 | 2013-01-17 | 動画像符号化装置 |
PCT/JP2013/050814 WO2014112076A1 (fr) | 2013-01-17 | 2013-01-17 | Dispositif de codage vidéo |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2013/050814 WO2014112076A1 (fr) | 2013-01-17 | 2013-01-17 | Dispositif de codage vidéo |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014112076A1 true WO2014112076A1 (fr) | 2014-07-24 |
Family
ID=51209197
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2013/050814 WO2014112076A1 (fr) | 2013-01-17 | 2013-01-17 | Dispositif de codage vidéo |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP5880737B2 (fr) |
WO (1) | WO2014112076A1 (fr) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07193818A (ja) * | 1993-12-27 | 1995-07-28 | Sony Corp | 画像処理方法および画像処理装置 |
JP2000115778A (ja) * | 1998-09-25 | 2000-04-21 | Sarnoff Corp | ビデオ・デ―タ中のフラッシュ・フレ―ムの検出および符号化 |
JP2002010270A (ja) * | 2000-06-27 | 2002-01-11 | Mitsubishi Electric Corp | 画像符号化装置及び画像符号化方法 |
JP2006115336A (ja) * | 2004-10-15 | 2006-04-27 | Canon Inc | 動画像符号化装置及びその制御方法 |
JP2007166408A (ja) * | 2005-12-15 | 2007-06-28 | Toshiba Corp | 画像処理装置及び画像処理方法 |
JP2011205354A (ja) * | 2010-03-25 | 2011-10-13 | Renesas Electronics Corp | 動画像符号化装置及び動画像符号化方法並びにプログラム |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4585981B2 (ja) * | 2006-03-10 | 2010-11-24 | キヤノン株式会社 | 画像符号化方法及び画像符号化装置 |
US20110211633A1 (en) * | 2008-11-12 | 2011-09-01 | Ferran Valldosera | Light change coding |
US8681265B2 (en) * | 2009-04-16 | 2014-03-25 | Panasonic Corporation | Imaging device, external flash detection method, program, and integrated circuit |
-
2013
- 2013-01-17 JP JP2014557244A patent/JP5880737B2/ja active Active
- 2013-01-17 WO PCT/JP2013/050814 patent/WO2014112076A1/fr active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07193818A (ja) * | 1993-12-27 | 1995-07-28 | Sony Corp | 画像処理方法および画像処理装置 |
JP2000115778A (ja) * | 1998-09-25 | 2000-04-21 | Sarnoff Corp | ビデオ・デ―タ中のフラッシュ・フレ―ムの検出および符号化 |
JP2002010270A (ja) * | 2000-06-27 | 2002-01-11 | Mitsubishi Electric Corp | 画像符号化装置及び画像符号化方法 |
JP2006115336A (ja) * | 2004-10-15 | 2006-04-27 | Canon Inc | 動画像符号化装置及びその制御方法 |
JP2007166408A (ja) * | 2005-12-15 | 2007-06-28 | Toshiba Corp | 画像処理装置及び画像処理方法 |
JP2011205354A (ja) * | 2010-03-25 | 2011-10-13 | Renesas Electronics Corp | 動画像符号化装置及び動画像符号化方法並びにプログラム |
Also Published As
Publication number | Publication date |
---|---|
JPWO2014112076A1 (ja) | 2017-01-19 |
JP5880737B2 (ja) | 2016-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7171052B2 (en) | Apparatus and method for correcting motion of image | |
JP5197591B2 (ja) | 映像符号化方法及び復号方法、それらの装置、それらのプログラム並びにプログラムを記録した記録媒体 | |
JP5426655B2 (ja) | 圧縮動画符号化装置、圧縮動画復号化装置、圧縮動画符号化方法および圧縮動画復号化方法 | |
JP2007235291A (ja) | 適応量子化装置及び適応量子化プログラム | |
KR100561398B1 (ko) | 동화상 내 구획별로 휘도 변화를 검출하고 보상하는 장치 및 방법 | |
US20170070555A1 (en) | Video data flow compression method | |
US9066100B2 (en) | Image coding apparatus and image coding method | |
US20070140336A1 (en) | Video coding device and image recording/reproducing device | |
JP2006135557A (ja) | 動画像符号化装置 | |
JP3778208B2 (ja) | 画像符号化装置及び画像符号化方法 | |
JP2010258576A (ja) | シーンチェンジ検出装置および映像記録装置 | |
JP2006203597A (ja) | ディジタル画像復号装置及び方法 | |
JP5880737B2 (ja) | 動画像符号化装置 | |
JP3732760B2 (ja) | 物体認識装置及び物体認識方法 | |
JP4911625B2 (ja) | 画像処理装置、およびそれを搭載した撮像装置 | |
EP3547684A1 (fr) | Procédé, dispositif et système pour procédé de codage d'une séquence de trames dans un flux vidéo | |
JPH07184206A (ja) | 画像符号化装置 | |
JP2008199521A (ja) | 画像処理装置およびその方法 | |
JP2011129979A (ja) | 画像処理装置 | |
JP2007129335A (ja) | 画面動き検出装置 | |
JP2007324685A (ja) | 動画像処理装置及び方法 | |
JP2891250B2 (ja) | 動画像符号化方法および装置 | |
JP2002344949A (ja) | 動画像監視装置 | |
JP4690250B2 (ja) | フェード検出装置 | |
JPH11355781A (ja) | 動画像符号化装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13871567 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2014557244 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13871567 Country of ref document: EP Kind code of ref document: A1 |