WO2014112076A1 - Dispositif de codage vidéo - Google Patents

Dispositif de codage vidéo Download PDF

Info

Publication number
WO2014112076A1
WO2014112076A1 PCT/JP2013/050814 JP2013050814W WO2014112076A1 WO 2014112076 A1 WO2014112076 A1 WO 2014112076A1 JP 2013050814 W JP2013050814 W JP 2013050814W WO 2014112076 A1 WO2014112076 A1 WO 2014112076A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
previous
referenceable
flash
difference
Prior art date
Application number
PCT/JP2013/050814
Other languages
English (en)
Japanese (ja)
Inventor
啓介 篠沢
弘徳 遠藤
哲司 名幸
Original Assignee
Nttエレクトロニクス株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nttエレクトロニクス株式会社 filed Critical Nttエレクトロニクス株式会社
Priority to JP2014557244A priority Critical patent/JP5880737B2/ja
Priority to PCT/JP2013/050814 priority patent/WO2014112076A1/fr
Publication of WO2014112076A1 publication Critical patent/WO2014112076A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/87Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving scene cut or scene change detection in combination with video compression
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/142Detection of scene cut or scene change
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field

Definitions

  • the present invention relates to a moving picture coding apparatus that performs coding using inter-frame prediction, and more particularly to a moving picture coding apparatus that can prevent image disturbance immediately after flashing.
  • Inter-frame prediction is used in video compression.
  • a flash image has a low degree of similarity with an image adjacent thereto, it is often inappropriate to use the flash image as a reference frame in inter-frame prediction. Therefore, it has been proposed to use the frame before the flash as the reference frame of the frame after the flash (see, for example, Patent Document 1).
  • a flash image is detected by using two of the inter-frame difference value of the accumulated luminance value and the difference for each pixel of the line in the first direction / first direction.
  • the difference for each pixel of the line in the first direction / first direction is too sensitive to movement, the difference increases only by moving one pixel from the previous frame, and false detection increases.
  • the present invention has been made to solve the above-described problems, and an object of the present invention is to obtain a moving picture coding apparatus capable of preventing image disturbance immediately after flashing.
  • the present invention is a moving image encoding apparatus that performs encoding using inter-frame prediction, and includes an encoding unit that encodes a difference value between an encoding target frame and a prediction frame, and a current picture of the encoding target frame , A scene change detection unit for detecting the presence or absence of a scene change for each of the previous and second previous referenceable pictures, and the luminance of a block corresponding between the current picture and the previous referenceable picture A luminance block accumulating unit that calculates X as the difference of each accumulated value accumulated by one picture, and the difference between the accumulated luminance values of the previous referenceable picture and the current picture as Y1 ,
  • the difference between the accumulated values of the luminances of the previous referenceable picture and the current picture is Y2, and the previous referenceable picture and the current picture are (1)
  • the current picture and the 1 are the difference between the accumulated luminance values of Y3, Y3, the luminance accumulation unit for calculating the difference between them, the first threshold value Xth, and
  • the previous referenceable picture is determined to be a flash image, and (2) the one When a scene change is detected between the previous referenceable picture and the previous two referenceable pictures, and
  • a flash determination unit that determines that the previous and second previous reference pictures are flash images, and refers to the first previous reference picture when the first previous reference picture is not determined to be a flash image;
  • the prediction frame is calculated, and when the previous referenceable picture is determined to be a flash image, the prediction frame is calculated with reference to the two previous referenceable pictures, and the previous and second Prediction frame calculation for calculating the prediction frame with reference to the three previous referenceable pictures when the previous referenceable picture is determined to be a flash image And a section.
  • the block size may be considered arbitrary. The larger the block size, the more moderate the increase / decrease of the difference value with respect to the motion.
  • FIG. 1 is a block diagram showing a moving picture coding apparatus according to an embodiment of the present invention. This moving image encoding apparatus performs encoding using inter-frame prediction.
  • the input moving image signal is a moving image signal (video signal) reproduced by a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times.
  • a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times.
  • An I picture is a picture that is encoded only with its own picture information and is the most beautiful but requires the most code amount.
  • a P picture is a picture that is encoded only with its own picture information and is encoded with reference to a picture earlier than itself, and is more beautiful than a B picture, but has a larger amount of code than a B picture.
  • a B picture is a picture that is encoded with reference to a past / future picture from itself, and the amount of code can be reduced most among three pictures.
  • an I picture and a P picture are referenceable pictures that can be referred to when other pictures are encoded, while a B picture is a non-referenceable picture.
  • Video interface 1 inputs an input video signal and outputs a frame to be encoded.
  • the subtracter 2 calculates the difference between the encoding target frame and the predicted frame.
  • the discrete cosine transformer 3 performs a discrete cosine transform (DCT) on the output of the subtracter 2 in units of a fixed block.
  • the quantization unit 4 quantizes the DCT coefficient data obtained by the discrete cosine transform.
  • the inverse quantization unit 5 performs inverse quantization on the quantized DCT coefficient data.
  • the inverse discrete cosine transformer 6 performs inverse discrete cosine transform (inverse DCT) on the inversely quantized DCT coefficient data.
  • the adder 7 adds the output of the inverse discrete cosine transformer 6 and the prediction frame, and stores the addition result in the video memory 8 as a referenceable picture.
  • the motion compensation prediction unit 9 does not perform motion vector detection when the encoding target frame is an I picture, and outputs 0 as a prediction frame.
  • the motion compensation prediction unit 9 starts from the encoding target frame. A motion vector is detected for each macroblock, and a prediction frame is generated.
  • Motion vector detection is performed in units of macroblocks between the encoding target frame and the referenceable picture stored in the video memory 8. Specifically, by detecting the macroblock having the highest correlation of the referenceable picture with respect to the macroblock of the encoding target frame, the macroblock of the encoding target frame has moved from which macroblock of the referenceable picture. Is detected as a motion vector.
  • the motion compensation prediction unit 9 performs motion compensation on the referenceable picture using this motion vector to calculate a prediction frame.
  • the motion compensation prediction unit 9 also outputs a prediction mode of motion compensation prediction and prediction mode / motion vector information indicating a motion vector.
  • the entropy encoding unit 10 encodes the quantized DCT coefficient data and the prediction mode / motion vector information. In this way, the entropy encoding unit 10 encodes the difference value between the encoding target frame and the prediction frame.
  • the header / syntax adding unit 11 adds syntax information to the encoded result.
  • a number ref-idx is assigned as syntax information to an image to be a predicted frame.
  • Ref-idx is assigned in order from 0 in order of time closest to the current picture being processed.
  • the scene change detection unit 12 detects the presence / absence of a scene change (SC) for each of the current picture of the encoding target frame, the previous one picture, and the second previous reference picture.
  • FIG. 2 is a diagram showing the relationship between a flash image and a scene change.
  • a scene change is detected between the current picture and the previous referenceable picture (pattern 1).
  • the entire front surface of 2 pictures is a flash image, and 1/3 of the screen before 1 picture is a flash image (1/3 of the screen is white)
  • the previous referenceable picture and the previous 2 pictures A scene change is detected in the referenceable picture (pattern 2).
  • FIG. 3 is a diagram showing a calculation method of the luminance block accumulation unit.
  • the luminance block accumulating unit 13 calculates, as X, a value obtained by accumulating the difference for each accumulated value of luminance of the corresponding block between the current picture and the previous referenceable picture for one picture.
  • the block is a macroblock of 16 pixels ⁇ 16 pixels here, but is not limited thereto, and may be a block of 4 pixels ⁇ 4 pixels, 8 pixels ⁇ 8 pixels, 16 pixels ⁇ 32 pixels, or the like.
  • FIG. 4 is a diagram illustrating a calculation method of the luminance accumulation unit.
  • the luminance accumulation unit 14 sets Y1 as the difference between the accumulated luminance values of the previous referenceable picture and the current picture, and determines the difference between the accumulated luminance values of the previous referenceable picture and the current picture.
  • Y2 is set as Y2
  • the difference between the accumulated values of the luminance of the three previous referenceable pictures and the current picture is set as Y3, and the difference between them is calculated.
  • the flash determination unit 15 determines the flash image as follows by setting the first threshold value to Xth and the second threshold value to Yth. (1) When a scene change is detected between the current picture and the previous referenceable picture, and
  • the flash determination unit 15 detects the flash image
  • the flash determination unit 15 instructs to change the predicted frame to the frame immediately before the flash image.
  • the motion compensated prediction unit 9 calculates a prediction frame by referring to the previous referenceable picture when the previous referenceable picture is not determined to be a flash image, and calculates the previous frame.
  • a prediction frame is calculated with reference to the previous referenceable picture, and when the previous and second previous referenceable pictures are determined to be flash images, 3 is calculated.
  • a prediction frame is calculated with reference to the previous referenceable picture.
  • FIG. 5 and 6 are diagrams showing examples of selecting pictures to be referred to when calculating a prediction frame.
  • FIG. 5 shows a case where the interval M to the referenceable picture (I or P picture) is 1.
  • the previous picture is referred to, and in the case of 2 picture flash, the previous picture is referred to so that the flash image is not referred to.
  • FIG. 6 shows a case where M is 3, the interval N to the next I picture is 15, and the fourth P picture is a flash image.
  • the 1st I picture is referred to without referring to the 4th P picture.
  • the I picture is a flash picture
  • the picture immediately after that is changed to a P picture and encoded. After that, encoding is performed with a GOP configuration adapted to the change. Although the immediately following picture may be an I picture, a large amount of codes are generated.
  • the B picture is a flash image, the B picture is not referred to in the first place, so that encoding is performed as usual.
  • the flash determination unit 15 detects the flash image, the flash determination unit 15 instructs the header / syntax adding unit 11 to change ref-idx of the frame immediately before the flash image to 0.
  • the calculation result X of the luminance block accumulating unit 13 is more responsive to the motion of the picture (increase / decrease in the difference value) than the accumulated luminance difference for each pixel.
  • the difference between Y1, Y2, and Y3 calculated by the luminance accumulation unit 14 is more responsive to the motion of the picture (increase / decrease in the difference value) than the calculation result X of the luminance block accumulation unit 13. Accordingly, since the flash image is determined for each scene change calculation pattern based on these calculation results, erroneous detection of the flash image is reduced. Further, it is possible to cope with a two-picture flash in which a flash is generated over two frames. For this reason, the disturbance of the image immediately after the flash can be prevented.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

La présente invention concerne une unité de codage (10) qui code des valeurs de différence entre une trame à coder et une trame prédite. Selon l'invention, une unité de détection de changements de scène (12) détecte la présence ou l'absence de changements de scène. Un accumulateur de blocs de luminance (13) calcule la différence de chaque valeur cumulée de la luminance de blocs correspondants entre l'image courante et la première image précédente pouvant servir de référence, en désignant la valeur cumulée pour une image en tant que X. Un accumulateur de luminance (14) calcule la différence entre chaque élément parmi : la différence (Y1) entre des valeurs cumulées de la luminance entre la première image précédente pouvant servir de référence et l'image courante ; la différence (Y2) entre des valeurs cumulées de la luminance de la deuxième image précédente pouvant servir de référence et de l'image courante ; et la différence (Y3) entre des valeurs cumulées de luminance entre la troisième image précédente pouvant servir de référence et l'image courante. Une unité d'évaluation flash (15) évalue une image flash sur la base des différences entre un motif de détection de changements de scène, X, et Y1 à Y3. Une unité de calcul de trame prédite (9) calcule une trame prédite par référencement à l'image pouvant servir de référence juste avant l'image flash.
PCT/JP2013/050814 2013-01-17 2013-01-17 Dispositif de codage vidéo WO2014112076A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2014557244A JP5880737B2 (ja) 2013-01-17 2013-01-17 動画像符号化装置
PCT/JP2013/050814 WO2014112076A1 (fr) 2013-01-17 2013-01-17 Dispositif de codage vidéo

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2013/050814 WO2014112076A1 (fr) 2013-01-17 2013-01-17 Dispositif de codage vidéo

Publications (1)

Publication Number Publication Date
WO2014112076A1 true WO2014112076A1 (fr) 2014-07-24

Family

ID=51209197

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2013/050814 WO2014112076A1 (fr) 2013-01-17 2013-01-17 Dispositif de codage vidéo

Country Status (2)

Country Link
JP (1) JP5880737B2 (fr)
WO (1) WO2014112076A1 (fr)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07193818A (ja) * 1993-12-27 1995-07-28 Sony Corp 画像処理方法および画像処理装置
JP2000115778A (ja) * 1998-09-25 2000-04-21 Sarnoff Corp ビデオ・デ―タ中のフラッシュ・フレ―ムの検出および符号化
JP2002010270A (ja) * 2000-06-27 2002-01-11 Mitsubishi Electric Corp 画像符号化装置及び画像符号化方法
JP2006115336A (ja) * 2004-10-15 2006-04-27 Canon Inc 動画像符号化装置及びその制御方法
JP2007166408A (ja) * 2005-12-15 2007-06-28 Toshiba Corp 画像処理装置及び画像処理方法
JP2011205354A (ja) * 2010-03-25 2011-10-13 Renesas Electronics Corp 動画像符号化装置及び動画像符号化方法並びにプログラム

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4585981B2 (ja) * 2006-03-10 2010-11-24 キヤノン株式会社 画像符号化方法及び画像符号化装置
US20110211633A1 (en) * 2008-11-12 2011-09-01 Ferran Valldosera Light change coding
US8681265B2 (en) * 2009-04-16 2014-03-25 Panasonic Corporation Imaging device, external flash detection method, program, and integrated circuit

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07193818A (ja) * 1993-12-27 1995-07-28 Sony Corp 画像処理方法および画像処理装置
JP2000115778A (ja) * 1998-09-25 2000-04-21 Sarnoff Corp ビデオ・デ―タ中のフラッシュ・フレ―ムの検出および符号化
JP2002010270A (ja) * 2000-06-27 2002-01-11 Mitsubishi Electric Corp 画像符号化装置及び画像符号化方法
JP2006115336A (ja) * 2004-10-15 2006-04-27 Canon Inc 動画像符号化装置及びその制御方法
JP2007166408A (ja) * 2005-12-15 2007-06-28 Toshiba Corp 画像処理装置及び画像処理方法
JP2011205354A (ja) * 2010-03-25 2011-10-13 Renesas Electronics Corp 動画像符号化装置及び動画像符号化方法並びにプログラム

Also Published As

Publication number Publication date
JPWO2014112076A1 (ja) 2017-01-19
JP5880737B2 (ja) 2016-03-09

Similar Documents

Publication Publication Date Title
US7171052B2 (en) Apparatus and method for correcting motion of image
JP5197591B2 (ja) 映像符号化方法及び復号方法、それらの装置、それらのプログラム並びにプログラムを記録した記録媒体
JP5426655B2 (ja) 圧縮動画符号化装置、圧縮動画復号化装置、圧縮動画符号化方法および圧縮動画復号化方法
JP2007235291A (ja) 適応量子化装置及び適応量子化プログラム
KR100561398B1 (ko) 동화상 내 구획별로 휘도 변화를 검출하고 보상하는 장치 및 방법
US20170070555A1 (en) Video data flow compression method
US9066100B2 (en) Image coding apparatus and image coding method
US20070140336A1 (en) Video coding device and image recording/reproducing device
JP2006135557A (ja) 動画像符号化装置
JP3778208B2 (ja) 画像符号化装置及び画像符号化方法
JP2010258576A (ja) シーンチェンジ検出装置および映像記録装置
JP2006203597A (ja) ディジタル画像復号装置及び方法
JP5880737B2 (ja) 動画像符号化装置
JP3732760B2 (ja) 物体認識装置及び物体認識方法
JP4911625B2 (ja) 画像処理装置、およびそれを搭載した撮像装置
EP3547684A1 (fr) Procédé, dispositif et système pour procédé de codage d'une séquence de trames dans un flux vidéo
JPH07184206A (ja) 画像符号化装置
JP2008199521A (ja) 画像処理装置およびその方法
JP2011129979A (ja) 画像処理装置
JP2007129335A (ja) 画面動き検出装置
JP2007324685A (ja) 動画像処理装置及び方法
JP2891250B2 (ja) 動画像符号化方法および装置
JP2002344949A (ja) 動画像監視装置
JP4690250B2 (ja) フェード検出装置
JPH11355781A (ja) 動画像符号化装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13871567

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2014557244

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13871567

Country of ref document: EP

Kind code of ref document: A1