WO2017203555A1 - Dispositif de codage, dispositif de photographie et programme - Google Patents

Dispositif de codage, dispositif de photographie et programme Download PDF

Info

Publication number
WO2017203555A1
WO2017203555A1 PCT/JP2016/065115 JP2016065115W WO2017203555A1 WO 2017203555 A1 WO2017203555 A1 WO 2017203555A1 JP 2016065115 W JP2016065115 W JP 2016065115W WO 2017203555 A1 WO2017203555 A1 WO 2017203555A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
difference
pictures
importance
motion
Prior art date
Application number
PCT/JP2016/065115
Other languages
English (en)
Japanese (ja)
Inventor
尚宏 木皿
龍博 石橋
達也 橋本
勝大 草野
隆宏 平松
Original Assignee
三菱電機ビルテクノサービス株式会社
三菱電機株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 三菱電機ビルテクノサービス株式会社, 三菱電機株式会社 filed Critical 三菱電機ビルテクノサービス株式会社
Priority to PCT/JP2016/065115 priority Critical patent/WO2017203555A1/fr
Publication of WO2017203555A1 publication Critical patent/WO2017203555A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/114Adapting the group of pictures [GOP] structure, e.g. number of B-frames between two anchor frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/137Motion inside a coding unit, e.g. average field, frame or block difference
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/65Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience

Definitions

  • the present invention relates to an encoding device, an imaging device, and a program, and more particularly to encoding of a moving image generated by imaging using an imaging device such as a surveillance camera.
  • a moving image is composed of a set of still images, but not a still image (hereinafter referred to as “picture”) as it is, but compresses a picture by encoding to reduce storage capacity and data communication amount.
  • a moving image is composed of an I picture and a P picture by encoding.
  • An I picture has no dependency with other pictures and can be decoded independently to generate an image.
  • a P picture is an image formed by extracting only a moving part, and an image cannot be generated even if it is decoded alone.
  • a conventional encoding apparatus forms an I picture and a P picture as follows.
  • a series of moving images is divided into a plurality of groups. For example, a group is formed every 30 pictures. This formed group is called a picture group or GOP (Group Of Pictures).
  • GOP Group Of Pictures
  • the encoding device sets the first picture of the formed picture group as an I picture. Subsequently, the encoding apparatus generates a P picture by extracting the difference between the remaining picture and the previous picture. For example, the second picture in the picture group is generated by the difference from the previous picture, that is, the first picture. The third picture is generated by the difference from the previous picture, that is, the second picture. In this way, each P picture is formed by a difference from the immediately preceding picture.
  • the picture group is composed of a set of one uncompressed I picture and a compressed n-1 P picture, thereby reducing the amount of data.
  • the video (moving image) shot and generated by the surveillance camera or the like is compressed as described above by the encoding device built in the surveillance camera, transmitted to the PC or the like, and reproduced.
  • the second picture in the picture group is decoded by adding the corresponding P picture to the previous picture, that is, the first picture.
  • the third picture is decoded by adding the corresponding P picture to the previous picture, that is, the decoded second picture. In this way, by decoding each P picture, the moving image can be reproduced at the transmission destination.
  • a P picture generated by a conventional encoding device is a difference picture generated by a difference from the immediately preceding picture.
  • the third P picture cannot be decoded without the second picture.
  • the second P picture cannot be decoded without the first picture. That is, the third P picture cannot be decoded without the first and second pictures.
  • An object of the present invention is to enable decoding of a difference picture subsequent to a missing difference picture even if the difference picture generated by encoding is missing.
  • the encoding apparatus extracts any picture included in a picture group formed from a moving picture as a reference picture that can be decoded independently, and each picture other than the reference picture and the reference picture included in the picture group And a picture generating means for generating each differential picture from the difference between the two.
  • a motion detection unit that detects a motion of the difference picture based on a difference between the difference picture and a picture immediately before the difference picture, and each of the difference pictures is important in descending order of the motion detected by the motion detection unit.
  • Importance setting means for setting the degree.
  • An image capturing apparatus extracts a picture included in a picture group formed from a moving image generated by shooting by a shooting unit and the shooting unit as a reference picture that can be decoded independently, and a reference picture Coding means having a picture generation unit for generating each difference picture from a difference from each picture other than the reference picture included in the picture group.
  • the encoding means further includes a motion detection unit for detecting a motion of the difference picture based on a difference between the difference picture and a picture immediately before the difference picture, and for each difference picture, An importance level setting unit that sets importance levels in descending order of motion detected by the motion detection unit, and the transmission unit sets the difference pictures following the reference picture by the importance level setting unit. Are transmitted in the order of importance.
  • the program according to the present invention extracts a computer as a reference picture that can independently decode any picture included in a picture group formed from a moving image, and each of the reference pictures and the reference pictures included in the picture group. This is to function as a picture generating means for generating each differential picture from the difference from the picture.
  • a difference picture subsequent to the lost difference picture can be decoded.
  • FIG. 1 is a block configuration diagram showing an embodiment of a photographing apparatus according to the present invention. It is the block block diagram which showed one Embodiment of the encoding apparatus which concerns on this invention. It is the flowchart which showed the encoding process in this Embodiment. It is the conceptual diagram which showed the production
  • FIG. 1 is a block diagram showing an embodiment of a photographing apparatus according to the present invention.
  • the photographing apparatus 1 in the present embodiment is an apparatus that generates a moving image by photographing such as a surveillance camera.
  • the photographing device 1 includes a photographing unit 2 for photographing video, a moving image storage unit 3 for storing video data (moving images), an encoding device 4 for encoding moving images, and reproducing the encoded moving images on a PC or the like. Or it has the transmission part 5 which transmits to the apparatus which can be preserve
  • FIG. 2 is a block diagram showing an embodiment of the encoding apparatus according to the present invention.
  • the encoding device 4 in the present embodiment includes a GOP forming unit 41, a picture generating unit 42, a motion detecting unit 43, and an importance setting unit 44. Note that components not used in the description of the present embodiment are omitted from FIG.
  • the GOP forming unit 41 forms a picture group from a moving image.
  • the picture generation unit 42 extracts any picture included in the picture group as an I picture that can be decoded independently, and generates each P picture from the difference between the I picture and each picture other than the I picture included in the picture group. To do.
  • the first picture included in the picture group is extracted as an I picture.
  • the motion detector 43 detects the motion of the P picture based on the difference between the P picture and the picture immediately before the P picture.
  • the importance level setting unit 44 sets the importance level for each P picture in descending order of motion detected by the motion detection unit 43.
  • the photographing apparatus 1 has a built-in computer having storage means such as a CPU, ROM, and RAM, and a network interface.
  • the components 41 to 44 in the encoding apparatus 4 are mounted on the computer and the computer. This is realized by a cooperative operation with a program running on a CPU. Alternatively, it may be realized by hardware.
  • the program used in this embodiment can be provided not only by communication means but also by storing it in a computer-readable recording medium such as a USB memory.
  • the program provided from the communication means or the recording medium is installed in the computer, and various processes are realized by the CPU of the computer sequentially executing the program.
  • a moving image is a set of a plurality of continuous still images.
  • This one still image is generally called a “picture” or “frame”, but in this embodiment, a “picture” is used.
  • a picture group is formed by a plurality of pictures (for example, 30 pictures).
  • a picture group is also called GOP (Group Of Pictures) and is a unit for compression, playback, and editing of pictures.
  • Each picture included in the picture group is encoded into an I picture or a P picture.
  • the I picture is a reference picture that has no dependency with other pictures and can be decoded independently to generate an image.
  • a P picture is a differential picture formed by extracting only a moving part, and an image cannot be generated even if it is decoded alone.
  • the present embodiment is characterized by this P picture generation method.
  • the GOP forming unit 41 When the GOP forming unit 41 obtains the moving image by reading out the moving image from the moving image storage unit 3 (step 101), the GOP forming unit 41 forms a picture group for each predetermined number of pictures (step 102). In this embodiment, a case where a picture group is formed every 30 sheets will be described as an example.
  • the nth P picture is generated based on the difference between the I picture and the nth picture as described above, and the concept of the encoding method for generating each picture based on the difference from the I picture is illustrated in FIG. 4 shows.
  • the fifth P picture is generated by the difference between the immediately preceding fourth picture and the fifth picture.
  • the eighth P picture was generated by the difference between the previous seventh picture and the eighth picture.
  • the fifth P picture is generated by the difference between the I picture (first picture) and the fifth picture.
  • the eighth P picture is generated by the difference between the I picture (first picture) and the eighth picture.
  • the motion detection unit 43 acquires the same picture group as that of the picture generation unit 42, and extracts the difference from the previous picture for the second and subsequent pictures included in the picture group, thereby detecting the motion in the picture. Detect (step 105). This may be calculated by adding up the total number of pixels whose pixel values have changed with respect to the previous picture. For the detection of this movement, the same method as before may be used.
  • processing in the picture generation unit 42 (steps 103 and 104) and the processing in the motion detection unit 43 (step 105) can be performed independently, so either of them may be executed first or in parallel. May be.
  • the importance level setting unit 44 sets the importance level for each picture generated by the encoding by the picture generation unit 42 (step 106).
  • the importance level setting unit 44 first sets the highest importance level 1 for the I picture.
  • the importance level setting unit 44 refers to the value indicating the motion detected by the motion detection unit 43 (in the above example, the total number of pixels in which the pixel value has changed), and the importance level setting unit 44 performs the second and subsequent values in descending order. Set importance.
  • the pictures 3 and 6 have a relatively high importance level because many motions are detected as compared with the I picture.
  • the pictures 2, 4, and 5 are set with relatively low importance because no motion is detected as compared with the I picture. If the value indicating motion is the same value, a predetermined rule such as setting a high importance level for a picture close to the head may be provided.
  • the transmission unit 5 transmits the encoded moving image to an information terminal device that can be reproduced or stored. At this time, the transmission unit 5 may transmit the images in the order in which the pictures are arranged, but may transmit the images in the descending order of importance with reference to the importance set by the encoding device 4.
  • FIG. 5 is a block configuration diagram of the information terminal device 6 in the present embodiment.
  • the information terminal device 6 in the present embodiment is realized by a general-purpose computer such as a PC. That is, the information terminal device 6 includes storage means such as a CPU, ROM, RAM, and HDD, a network interface, and user interface means such as a mouse, keyboard, and display.
  • the information terminal device 6 includes a receiving unit 7 that receives the encoded moving image transmitted from the photographing device 1, a decoding device 8 that decodes the encoded moving image, and a moving image that stores the decoded moving image. It has the memory
  • the decoding device 8 decodes the moving image received by the receiving unit 7 as follows.
  • P pictures are not rearranged according to importance.
  • the second picture is generated by decoding the immediately preceding first picture and second P picture.
  • the third picture is generated by decoding the immediately preceding second picture and the third P picture. That is, the third picture can be decoded only after the immediately preceding second picture is decoded normally.
  • the information terminal device cannot normally receive all the P pictures transmitted from the photographing device due to some event such as an increase in network load.
  • the fourth P picture cannot be received normally, conventionally, the fourth and subsequent pictures cannot be generated by decoding. That is, the moving image cannot be normally decoded until the next I picture can be normally received.
  • the nth picture is decoded by the I picture and the nth P picture. Therefore, even when the fourth P picture cannot be received normally, when the subsequent P pictures can be received normally, for example, when the fifth P picture can be received normally, the fifth picture is designated as an I picture. It can be normally decoded with the fifth P picture. As described above, according to the present embodiment, it is possible to minimize missing of a picture in a decoded moving image.
  • the importance is set in the descending order of the detected motion with respect to the encoded picture, and transmission is performed in the descending order of importance. Therefore, when a picture located in the second half of the picture group cannot be transmitted normally, it means that a picture with relatively small motion could not be transmitted. That is, even if transmission cannot be performed normally in the latter half of the transmission of a picture group, it means that a picture with a relatively small detected motion cannot be transmitted. That is, in this embodiment, even if a picture with relatively small motion cannot be decoded, it is highly possible that a picture with relatively large motion can be decoded normally. Becomes higher.
  • a threshold value indicating a motion amount that does not allow a motion to be visually recognized with a difference from the previous picture even if it is decoded and played back is set in advance, and the transmission unit 5 detects a motion whose motion is less than the threshold value. Avoid sending pictures. Thereby, it is possible to prevent an increase in network load accompanying the transmission of moving images. Further, even if a picture in which little motion is detected is not reproduced, the viewer can view the moving image without feeling uncomfortable.
  • the encoding device 4 is installed in the imaging device 1 such as a surveillance camera.
  • the encoding device 4 is not limited to the imaging device 1 and is installed in an apparatus that handles moving images. May be.

Abstract

La présente invention a pour objet de permettre, même lorsqu'une image différentielle générée par codage est manquante, de décoder des images différentielles qui suivent l'image différentielle manquante. Un dispositif 4 de codage est muni d'une unité 41 de formation de GOP servant à former un groupe d'images à partir d'une image animée générée par photographie à l'aide d'un dispositif de photographie, d'une unité 42 de génération d'images servant à générer une image de début (image I) du groupe d'images et P images par une différence par rapport à chaque image autre que l'image I, une unité 43 de détection de mouvement servant à détecter le mouvement de chaque image par une différence par rapport à l'image qui la précède immédiatement, et une unité 44 de spécification d'importance servant à attribuer la plus haute importance à l'image I et à attribuer une importance aux P images par ordre décroissant d'amplitude détectée de mouvement. Le dispositif de photographie incorporant le dispositif 4 de codage émet des images codées par ordre décroissant d'importance.
PCT/JP2016/065115 2016-05-23 2016-05-23 Dispositif de codage, dispositif de photographie et programme WO2017203555A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2016/065115 WO2017203555A1 (fr) 2016-05-23 2016-05-23 Dispositif de codage, dispositif de photographie et programme

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2016/065115 WO2017203555A1 (fr) 2016-05-23 2016-05-23 Dispositif de codage, dispositif de photographie et programme

Publications (1)

Publication Number Publication Date
WO2017203555A1 true WO2017203555A1 (fr) 2017-11-30

Family

ID=60411717

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2016/065115 WO2017203555A1 (fr) 2016-05-23 2016-05-23 Dispositif de codage, dispositif de photographie et programme

Country Status (1)

Country Link
WO (1) WO2017203555A1 (fr)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0795571A (ja) * 1993-09-27 1995-04-07 Oki Electric Ind Co Ltd 画像符号化装置、画像復号化装置及び多地点間データ伝送方法
JP2001224032A (ja) * 2000-02-10 2001-08-17 Mega Chips Corp 画像送信装置、画像受信装置、および、画像伝送システム
US20030086494A1 (en) * 1999-03-12 2003-05-08 Microsoft Corporation Media coding for loss recovery with remotely predicted data units
JP2006086991A (ja) * 2004-09-17 2006-03-30 Fuji Photo Film Co Ltd 監視カメラシステム、および監視カメラシステムの制御装置、並びに制御方法
JP2010087711A (ja) * 2008-09-30 2010-04-15 Sanyo Electric Co Ltd 画像処理装置
WO2011123882A2 (fr) * 2010-04-07 2011-10-13 Vincenzo Liguori Système de transmission de données vidéo présentant des exigences de mémoire réduites

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0795571A (ja) * 1993-09-27 1995-04-07 Oki Electric Ind Co Ltd 画像符号化装置、画像復号化装置及び多地点間データ伝送方法
US20030086494A1 (en) * 1999-03-12 2003-05-08 Microsoft Corporation Media coding for loss recovery with remotely predicted data units
JP2001224032A (ja) * 2000-02-10 2001-08-17 Mega Chips Corp 画像送信装置、画像受信装置、および、画像伝送システム
JP2006086991A (ja) * 2004-09-17 2006-03-30 Fuji Photo Film Co Ltd 監視カメラシステム、および監視カメラシステムの制御装置、並びに制御方法
JP2010087711A (ja) * 2008-09-30 2010-04-15 Sanyo Electric Co Ltd 画像処理装置
WO2011123882A2 (fr) * 2010-04-07 2011-10-13 Vincenzo Liguori Système de transmission de données vidéo présentant des exigences de mémoire réduites

Similar Documents

Publication Publication Date Title
JP5331316B2 (ja) 改良アラーム前映像バッファ
US10911817B2 (en) Information processing system
US9877056B1 (en) Compressed media with still images selected from a video stream
US20100254455A1 (en) Image processing apparatus, image processing method, and program
US11330263B1 (en) Machine learning based coded size estimation in rate control of video encoding
Ascenso et al. The jpeg ai standard: Providing efficient human and machine visual data consumption
JP5713797B2 (ja) 映像監視装置
US20240048738A1 (en) Methods, apparatuses, computer programs and computer-readable media for processing configuration data
US20220335560A1 (en) Watermark-Based Image Reconstruction
US8086056B2 (en) Encoding device and method, decoding device and method, and program
JP2008252262A (ja) 符号化装置及び動画像の変化点検出方法
JP2006020330A (ja) ビデオドキュメントを圧縮するためのプロセス及び装置
US8594195B2 (en) Method and apparatus for encoding and decoding at least one image frame that is artificially inserted into image sequence
JP2014011572A5 (ja) 動画像予測復号装置、方法及びプログラム
WO2017203555A1 (fr) Dispositif de codage, dispositif de photographie et programme
US20100027621A1 (en) Apparatus, method and computer program product for moving image generation
US11323730B2 (en) Temporally-overlapped video encoding, video decoding and video rendering techniques therefor
JP4235209B2 (ja) 動きベクトル検出装置および動きベクトル検出方法
JP4795141B2 (ja) 映像符号化合成装置、映像符号化合成方法及び映像伝送システム
US20230111528A1 (en) Information processing apparatus, information processing method, and non-transitory computer-readable storage medium
KR20190067577A (ko) 데이터 인코딩 및 디코딩 장치와 방법
JP2020080479A (ja) 動画記録再生装置、動画伝送システムおよび方法
JP4519569B2 (ja) 立体mpeg信号同期システム
US20230105436A1 (en) Generative adversarial network for video compression
US11197014B2 (en) Encoding apparatus, decoding apparatus, and image processing system

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16903037

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16903037

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP