WO2020027512A1 - Procédé de commande de suivi d'objet basé sur syntaxe destiné à une image comprimée par un appareil photo ptz - Google Patents

Procédé de commande de suivi d'objet basé sur syntaxe destiné à une image comprimée par un appareil photo ptz Download PDF

Info

Publication number
WO2020027512A1
WO2020027512A1 PCT/KR2019/009373 KR2019009373W WO2020027512A1 WO 2020027512 A1 WO2020027512 A1 WO 2020027512A1 KR 2019009373 W KR2019009373 W KR 2019009373W WO 2020027512 A1 WO2020027512 A1 WO 2020027512A1
Authority
WO
WIPO (PCT)
Prior art keywords
moving object
image
ptz camera
motion vector
compressed image
Prior art date
Application number
PCT/KR2019/009373
Other languages
English (en)
Korean (ko)
Inventor
이현우
정승훈
이성진
Original Assignee
이노뎁 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 이노뎁 주식회사 filed Critical 이노뎁 주식회사
Publication of WO2020027512A1 publication Critical patent/WO2020027512A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/695Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/137Motion inside a coding unit, e.g. average field, frame or block difference
    • H04N19/139Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/58Means for changing the camera field of view without moving the camera body, e.g. nutating or panning of optics or image sensors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/69Control of means for changing angle of the field of view, e.g. optical zoom objectives or electronic zooming

Definitions

  • the present invention generally relates to a technique for effectively tracking a moving object using a PTZ camera in a CCTV control system.
  • the present invention is a syntax (eg, motion vector, coding) obtained by parsing compressed image data rather than tracking and controlling an PTZ camera by identifying an object through complex image processing as in the prior art. It is related to a technology that can control the object tracking of a PTZ camera by extracting a region in which something meaningful movement exists in the image, that is, a moving object region and following the moving object region by using a small number of operations.
  • a syntax eg, motion vector, coding
  • Pan-Tilt-Zoom (PTZ) cameras are cameras whose supports can rotate up, down, left and right and adjust the zoom ratio of the lens.
  • the PTZ camera may move the surveillance area by a panning operation rotated 360 degrees in the horizontal direction and a tilting operation rotated at an angle in the vertical direction, and enlarge or take a photograph of the subject by changing the zoom ratio of the lens.
  • PTZ cameras have been used actively in the field of video control because they can be shot while changing targets or by tracking specific targets. Also, it can be used by combining general camera and PTZ camera. In this case, a general camera can be used to capture and observe a panoramic image over a relatively large area, while a PTZ camera performs object tracking and monitoring for a specific target. The object tracking operation of the PTZ camera can be controlled manually by the controller, but can also be controlled automatically.
  • Tracking control of a specific object on a PTZ camera requires analyzing the original image to identify the moving object and identifying the moving object in a series of image frames. This is because panning, tilting, and zooming the PTZ camera require knowing how the object is moving. Significant computation is required to identify moving objects and detect motion from compressed images such as H.264 AVC and H.265 HEVC generated by PTZ cameras or other CCTV cameras around them.
  • FIG. 1 is a block diagram illustrating a general configuration of a video decoding apparatus according to the H.264 AVC Technical Standard.
  • a video decoding apparatus according to H.264 AVC includes a parser 11, an entropy decoder 12, an inverse converter 13, a motion vector operator 14, a predictor 15, and a deblocking filter ( 16) is configured to include.
  • These hardware modules process compressed video sequentially to decompress and restore the original video data.
  • the parser 11 parses the motion vector and the coding type for the coding unit of the compressed image.
  • Such a coding unit is generally an image block such as a macroblock or a subblock.
  • FIG. 2 is a flowchart illustrating an object tracking control process of a PTZ camera in a conventional video analysis solution.
  • the compressed image is decoded according to H.264 AVC and H.265 HEVC, etc. (S10), and the frame images of the reproduced image are downscaled to a small image, for example, 320 ⁇ 240 (S20).
  • the reason for downscaling resizing is to reduce the processing burden in the subsequent process.
  • moving objects are extracted through image analysis and coordinates of the moving objects are calculated (S30).
  • by panning, tilting, and zooming the PTZ camera using the size and coordinates of the moving object the moving object is tracked and photographed (S40).
  • An object of the present invention is to provide a technique for effectively tracking a moving object using a PTZ camera in general CCTV control system.
  • a syntax eg, motion vector, coding type
  • an object tracking control method of a syntax-based PTZ camera for a compressed image includes a first step of obtaining a motion vector and a coding type for a coding unit by parsing a bitstream of the compressed image.
  • the object tracking control method of the PTZ camera calculating a global motion vector corresponding to the average value of the motion vector for each frame; And subtracting the global motion vector calculated for the frame with respect to the motion vector.
  • the global motion vector may be calculated by summing all the motion vectors acquired in the frame and dividing by the total number of image blocks belonging to the frame.
  • the object tracking control method of the PTZ camera includes a step of identifying a plurality of adjacent video blocks (hereinafter referred to as 'neighbor block') around the moving object area; B) comparing a motion vector value with a second preset threshold value for a plurality of neighboring blocks; C) additionally marking a neighboring block having a motion vector value exceeding a second threshold as a moving object region; D) additionally marking a neighboring block having a coding type of an intra picture among the plurality of neighboring blocks as a moving object region; The method may further include an e-step of performing interpolation on the plurality of moving object regions to additionally mark a predetermined number or less of unmarked image blocks surrounded by the moving object region as the moving object region.
  • the seventh step may be configured to perform panning and tilting control on the PTZ camera so that the tracked moving object region is located at a preset observation point in the compressed image.
  • the eighth step of the present invention may be configured to perform a zoom control on the PTZ camera so that the tracking target moving object region is a preset observation size in the compressed image.
  • the computer program according to the present invention is stored in the medium in order to execute the object tracking control method of the syntax-based PTZ camera for the compressed image as described above combined with hardware.
  • an object tracking control of a PTZ camera can be performed by identifying a moving object region from a CCTV compressed image without performing complicated processing such as decoding, downscale resizing, difference image acquisition, image analysis, and the like.
  • the object tracking control of the PTZ camera is possible even with a calculation amount of about 1/20 compared with the prior art, and thus, there is an advantage in that the number of PTZ camera accommodation channels of the CCTV control system can be greatly increased without large-scale investment.
  • FIG. 1 is a block diagram showing a general configuration of a video decoding apparatus.
  • Figure 2 is a flow chart showing the object tracking control process of the PTZ camera made in the prior art.
  • FIG. 3 is a flowchart illustrating an object tracking control process of a PTZ camera according to the present invention.
  • FIG. 4 is a flowchart illustrating an embodiment of a process of detecting effective motion from a compressed image in the present invention.
  • FIG. 5 is a diagram illustrating an example of a result of applying an effective motion region detection process according to the present invention to a CCTV compressed image.
  • FIG. 6 is a flowchart illustrating an example of a process of detecting a boundary region for a moving object region in the present invention.
  • FIG. 7 is a diagram illustrating an example of a result of applying a boundary area detection process according to the present invention to the CCTV image of FIG.
  • FIG. 8 is a diagram illustrating an example of a result of arranging a moving object region through interpolation with respect to the CCTV image of FIG. 7.
  • FIG. 9 is a flowchart illustrating an embodiment of a process of controlling an object to control a PTZ camera with respect to a moving object region to be tracked in the present invention.
  • FIG. 10 is a diagram for one example in which a unique ID is assigned to a moving object area in the present invention.
  • FIG. 11 illustrates an example in which size information is identified in a moving object area in the present invention.
  • FIG. 12 is a diagram illustrating an example in which location information is identified in a moving object area in the present invention.
  • FIG. 3 is a flowchart illustrating an object tracking control process of a PTZ camera according to the present invention.
  • the object tracking control process of the PTZ camera according to the present invention may be performed by an image analysis server which processes a compressed image generated by a PTZ camera in a system that processes a series of compressed images, for example, a CCTV control system.
  • syntax information obtained for each image block that is, a macro block and a sub block, by parsing a bitstream of the compressed image without decoding the compressed image, for example, a motion vector
  • the moving object area thus obtained does not accurately reflect the boundary of the moving object, but the processing speed is high and the reliability is higher than a certain level.
  • the moving object area can be tracked by panning, tilting, and zooming the PTZ camera based on the size and position of the moving object area (e.g., center coordinates). To perform the operation.
  • the moving object region can be extracted and the object tracking can be performed without decoding the compressed image.
  • the apparatus or software to which the present invention is applied should not perform the operation of decoding the compressed image, but the scope of the present invention is not limited.
  • Step S100 First, an effective motion that can be substantially recognized from the compressed image is detected from the compressed image based on the motion vector of the compressed image, and the image region in which the effective motion is detected is set as the moving object region.
  • data of a compressed image is parsed according to video compression standards such as H.264 AVC and H.265 HEVC to obtain a motion vector and a coding type for a coding unit.
  • video compression standards such as H.264 AVC and H.265 HEVC to obtain a motion vector and a coding type for a coding unit.
  • the size of the coding unit is generally about 64x64 to 4x4 pixels and may be set to be flexible.
  • the motion vectors are accumulated for a predetermined time period (for example, 500 msec) for each image block, and it is checked whether the motion vector accumulation value exceeds the first predetermined threshold (for example, 20). If such an image block is found, it is considered that effective motion has been found in the image block and marked as a moving object area. Accordingly, even if the motion vector is generated, if the cumulative value for a predetermined time does not exceed the first threshold, the image change is assumed to be insignificant and ignored.
  • a predetermined time period for example, 500 msec
  • Step S200 Detects how far the boundary region is to the moving object region detected in S100 based on the motion vector and the coding type. If a motion vector occurs above a second threshold (for example, 0) or a coding type is an intra picture by inspecting a plurality of adjacent image blocks centered on the image block marked as a moving object area, the corresponding image block is also moved. Mark as an object area. Through this process, the corresponding image block is substantially in the form of forming a lump with the moving object region detected in S100.
  • a second threshold for example, 0
  • a coding type is an intra picture by inspecting a plurality of adjacent image blocks centered on the image block marked as a moving object area, the corresponding image block is also moved. Mark as an object area.
  • an effective motion is found and there is a certain amount of motion in the vicinity of the moving object area, it is marked as a moving object area because it is likely to be a mass with the previous moving object area.
  • determination based on a motion vector is impossible. Accordingly, the intra picture located adjacent to the image block already detected as the moving object region is estimated as a mass together with the previously extracted moving object region.
  • Step S300 The interpolation is applied to the moving object areas detected at S100 and S200 to clean up the fragmentation of the moving object area.
  • the moving object area since it is determined whether the moving object area is the image block unit, even though it is actually a moving object (for example, a person), there is an image block that is not marked as the moving object area in the middle.
  • the phenomenon of dividing into may occur. Accordingly, if there are one or a few unmarked image blocks surrounded by a plurality of image blocks marked with the moving object region, they additionally mark the moving object region. By doing so, it is possible to make the mobile object region divided into several into one. The influence of such interpolation is clearly seen when comparing FIG. 7 and FIG.
  • Step S400 The moving object region is quickly extracted from each frame image constituting the compressed image based on the syntax (motion vector, coding type) of the coding unit through the above process.
  • steps S400 to S700 when a specific moving object region is identified as a target, object tracking control for the PTZ camera is performed using the extraction result of the moving object region.
  • a specific moving object region is identified as a tracking target among one or more moving object regions identified in the above process, which is referred to herein as a 'tracking target moving object region'.
  • the tracking object area may be implemented so that the control personnel can designate the control agent through a mouse operation or the like on the CCTV video control screen, or may be implemented so that the video control software discovers itself according to the identification condition set through the menu of the video control software. .
  • Step S500 Obtain location information and size information of the area to be tracked in the current frame image.
  • 11 and 12 illustrate examples in which size information and position information are identified for a moving object region.
  • the location information means a location in the image of the corresponding video block.
  • the location information may be set as the upper left coordinate as shown in FIG. 11 or may be set as the center coordinate as shown in FIG. 12.
  • CW clockwise direction
  • a preset observation size eg, 50% of the entire screen
  • a process of performing object tracking control of the PTZ camera based on the syntax of the compressed image will be described in detail with reference to FIGS. 4 to 12.
  • a process of identifying a moving object region by syntax from a compressed image will be described in detail with reference to FIGS. 4 to 8.
  • a process of using the identified moving object area for object tracking control for the PTZ camera will be described in detail with reference to FIGS. 9 to 12.
  • FIG. 4 is a flowchart illustrating an example of a process of detecting effective motion from a compressed image in the present invention
  • FIG. 5 is a diagram illustrating an example of a result of applying the effective motion region detection process according to the present invention to a CCTV compressed image.
  • the process of FIG. 4 corresponds to step S100 in FIG. 3.
  • Step S110 First, a coding unit of a compressed image is parsed to obtain a motion vector and a coding type.
  • a video decoding apparatus performs parsing (header parsing) and motion vector operations on a stream of compressed video according to a video compression standard such as H.264 AVC and H.265 HEVC. Through this process, the motion vector and coding type are parsed for the coding unit of the compressed image.
  • Step S120 Next, a global motion vector is subtracted for each motion vector.
  • the global motion vector means an average value of all motion vectors obtained from the corresponding frame.
  • a value obtained by summing the motion vectors acquired in the frame and dividing by the total number of image blocks belonging to the frame may be set as the global motion vector.
  • the captured image changes as a whole depending on the specific orientation.
  • the global motion vector is obtained with a significant value and reflects the directivity of the entire captured image.
  • the motion vector obtained by the PTZ operation is canceled and the motion vector remains with the direction removed from the whole image by the PTZ operation only for the image block with purely moving contents. . This allows the PTZ camera to extract and track moving objects while panning, tilting and zooming.
  • the global motion vector should not have any effect.
  • only a few image blocks are derived from the captured image, and most of the image blocks do not derive the motion vector.
  • the global motion vector is obtained with a very small value close to 0, subtracting the global motion vector for each motion vector does not have a significant effect.
  • Step S130 Acquire a motion vector cumulative value for a preset time (for example, 500 ms) for each of the plurality of image blocks constituting the compressed image.
  • This step is presented with the intention to detect if there are effective movements that are practically recognizable from the compressed image, such as driving cars, running people, and fighting crowds. Shaky leaves, ghosts that appear momentarily, and shadows that change slightly due to light reflections, though they are moving, are virtually meaningless objects and should not be detected.
  • a motion vector cumulative value is obtained by accumulating a motion vector in units of one or more image blocks for a predetermined time period (for example, 500 msec).
  • the image block is used as a concept including a macroblock and a subblock.
  • Steps S140 and S150 Comparing a motion vector cumulative value with respect to a plurality of video blocks with a preset first threshold value (eg, 20), and moving the image block having a motion vector cumulative value exceeding the first threshold value.
  • an image block having a predetermined motion vector accumulation value is found as described above, it is considered that something significant movement, that is, effective movement, is found in the image block and is marked as a moving object region.
  • a human run is to select and detect a movement that is worth the attention of the control personnel.
  • the cumulative value for a predetermined time is small enough not to exceed the first threshold, the change in the image is assumed to be small and insignificant and is neglected in the detection step.
  • FIG. 5 is an example illustrating a result of detecting an effective motion region from a CCTV compressed image through the process of FIG. 4.
  • an image block having a motion vector accumulation value equal to or greater than a first threshold is marked as a moving object area and displayed as a bold line area.
  • the sidewalk block, the road, and the shadowed part are not displayed as the moving object area, while the walking people or the driving car are displayed as the moving object area.
  • FIG. 6 is a flowchart illustrating an example of a process of detecting a boundary region of a moving object region in the present invention
  • FIG. 7 is a boundary region of FIG. 5 with respect to the CCTV image of FIG. Figure 1 shows an example of the results of further applying the detection process.
  • the process of FIG. 6 corresponds to step S200 in FIG. 3.
  • the moving object is not properly marked and only a portion of the moving object is marked. In other words, if you look at a person walking or driving a car, you will find that not all of the objects are marked, but only some blocks. In addition, it is also found that a plurality of moving object areas are marked for one moving object. This means that the criterion of the moving object region adopted in (S100) above was useful for filtering out the general region but was a very strict condition. Therefore, it is necessary to detect the boundary of the moving object by looking around the moving object area.
  • Step S210 First, a plurality of adjacent image blocks are identified based on the image blocks marked as moving object areas by the previous S100. In the present specification, these are referred to as 'neighborhood blocks'. These neighboring blocks are portions that are not marked as the moving object region by S100, and the process of FIG. 6 examines them further to determine whether any of these neighboring blocks may be included in the boundary of the moving object region.
  • Steps S220 and S230 compare a motion vector value with respect to a plurality of neighboring blocks with a second preset threshold (eg, 0), and mark the neighboring block having a motion vector value exceeding the second threshold as a moving object region. do. If the movement is located adjacent to the area of the moving object where effective motion that is practically meaningful is found and a certain amount of movement is found for itself, the image block is likely to be a block with the area of the adjacent moving object due to the characteristics of the photographed image. . Therefore, such neighboring blocks are also marked as moving object regions.
  • a second preset threshold eg, 0
  • Step S240 Also, the coding type is an intra picture among the plurality of neighboring blocks as a moving object region.
  • an intra picture since a motion vector does not exist, it is fundamentally impossible to determine whether a motion exists in a corresponding neighboring block based on the motion vector. In this case, it is safer for the intra picture located adjacent to the image block already detected as the moving object region to maintain the settings of the previously extracted moving object region.
  • FIG. 7 is a diagram visually illustrating a result of applying a boundary region detection process to a CCTV compressed image.
  • a plurality of image blocks marked as a moving object region through the above process are indicated by a bold line. Referring to FIG. 7, it was found that the moving object area was further extended to the vicinity of the moving object area indicated by the bold line area in FIG. 5, so that the moving object area was covered enough to be compared with the image taken by CCTV. can do.
  • FIG. 8 is a diagram illustrating an example of a result of arranging a moving object region through interpolation according to the present invention for a CCTV image image to which the boundary region detection process illustrated in FIG. 7 is applied.
  • Step S300 is a process of arranging the division of the moving object area by applying interpolation to the moving object areas detected in the previous steps S100 and S200.
  • an unmarked image block is found between the moving object regions indicated by the bold lines. If there is an unmarked image block in the middle, it can be regarded as if they are a plurality of individual moving objects. When the moving object region is fragmented in this way, the result of step S500 may be inaccurate, and the number of moving object regions may increase, thereby complicating the process of steps S500 to S700.
  • the present invention if there is one or a few unmarked image blocks surrounded by a plurality of image blocks marked as the moving object region, this is marked as the moving object region, which is called interpolation.
  • interpolation in contrast to FIG. 7, all of the non-marked image blocks existing between the moving object regions are marked as moving object regions.
  • the moving object region properly reflects the actual image situation through the boundary region detection process and the interpolation process.
  • FIG. 5 if the block is marked as a bold line region, a large number of very small objects are moved as if moving in the image screen, which is inconsistent with reality.
  • it is determined as a block marked with the bold line area in Fig. 8 will be treated as a few moving objects having a certain volume to reflect the actual scene similarly.
  • FIG. 9 is a flowchart illustrating an example of a process of tracking and tracking an object of a PTZ camera with respect to a moving object region to be tracked in the present invention, and corresponds to steps S500 to S700 in FIG. 3.
  • the present invention extracts a moving object region based on syntax information directly obtained from a compressed image.
  • the process of acquiring and analyzing the difference image with respect to the original image by decoding the compressed image of the prior art is unnecessary, and according to the inventor's test, the processing speed is improved up to 20 times.
  • this approach has the disadvantage of poor precision.
  • this feature is also reflected in the process of tracking an object based on the moving object area of the PTZ camera.
  • Step S410 First, a unique ID is managed for a moving object region extracted based on a syntax from a compressed image.
  • the moving object region is derived from each image frame constituting the compressed image. This is not a result of analyzing the image content and determining that it is an object, but a concept of a chunk of an image that seems to be moving in the image frame.
  • the moving object region set as the tracking target in the previous frame must be continuously identified in the next frame when the image frame is advanced. In other words, you need to go beyond the idea of a chunk of an image in that frame and treat it like an object. Therefore, by assigning and managing a unique ID for the moving object region derived from the compressed image, the moving object region can be treated like an object rather than a region, and the compressed image is passed over a series of frame images. You can track the movement of a specific object.
  • Unique ID management of the mobile object area is handled in the following three cases. If a unique ID is assigned in the previous frame and the current frame image identifies a moving object region that is assigned an ID (S411), the moving object region identified as an unassigned ID in the current frame image because it has never been identified in the previous frame. In the case of newly assigning a unique ID to the SID, a mobile object region in which a unique ID is allocated in the previous frame but disappeared from the current frame image is identified and revokes the allocated unique ID (S413).
  • the image block is a moving object region without checking the contents of the original image, it is not possible to confirm whether the chunks of the moving object region are actually the same in the image frames before and after. That is, since the contents of the moving object area are not known, such a change cannot be identified, for example, when the cat is replaced by a dog between the front and rear frames at the same point. However, considering that the time interval between frames is very short and that the observation object of the CCTV camera moves at a normal speed, the possibility of this happening can be excluded.
  • the present invention estimates that the ratio or number of image blocks overlapping between the chunks of the moving object region in the front and back frames is equal to or greater than a predetermined threshold. According to this approach, even if the contents of the image are not known, it is possible to determine whether the previously identified moving object region is moved or whether a new moving object region is newly discovered or the existing moving object region is lost. This judgment is lower in accuracy than the prior art, but can greatly increase the data processing speed, which is advantageous in practical applications.
  • step S411 when the moving object region to which the Unique ID has been assigned in the previous frame is identified in the current frame image, the previously allocated Unique ID is allocated to the corresponding moving object region.
  • the identification may be marked in the management database of the Unique ID.
  • step S412 if a new object is unidentified in the current frame image because it has not been identified in the previous frame, a unique ID is newly assigned to the mobile object region. This means that a new moving object is found in the image.
  • 11 and 12 illustrate examples in which unique IDs are allocated to three moving object areas in a CCTV photographing image.
  • step S413 when the moving object region in which the unique ID is assigned in the previous frame of the compressed image disappears from the current frame image, the moving object region is allocated in step S412 with respect to the previous frame for the moving object region. Revoke unique ID maintained in S411). In other words, the moving object that has been discovered and managed before has disappeared from the image.
  • Step S420 In step S400 of FIG. 3, a specific moving object region is set as the tracking target moving object region in the CCTV photographed image by, for example, a mouse operation of a controller.
  • the tracked moving object region may be set in the current frame or may be set in the previous frame.
  • a unique ID assigned to the tracked moving object region is identified in the current frame image, which is sometimes referred to as 'tracking unique ID'.
  • the unique ID is allocated and managed through the operation S410 for the moving object region identified in each frame image in the process of processing the frame image one by one in the compressed image.
  • the unique ID is managed by classifying the moving object area identified in the previous frame, the moving object area newly identified in the current frame, and the moving object area identified in the previous frame but disappearing from the current frame. Accordingly, the unique ID of the tracking target object region may be identified unless the tracking target object region disappears from the current frame.
  • Steps S430 to S450 Derived the position information and the size information of the moving object region to which the tracking target Unique ID value is allocated in the current frame image, which corresponds to step S500 of FIG. 3.
  • 11 and 12 illustrate examples in which size information and position information are identified for a moving object region.
  • the location information means a location in the image of the corresponding video block.
  • the location information may be set as the upper left coordinate as shown in FIG. 11 or may be set as the center coordinate as shown in FIG. 12.
  • As the size information, as shown in FIG. 11, a size of a rectangle that optimally surrounds a moving object area may be set.
  • panning and tilting control of the PTZ camera is performed using the location information of the tracking target moving object region
  • zoom control of the PTZ camera is performed using the size information of the tracking target moving object region. This process has been described above in steps S600 and S700 of FIG. 3.
  • the present invention may be embodied in the form of computer readable codes on a computer readable nonvolatile recording medium.
  • Such nonvolatile recording media include various types of storage devices, such as hard disks, SSDs, CD-ROMs, NAS, magnetic tapes, web disks, and cloud disks. Forms that are implemented and executed may also be implemented.
  • the present invention may be implemented in the form of a computer program stored in a medium in combination with hardware to execute a specific procedure.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

La présente invention concerne généralement une technique permettant de suivre efficacement un objet mobile à l'aide d'un appareil photo PTZ dans un système de commande de CCTV. Plus spécifiquement, contrairement à la technique classique pour une image comprimée dans laquelle un objet est identifié par l'intermédiaire d'un traitement d'image compliqué pour permettre une commande de suivi d'un appareil photo PTZ, la présente invention concerne une technique dans laquelle une syntaxe (par exemple, un vecteur de mouvement et un type de codage) obtenue par analyse de données d'image comprimées est utilisée pour extraire une zone dans laquelle un mouvement significatif existe dans une image, c'est-à-dire une zone d'objet mobile, puis amener la zone d'objet mobile à suivre, de façon à permettre la commande d'un suivi d'objet d'un appareil photo PTZ avec une petite quantité de calcul. Conformément à la présente invention, il est possible de commander le suivi d'objet d'un appareil photo PTZ même avec une quantité de calcul correspondant à environ 1/20 de celle de la technique classique, de telle sorte qu'il est avantageux que le nombre de canaux de réception d'appareil photo PTZ d'un système de commande de CCTV puisse être significativement augmenté même sans investissement important.
PCT/KR2019/009373 2018-07-30 2019-07-29 Procédé de commande de suivi d'objet basé sur syntaxe destiné à une image comprimée par un appareil photo ptz WO2020027512A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2018-0088402 2018-07-30
KR1020180088402A KR102080456B1 (ko) 2018-07-30 2018-07-30 압축영상에 대한 신택스 기반의 ptz 카메라의 객체 추적 제어 방법

Publications (1)

Publication Number Publication Date
WO2020027512A1 true WO2020027512A1 (fr) 2020-02-06

Family

ID=69232271

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2019/009373 WO2020027512A1 (fr) 2018-07-30 2019-07-29 Procédé de commande de suivi d'objet basé sur syntaxe destiné à une image comprimée par un appareil photo ptz

Country Status (2)

Country Link
KR (1) KR102080456B1 (fr)
WO (1) WO2020027512A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113163254A (zh) * 2021-04-06 2021-07-23 广州津虹网络传媒有限公司 直播图像处理方法、装置及电子设备

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102462837B1 (ko) * 2022-07-14 2022-11-04 주식회사 에이스엘이디 원/근거리 객체를 자동추적 및 인식하기 위한 인공지능 영상 촬영 시스템 및 그 구동방법

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09252467A (ja) * 1996-03-18 1997-09-22 Toshiba Corp 移動体検出装置
JP2002262296A (ja) * 2001-02-28 2002-09-13 Mitsubishi Electric Corp 移動物体検出装置、および画像監視システム
US6553150B1 (en) * 2000-04-25 2003-04-22 Hewlett-Packard Development Co., Lp Image sequence compression featuring independently coded regions
KR20090093817A (ko) * 2008-02-29 2009-09-02 가시오게산키 가부시키가이샤 패닝화상을 촬상하기 위한 패닝모드를 구비한 촬상장치
KR20170102424A (ko) * 2016-03-01 2017-09-11 엑시스 에이비 팬 및 틸트 제어가 가능한 카메라를 제어하기 위한 방법 및 디바이스

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09252467A (ja) * 1996-03-18 1997-09-22 Toshiba Corp 移動体検出装置
US6553150B1 (en) * 2000-04-25 2003-04-22 Hewlett-Packard Development Co., Lp Image sequence compression featuring independently coded regions
JP2002262296A (ja) * 2001-02-28 2002-09-13 Mitsubishi Electric Corp 移動物体検出装置、および画像監視システム
KR20090093817A (ko) * 2008-02-29 2009-09-02 가시오게산키 가부시키가이샤 패닝화상을 촬상하기 위한 패닝모드를 구비한 촬상장치
KR20170102424A (ko) * 2016-03-01 2017-09-11 엑시스 에이비 팬 및 틸트 제어가 가능한 카메라를 제어하기 위한 방법 및 디바이스

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113163254A (zh) * 2021-04-06 2021-07-23 广州津虹网络传媒有限公司 直播图像处理方法、装置及电子设备

Also Published As

Publication number Publication date
KR20200013335A (ko) 2020-02-07
KR102080456B1 (ko) 2020-02-24

Similar Documents

Publication Publication Date Title
WO2020027513A1 (fr) Système d'analyse d'image basé sur la syntaxe pour image compressée, et procédé de traitement d'interfonctionnement
KR100432870B1 (ko) 침입물체 감시방법 및 침입물체 감시시스템
WO2019124635A1 (fr) Procédé orienté syntaxe de détection d'une intrusion d'objet dans une vidéo comprimée
WO2020027607A1 (fr) Dispositif de détection d'objets et procédé de commande
WO2017034177A1 (fr) Système de mise en application pour freiner le stationnement et l'arrêt illégaux à l'aide d'images provenant de différentes caméras et système de commande le comprenant
KR102187376B1 (ko) 딥러닝 이미지 분석과 연동하는 신택스 기반의 선별 관제 제공 방법
WO2018135906A1 (fr) Caméra et procédé de traitement d'image d'une caméra
WO2020027512A1 (fr) Procédé de commande de suivi d'objet basé sur syntaxe destiné à une image comprimée par un appareil photo ptz
WO2016064107A1 (fr) Procédé et appareil de lecture vidéo sur la base d'une caméra à fonctions de panoramique/d'inclinaison/de zoom
KR101861245B1 (ko) 멀티 센서 cctv 파노라마 영상에서 움직임 검출 시스템 및 방법
WO2019039661A1 (fr) Procédé d'extraction basée sur la syntaxe d'une région d'objet mobile d'une vidéo compressée
JPH09252467A (ja) 移動体検出装置
WO2020027511A1 (fr) Procédé de génération d'une carte thermique basée sur la syntaxe pour une image compressée
WO2020171388A2 (fr) Procédé d'identification d'un objet ayant un mouvement anormal dans une image compressée à l'aide d'une trajectoire et d'un motif de vecteur de mouvement
WO2019124636A1 (fr) Procédé basé sur la syntaxe pour détecter une circulation routière à contresens dans une vidéo compressée
KR102015082B1 (ko) 압축영상에 대한 신택스 기반의 객체 추적 방법
KR102061915B1 (ko) 압축영상에 대한 신택스 기반의 객체 분류 방법
KR100656345B1 (ko) 두 대의 카메라를 이용한 이동 물체 추적 장치 및 방법
JP2001094968A (ja) 映像処理装置
EP3413268B1 (fr) Détection de contre-courant dans une vidéo comprimée
WO2019124632A1 (fr) Procédé orienté syntaxe de détection d'objet de flânerie dans une vidéo comprimée
WO2019124633A1 (fr) Procédé orienté syntaxe de détection d'objet d'escalade de mur dans une vidéo comprimée
JP3758511B2 (ja) 物体検出装置及び物体検出プログラム
KR20140134505A (ko) 영상 객체 추적 방법
KR102178952B1 (ko) 압축영상에 대한 신택스 기반의 mrpn-cnn을 이용한 객체 분류 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19844742

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19844742

Country of ref document: EP

Kind code of ref document: A1