WO2005117448A1 - 移動物体検出装置および移動物体検出方法 - Google Patents
移動物体検出装置および移動物体検出方法 Download PDFInfo
- Publication number
- WO2005117448A1 WO2005117448A1 PCT/JP2005/009665 JP2005009665W WO2005117448A1 WO 2005117448 A1 WO2005117448 A1 WO 2005117448A1 JP 2005009665 W JP2005009665 W JP 2005009665W WO 2005117448 A1 WO2005117448 A1 WO 2005117448A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- moving object
- video
- area
- object detection
- information
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/48—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
Definitions
- the present invention relates to a moving object detecting apparatus and method for detecting a moving object from a video stream generated by encoding a video.
- the moving object detection device extracts a motion vector used in a motion prediction compensation encoding method for decoding a video stream, and regards the motion vector as a motion of an object in a certain area. It detects a moving object at high speed.
- FIG. 1 shows a conventional moving object detection device described in Patent Document 1.
- the coding mode, motion compensation mode, and motion vector information of the image block decoded by the variable length decoding unit 1801 and the pattern information detected by the pattern information detection unit 1802 are moving.
- the object is sent to the object detection processing unit 1803.
- the moving object detection processing unit 1803 uses these pieces of information to determine whether or not this image block is a moving object. This determination is performed using a motion vector, a spatial similarity determination, a temporal similarity determination, and the like.
- Patent Document 1 JP-A-10-75457
- An object of the present invention is to convert an image into a reduced image, a horizontal component, a vertical component, and a diagonal component.
- Moving object capable of performing high-speed, high-accuracy, low-processing-load detection of a moving object from a video stream that has been video-coded using motion prediction compensation coding It is to provide a detection device and method.
- the moving object detection device of the present invention extracts motion information from a video stream that has been video-encoded using hierarchical coding and motion prediction / compensation coding, which divide the video into a plurality of layers and encode it.
- a moving object detection method is a method for detecting a moving object in a video stream.
- the moving object detection device for detecting the moving object executes the moving object detection method by dividing an image into a plurality of layers. Extracting video stream power and motion information that have been video-encoded using hierarchical coding and motion prediction compensation coding; extracting the video stream power and edge information; and extracting the extracted motion information. And detecting the moving object using the edge information.
- a band division method for dividing an image into a reduced image, a horizontal component, a vertical component, and a diagonal component, and a video encoding using a motion prediction compensation encoding It is possible to detect the contour of an object moving at high speed, high accuracy, and low processing load without decoding the video from the video stream. At the same time, video decoding can be performed.
- FIG. 1 is a diagram showing a configuration of a conventional moving object detection device
- FIG. 2 is a diagram showing a configuration of a video decoding device according to Embodiment 1 of the present invention.
- FIG. 3 is a conceptual diagram of bit plane encoding according to Embodiment 1 of the present invention.
- FIG. 4 is a flowchart showing an operation of the video decoding device according to the first embodiment of the present invention.
- FIG. 5 is a flowchart showing an operation of a moving object detection process of the video decoding device according to the first embodiment of the present invention.
- FIG. 6 is a stream structure diagram of an enhancement layer according to the first embodiment of the present invention.
- FIG. 7 is a stream structure diagram of a bit plane k of an enhancement layer according to the first embodiment of the present invention.
- FIG. 8 is a stream structure diagram of bit plane k in enhancement layer region j according to Embodiment 1 of the present invention.
- FIG. 9 is a diagram showing a stream structure of a base layer according to the first embodiment of the present invention.
- FIG. 10 is a diagram showing a stream structure of an area j of a base layer according to the first embodiment of the present invention.
- FIG. 11A is a diagram illustrating an example of a horizontal component in an 8 ⁇ 8 pixel region according to the first embodiment of the present invention
- FIG. 11B is a diagram illustrating an example in an 8 ⁇ 8 pixel region according to the first embodiment of the present invention
- FIG. 11C is a diagram showing another example of the horizontal component
- FIG. 11C is a diagram showing still another example of the horizontal component in the 8 ⁇ 8 pixel area according to the first embodiment of the present invention.
- FIG. 12 is a diagram showing a configuration of a video surveillance system according to Embodiment 2 of the present invention.
- FIG. 13 is a diagram showing a configuration of an automatic tracking camera according to Embodiment 2 of the present invention.
- FIG. 14 is a diagram showing a configuration of a video encoding device according to Embodiment 2 of the present invention.
- FIG. 15 is a flowchart showing the operation of the automatic tracking camera according to Embodiment 2 of the present invention.
- FIG. 16 is a flowchart showing the operation of the video encoding device according to the second embodiment of the present invention.
- FIG. 17 is a flowchart showing an operation of the video monitoring device according to the second embodiment of the present invention.
- FIG. 18 is a sequence diagram showing an operation of the video monitoring system according to the second embodiment of the present invention.
- FIG. 19 is a diagram showing a configuration of a video decoding device according to Embodiment 3 of the present invention.
- FIG. 20 is a flowchart showing the operation of the video decoding device according to the third embodiment of the present invention.
- Embodiment 1 is an application of the moving object detection method and device according to the present invention to a video decoding device. That is, at the same time that the video stream is decoded, a moving object in the video can be detected at high speed and with high accuracy.
- This video stream is composed of a base layer and an enhancement layer.
- the base layer can be decoded independently to obtain a low-resolution video.
- the enhancement layer improves the image quality of the base layer and This is additional information from which an image can be obtained, and includes horizontal, vertical, and diagonal edge components (horizontal, vertical, and diagonal components).
- the input image is divided into bands to generate a reduced image, a horizontal component, a vertical component, and a diagonal component.
- the reduced image is encoded as a base layer capable of independently decoding a video by motion prediction compensation encoding.
- the horizontal direction component, the vertical direction component, and the diagonal direction component are encoded by a bit plane encoding as an enhancement layer for encoding a video obtained by decoding the base layer with high image quality.
- band division an image is divided into four components: a reduced image, a horizontal component, a vertical component, and a diagonal component.
- This band division is performed by wavelet transform or by using a combination of a high-pass filter, a low-pass filter, and a down-sampler.
- the reduced image, the horizontal component, the vertical component, and the diagonal component obtained by band division can be restored to the original image by band combination.
- the horizontal component, the vertical component, and the diagonal component obtained by this band division are differences in pixel values from neighboring pixels that can be mathematically calculated, and do not necessarily represent the contour of the object. . For example, in a black-and-white horizontal stripe pattern, a strong vertical component appears as a horizontal line at the boundary between the colors.
- FIG. 2 is a block diagram showing a configuration of a video decoding device 100 according to Embodiment 1 to which the moving object detection method and device of the present invention are applied.
- the video decoding device 100 includes a stream input unit 101, base layer decoding 1
- an enhancement layer decoding unit 103 an enhancement layer decoding unit 103, a band synthesis unit 104, a video output unit 105, a moving object detection unit 106, and a detection result output unit 107.
- the base layer decoding unit 102, the enhancement layer decoding unit 103, and the band synthesizing unit 104 correspond to the video decoding unit of the present invention, and the base layer decoding unit 102 corresponds to the motion information extracting unit.
- the enhancement layer decoding unit 103 corresponds to the edge information extraction unit, and the moving object detection unit
- 106 corresponds to a moving object detecting means.
- the video decoding means generates and outputs a video by decoding the input video stream.
- the motion information extraction means extracts and transfers the input video stream power motion information. Output to the moving object detecting means.
- the edge information extracting means extracts the input video stream power margin information and outputs it to the moving object detecting means.
- the moving object detecting means detects the input edge information and the motion information force and the moving object.
- FIG. 4 is a flowchart showing the operation of video decoding apparatus 100 according to Embodiment 1 shown in FIG.
- the control program stored in a storage device (for example, a ROM or a flash memory) is executed by a CPU (not shown), whereby the control program is executed by software by executing the program. It is also possible to do so.
- the stream input unit 101 inputs a video stream from outside the video decoding device 100, and a base layer of the video stream is input to the base layer decoding unit 102, and an enhancement layer is subjected to enhanced layer decoding.
- the output is output to the dangling unit 103 (step S301).
- base layer decoding section 102 extracts motion information from the base layer input from stream input section 101, and outputs the extracted motion information to moving object detection section 106. Further, enhancement layer decoding section 103 extracts enhancement layer power edge information input from stream input section 101 and outputs the information to moving object detection section 106. Then, the moving object detecting unit 106 detects a moving object using the motion information and the edge information input from the base layer decoding unit 102 and the enhancement layer decoding unit 103, and generates and detects a moving object detection result. The result is output from the result output unit 107 and the band synthesis unit 104 (step S302).
- the video may or may not include a moving object, and if so, may include only one moving object or may include a plurality of moving objects.
- step S302 the moving object detection processing in step S302 will be described in detail.
- FIG. 5 is a flowchart illustrating an example of a procedure of the moving object detection process in FIG.
- step S401 an edge information extraction process is performed.
- the enhancement layer decoding unit 103 extracts a code including information up to a specific bit plane from the enhancement layer power input from the stream input unit 101, generates edge information, and generates a moving object detection unit 106. Output to
- bit plane encoding will be described.
- the bit plane is a bit string in which only the same bit positions of some numerical data represented by binary numbers are arranged.
- the method of coding for each bit plane is called bit plane coding.
- FIG. 3 is a diagram showing the concept of bit plane coding, and the description will proceed assuming that it represents a region having a horizontal component.
- one column represents one pixel of the horizontal component represented by a binary number (pixel 1, pixel 2).
- One row represents a bit plane (bit plane 1, bit plane 2) in an area having a horizontal component, that is, only the same bits of each pixel are collected.
- bit plane the higher the bit plane, the stronger the edge of the horizontal component can be expressed.
- the edge information is obtained by encoding the information on the most significant bit plane force up to a specific bit plane. For example, it includes information such as the code amount for each bit plane up to a specific bit plane for each area of 8 ⁇ 8 pixels or 16 ⁇ 16 pixels.
- the bit plane coding is performed such that the code length is shortened when the number of “0s” is large. Therefore, the code length of the bit plane in the horizontal component, the vertical component, and the diagonal component region increases as the number of “1” s increases.
- FIG. 6 shows a data structure of the enhancement layer according to the present embodiment.
- the enhancement layer shown in FIG. 6 is a code for one image, and includes information on n bit planes and m regions.
- the enhancement layer for one image holds the header information 501 of the image, the bit plane representing the most significant bit plane, and the information 502 of the least significant bit plane n.
- FIG. 7 shows the data structure of bit plane k of the enhancement layer in FIG. 6.
- Bit plane k of the enhancement layer includes header information 601 of the bit plane, and code 602 of bit plane k of region 1 to region m. .
- FIG. 8 shows the data structure of the bit plane k of the area j of the enhancement layer in FIG. 7.
- the bit plane k of the area j of the enhancement layer has the code 701 of the pixel component of the corresponding area and the code of the area. It includes a termination signal 702 that indicates termination.
- the bit plane information is extracted from the video stream in order from the most significant bit plane to the specific bit plane, and the end signals of those areas are sequentially searched. It is only necessary to count the code length. Therefore, the enhancement layer decoding unit 103 can generate edge information at high speed.
- step S402 a motion information extraction process is performed. Specifically, the base layer decoding unit 102 extracts the base layer force motion vector information input from the stream input unit 101, generates motion information, and outputs it to the moving object detection unit 106.
- This motion information is used for motion prediction compensation of the base layer, and includes information on force that is motion prediction compensation code for each area or intra-frame coding, and information on motion level.
- the information includes the size and direction, information on the image referred to by the motion vector, and information on whether the entire image is a motion prediction compensation code or an intra-frame code.
- FIG. 9 shows a data structure of a base layer according to the present embodiment.
- the base layer shown in FIG. 9 is a code for one image and includes information of m regions. That is, the base layer for one image includes the header information 801 of the image and the information 802 of the areas 1 to m.
- Fig. 10 shows the data structure of the base layer region p in Fig. 9, where the base layer region p indicates that the region header information 901, the motion vector 902, the pixel component code 903, and the code of the region have been completed. Including termination signal 904.
- Extraction of the motion vector only requires searching the video stream for header information 901 and end signal 904 of those areas, and decoding only the motion vector 902 whose positional force is also at a fixed position. Thereby, base layer decoding section 102 can generate motion information at high speed.
- step S403 detection processing of the contour of the moving object is performed. Specifically, the moving object detection unit 106 detects a contour area of the moving object using the motion information and the edge information input from the base layer decoding unit 102 and the enhancement layer decoding unit 103, The result is stored in the moving object detection unit 106.
- a code length obtained from a bit plane of a horizontal component, a vertical component, and a diagonal component for a certain area, for example, each code from the most significant bit plane to a three-bit plane Condition 1 is that the total code length of the quantity is greater than or equal to the threshold A.
- this threshold A is a reference value for determining a weak edge.
- the condition 2 is that the total code length of the above-mentioned area is equal to or less than the threshold value B.
- This threshold value B is a reference value for identifying an image that is not an edge such as a striped pattern.
- FIGS. 11A to 11C show examples of horizontal components in an 8 ⁇ 8 pixel area, respectively.
- pixel values are represented by binary values, and the most significant bit plane force is black if it contains "1" by a specific bit plane, and white if it does not contain "1".
- Fig. 11A shows the horizontal component when noise or small points exist in the area
- Fig. 11B shows the horizontal component when a vertical line exists in the area
- Fig. 11C shows the entire horizontal component. For example, it shows the horizontal component when it is a part of a stripe pattern.
- 11A, 11B, and 11C in ascending order of the number of non-zero values included in the regions. The same applies to the vertical component and the diagonal component.
- the threshold value A is 8 and the threshold value B is 32, it can be determined that the area shown in FIG. 11B where the relationship of the threshold value A and the total value ⁇ the threshold value B is satisfied includes a line appearing in the contour of the object. . Note that threshold A is equal to threshold B.
- threshold value A may be used, and it may be determined that an area where the relationship of threshold value A ⁇ the total value is satisfied includes a line that appears in the contour of the object.
- a certain area determined as a contour is a force that is a contour of a moving object is determined by whether or not the following condition 3 or condition 4 is satisfied.
- condition 3 is that the magnitude of the motion vector of the area is smaller than the threshold value C, and the movement of the target moving object needs to move to a certain degree or more.
- Condition 4 is that the magnitude of the difference vector between the motion vector of the area and the surrounding motion vector is smaller than the threshold value D. This determines whether the moving object moves in the same way as the surroundings. The number of surrounding motion vectors need not be one. Condition 4 in that case will be described. First, a plurality of surrounding motion vectors are extracted. The magnitude of the difference vector from the motion vector of the area is obtained. Condition 4 in this case is that the sum of the magnitudes of the difference vectors is less than the threshold value D.
- condition 4 the following condition other than the above can be assumed for the condition 4.
- the sum of squares of the difference between the X direction component (horizontal component) of the motion vector of the region and the surrounding region and the Y direction component (vertical method component) The variance can also be used as a reference, calculated as the sum of the square sums of the differences.
- Condition 4 in this case is that the variance is less than threshold D. If condition 4 is satisfied, the motion vector of the area is assumed to have the same direction and size as the surroundings, and it is determined that the area is not a moving object.
- the calculation of the variance is not limited to this, and the variance may be calculated as a value obtained by summing the product of the absolute value of the difference between the magnitudes of the motion vectors and the absolute value of the angle difference in the surrounding area. It is not limited to these as long as it can be determined whether or not the motion vector of the region has a different direction and size from the surrounding motion vectors.
- the condition 4 or the condition 5 it is determined that the area is not a moving object area. It should be noted that, as in the case of intra-frame encoding of the entire image, the motion vector is not included! / In the frame, the outline is not determined, and the frame including the motion vector is waited for. This is a force that cannot detect motion from a frame without a motion vector.
- the moving object detection unit 106 determines that the region satisfying the condition 3 or the condition 4 among the regions determined as the object contour from the above conditions 1 and 2 is not the contour of the moving object. This is because the contour of a moving object moves at a different speed from the surroundings.
- step S404 detection processing of the inside of the moving object is performed.
- the moving object detection unit 106 detects an area inside the moving object using the motion information input from the base layer decoding unit 102 and the stored detection result of the outline of the moving object. .
- the detection result of the internal area is stored in the moving object detection unit 106.
- condition for determining that a certain area is inside the moving object is when the following condition 5 or condition 6 is satisfied.
- Condition 5 is that the moving object is in the vicinity of the contour or the area determined to be inside the moving object.
- the variance in the magnitude and direction of the motion vector is less than the threshold value E.
- the threshold value E is a reference value when it is determined that the contour and the inside of the moving object move at the same speed.
- Condition 6 is that the moving object is surrounded by a contour or a region determined to be inside, which is a force in which the inside of the moving object is surrounded by the contour.
- step S405 processing for removing erroneous detection of a moving object is performed.
- the moving object detection unit 106 removes the erroneously detected area from the stored detection results of the outline of the moving object and the internal area, generates a moving object detection result, and outputs a detection result output unit 107 and a band synthesis unit 104. Output to
- the condition for determining that the area is an erroneously detected area is that there are few areas around the moving object that are determined to be contours or inside. If an extremely small moving object is detected, erroneous detection is possible. It is because the nature is high.
- the moving object detection unit 106 generates a region force moving object detection result of the moving object obtained as described above.
- the moving object detection result is, for example, as follows.
- the first is information describing, for each region, whether the region is a moving object or not.
- the method of detecting a moving object is not limited to the method of detecting a moving object using a motion vector, and other methods may be used in combination with the edge information of the present invention.
- the moving object detection method of the present embodiment if the base layer includes a motion vector and the enhancement layer includes a code up to a bit plane of a certain bit position, transmission is performed at a low bit rate. Therefore, even if the image quality is poor, it is possible to detect a moving object at high speed, with high accuracy, and with a low processing load.
- step S303 the result of detecting the moving object is output.
- the detection result output unit 107 outputs the coordinates of the area of the moving object input from the moving object detection unit 106 to the outside.
- step S 304 base layer decoding processing is performed.
- the base layer The decoding unit 102 performs motion prediction compensation decoding on the base layer of the video stream input from the stream input unit 101, generates a reduced image, and outputs the reduced image to the band synthesis unit 104.
- step S 305 enhancement layer decoding processing is performed.
- the enhancement layer decoding unit 103 performs bit plane decoding on the enhancement layer of the video stream input from the stream input unit 101 to generate a horizontal component, a vertical component, and a diagonal component, Output to band synthesis section 104.
- band combining section 104 combines the reduced image input from base layer decoding section 102 and the horizontal, vertical, and diagonal components input from enhancement layer decoding section 103 into a band. Then, a decoded image is generated and output to the video output unit 105. Further, band combining section 104 may use the moving object detection result input from moving object detecting section 106 to emphasize a region including the moving object in the decoded image.
- the band synthesis unit 104 performs processing such as coloring the decoded video only in the area of the moving object area or enclosing the moving object area with a frame. Further, the value of all the pixels of the reduced image obtained by decoding the base layer may be set to “0” to generate an image having only the contour by performing band synthesis, and further, the area of the moving object region may be emphasized.
- step S307 a video output process is performed. Specifically, video output section 105 outputs the decoded video input from band synthesis section 104 to the outside.
- step S304 Since the processing up to the force processing (step S307) is not performed, it is possible to detect a moving object at a higher speed and with a lower processing load.
- step S308 an end determination process is performed.
- the stream input unit 101 determines whether or not there is a subsequent video stream, and terminates the process if the video decoding apparatus 100 does not need to detect a moving object any more and decode the video. If not, return to step S301.
- Step S307 is performed after the moving object detection processing (Step S302 and Step S303), but is not limited thereto, and the moving object detection processing is performed in parallel with the decoding processing of the base layer and the enhancement layer. It is possible.
- the enhancement layer can include not only the horizontal direction component, the vertical direction component, and the diagonal direction component but also information about the difference between the reduced image and the image obtained by decoding the base layer.
- information on the horizontal component, the vertical component, and the diagonal component obtained by directly subdividing the input image, and the motion generated by motion prediction compensation
- the base layer using the motion prediction code and the horizontal, vertical, and diagonal components can be extracted.
- a moving object can be detected with high speed, high accuracy, and a low processing load without decoding a video stream that is an enhancement layer using bit plane coding.
- the edge information it is possible to extract the edge information from the video stream of the enhancement layer from the video stream of the base layer, and to show that the motion information does not move. Processing such as extraction of edge information is stopped when there is In addition, when the edge information indicates that there is no edge, processing such as extraction of motion information can be stopped to reduce the processing load, and the contour of the object can be reduced at high speed. Can be detected. At this time, either of the extraction of the motion information and the extraction of the edge information may be performed first, or they may be performed in parallel.
- the detection of a moving object can be performed only by using a motion vector and edge information of a part of a bit plane, a low bit rate such as a situation where a communication speed is limited. , It is possible to detect a moving object at high speed and with high efficiency.
- the enhancement layer decoding unit 103 extracts edge information necessary for detecting a moving object
- the base layer decoding unit 102 extracts motion information. Since the video decoding process and the moving object detection process can share the means and processes of the sections, the detection of the moving object and the decoding of the video can be performed simultaneously and at high speed. The size of the entire apparatus can be reduced.
- enhancement layer decoding section 103 generates start signal included in bit plane header 601 in the video stream, and end signal 702 for each area such as 8 ⁇ 8 pixels. It is possible to generate edge information at high speed simply by searching for and counting the code length between identification signals.
- base layer decoding section 102 searches for an identification signal for each area such as, for example, 8 ⁇ 8 pixels in the video stream, and the identification signal power is determined. It is possible to generate motion information at high speed only by decoding the motion vector at the position.
- moving object detecting section 106 detects the contour of the moving object based on the edge information and the motion information, detects the inside of the moving object based on the motion information and the result of the detection, In addition, by removing erroneous detection, it is possible to detect a moving object with high accuracy.
- band combining section 104 emphasizes the area of the moving object in the decoded video, and performs band synthesis on the reduced video in which the base layer is decoded without band combining.
- the detection result of the moving object can be detected and chewed by the monitoring person.
- Embodiment 2 is an application of the moving object detection method and apparatus according to the present invention to a video surveillance system.
- the video surveillance system has an automatic tracking camera equipped with a video encoding device.
- a moving object in the video is detected at high speed, with high accuracy, and with a low processing load. It automatically tracks and enables efficient video monitoring.
- FIG. 12 is a diagram showing a configuration of a video surveillance system according to Embodiment 2 to which the moving object detection method and device of the present invention are applied.
- This video monitoring system has a video monitoring device 1100, a communication network 1110, and N automatic tracking power cameras 1121 to 112N.
- the automatic tracking camera corresponds to the imaging device of the present invention.
- FIG. 13 is a block diagram showing a configuration of automatic tracking cameras 1121 to 112N according to the second embodiment.
- the automatic tracking camera shown in FIG. 13 corresponds to the automatic tracking camera 1121 in the video surveillance system shown in FIG.
- the automatic tracking camera 1121 includes an imaging unit 1201, a video encoding unit 1202, and an imaging control unit 1203.
- the other automatic tracking cameras 1122 to 112N have the same configuration.
- imaging unit 1201 corresponds to the imaging unit of the present invention
- imaging control unit 1203 corresponds to the imaging control unit of the present invention
- the imaging unit 1201 outputs an image captured by performing an imaging function operation such as pan, tilt, and zoom to the video encoding unit 1202.
- the video encoding unit 1202 divides the input video into bands, and generates a video stream including information on the horizontal component, the vertical component, and the diagonal component, and a motion vector generated by motion prediction compensation. I do.
- the imaging control unit 1203 receives information on a target to be tracked and a result of detection of a moving object, and generates and outputs a control signal for performing pan-tilt-zoom to the imaging unit 1201.
- FIG. 14 is a block diagram illustrating a configuration of the video encoding unit 1202, and illustrates a moving object according to the present invention. This corresponds to a video encoding device to which the detection method and device are applied.
- video encoding unit 1202 includes video input unit 1301, band division unit 1302, basic layer encoding unit 1303, enhancement layer encoding unit 1304, stream output unit 1305, moving object detection It has a unit 1306 and a detection result output unit 1307.
- band division section 1302, base layer coding section 1303, and enhancement layer coding section 1304 correspond to the video coding section of the present invention, and base layer coding section 1303 extracts motion information.
- the enhancement layer coding unit 1304 corresponds to edge information extraction means, and the moving object detection unit 1306 corresponds to moving object detection means.
- the video encoding unit encodes the input video to generate and output a video stream.
- the band division unit 1302 constituting the band division unit divides the input image into bands to generate a reduced image, a horizontal component, a vertical component, and a diagonal component.
- the horizontal component, the vertical component, and the diagonal component are coded as an enhancement layer using a bit plane code.
- the base layer coding unit 1303 extracts motion information from the generated video stream and outputs the extracted motion information to the moving object detection unit 1306.
- Enhancement layer encoding section 1304 also extracts the edge information from the generated video stream power and outputs it to moving object detection section 1306.
- the moving object detection unit 1306 detects the input edge information and the moving information force moving object.
- the stream output unit 1305 and the detection result output unit 1307 correspond to the output unit of the present invention.
- FIG. 15 is a flowchart showing the operation of the automatic tracking camera 1121 shown in FIG. Note that the flowchart shown in FIG. 15 is executed in a software manner by executing a control program stored in a storage device (not shown, for example, a ROM or a flash memory) by a CPU (not shown). It is also possible to make it.
- a control program stored in a storage device (not shown, for example, a ROM or a flash memory) by a CPU (not shown). It is also possible to make it.
- an imaging process is performed in step S1401. Specifically, the imaging unit 1201 captures a video to be monitored, and outputs an input image to the video input unit 1301 of the video encoding unit 1202. Further, the imaging unit 1201 outputs information of the pan / tilt / zoom and the installation location to the detection result output unit 1307 of the video encoding unit 1202.
- a video encoding process is performed.
- the video encoding unit 1202 encodes the input video input from the imaging unit 1202 to generate a video stream, and simultaneously detects a moving object to generate a moving object detection result.
- the generated video stream and the moving object detection result are output to the receiving unit 1101 of the video monitoring device 1100 via the communication network 1110. Further, it outputs the moving object detection result to the imaging control unit 1203.
- step S1403 an imaging control process is performed. More specifically, the image capturing control unit 1203 outputs a target tracking command input from the camera group control unit 1102 of the video monitoring device 1100 via the communication network 1100, and a moving object detection result input from the video coding unit. , And generates a pan / tilt / zoom control signal and outputs the control signal to the imaging unit 1201. The imaging unit 1201 performs pan-tilt-zoom based on the control signal input from the imaging control unit 1203.
- the imaging control unit 1203 adjusts the Generates a control signal to pan and tilt. If there is a deviation between the coordinates for capturing the suspicious person to be supplemented and the coordinates of the area of the moving object indicated by the moving object detection result, the imaging control unit 1203 corrects the deviation and generates a control signal. Is also good. Further, the camera may be panned so that the moving object to be tracked always occupies a certain area with respect to the screen.
- control signal may be generated so that all of the plurality of moving objects are included in the video.
- a control signal for causing the imaging unit 1201 to swing in order to capture a wide area may be generated!
- step S1404 if there is no need to perform video monitoring, such as when the power of the automatic tracking camera 1121 is turned off, the process ends. Otherwise, the process returns to step S1401.
- step S 1402 in FIG. 15 will be described in detail.
- FIG. 16 is a flowchart showing the operation of the video encoding unit 120. Note that the flowchart shown in FIG. 16 executes a control program stored in a storage device (not shown) (for example, a ROM or a flash memory) by a CPU (not shown). It is also possible.
- a storage device for example, a ROM or a flash memory
- step S1501 video input processing is performed. Specifically, the video input unit 1301 inputs an input image from the imaging unit 1201 of the automatic tracking camera 1121 and outputs the input image to the band division unit 1302.
- step S1502 band division processing is performed. More specifically, the input image input from the video input unit 1301 is divided into bands to generate a reduced image, a horizontal component, a vertical component, and a diagonal component. The horizontal direction component, the vertical direction component, and the diagonal direction component are output to the converting unit 1303 to the enhancement layer coding unit 1304.
- step S1503 base layer coding processing is performed. Specifically, base layer coding section 1303 generates a base layer by performing motion prediction compensation coding on the reduced image input from band division section 1302, and outputs the base layer to stream output section 1305. Also, motion information obtained at the time of motion prediction compensation is output to the moving object detection unit 1306.
- step S1504 enhancement layer coding processing is performed. Specifically, the extended layer coding unit 1304 generates an enhancement layer by bit plane coding the horizontal component, the vertical component, and the diagonal component input from the band division unit 1302, and generates a stream output. Output to part 1 305. In addition, edge information obtained at the time of bit plane encoding is output to moving object detection section 1306.
- step S1505 a stream output process is performed. Specifically, the stream output unit 1305 receives the base layer input from the base layer coding unit 1303 and the enhancement layer input from the enhancement layer coding unit 1304 via the communication network 1110 to the video monitoring device 1100. Output to communication unit 1101.
- step S1506 a moving object detection process is performed. Specifically, the moving object detection unit 1306 detects a moving object using the motion information input from the base layer coding unit 1303 and the edge information input from the enhancement layer coding unit 1304, The moving object detection result is generated and output to the detection result output unit 1307.
- step S1507 a detection result output process is performed. Specifically, the detection result output
- the power unit 1307 transmits, via the communication network 1110, the moving object detection result input from the moving object detection unit 1306 and information such as pan / tilt / zoom and the installation position input from the imaging unit 1201 of the automatic tracking camera 1121. Output to the receiving unit 1101 of the video monitoring device 1100.
- the video monitoring device 1100 has a receiving unit 1101, an image recognition unit 1102, and a camera group control unit 1103.
- the image recognition unit 1102 corresponds to the image recognition unit of the present invention.
- the image recognition unit 1102 receives a video stream and a detection result of a moving object, performs detailed image recognition, and outputs the image recognition result to the camera group control unit 1103. I do.
- the camera group control unit 1103 corresponds to the camera group control unit of the present invention, inputs the result of image recognition, and generates and outputs target information to be tracked to the cameras 1121 to 112N.
- FIG. 17 is a flowchart showing the operation of the video monitoring device 1100.
- step S1601 reception processing is performed. Specifically, the receiving unit 1101 inputs the video stream and the moving object detection result from the automatic tracking camera 1121 via the communication network 1110, and outputs them to the image recognition unit 1102.
- step S1602 an image recognition process is performed. Specifically, the video stream is decoded using the video stream input from the image recognition unit 1102 and the moving object detection result input from the reception unit 1101, and the detection of a person's face using various known image recognition methods is performed. Authentication and the like are performed, and the result is generated and output to the camera group control unit 1103. Further, the image recognition unit 1102 can perform the processing at high speed by not performing the image recognition except for the area of the moving object included in the moving object detection result.
- step S1603 a camera control process is performed. Specifically, the camera group control unit 1103 uses the image recognition result input from the image recognition unit 1102 to , And outputs a target tracking command to the imaging control unit 1203 of the automatic tracking camera 1121 via the communication network 1110. Also, when a new tracking is required for the other automatic tracking cameras 1122 to 112N based on the image recognition result of the automatic tracking camera 1121, a new target tracking command is generated and the corresponding automatic tracking camera is generated via the communication network 1110. Output to the image section 1203 of 1122 to 112N.
- the camera group control unit 1103 uses the coordinates and enlargement to make the suspicious person larger.
- a target tracking command including a rate and the like is generated. If the suspicious person is present in the video but the automatic tracking camera 1121 cannot capture the suspicious person's face, the automatic tracking camera 1122 generates a target tracking instruction to cause the suspicious person to be photographed. Then, a target tracking command for causing the automatic tracking camera 1121 to capture a wide range including the suspicious person is generated.
- step S 1604 a termination determination is made, and if there is no need to perform video monitoring such as when the power of the video monitoring device 1100 is turned off, the process returns to step S 1601 otherwise.
- FIG. 18 is a sequence diagram showing the operation of the video monitoring system according to the present embodiment.
- step S1701 First, when the auto-tracking camera 1121 captures an image of a monitoring target, a video stream including information on a horizontal component, a vertical component, and a diagonal component, and a motion vector generated by motion prediction compensation. Is generated, the moving object detection results are obtained, and these are transmitted to the video monitoring device 1100 via the communication network 1110 (step S1701).
- the video monitoring device 1100 decodes the received video stream, and recognizes the target object using the information of the moving object detection result. Then, a target tracking command for tracking the target object is transmitted to the automatic tracking camera (step S1702).
- the automatic tracking camera 1121 controls the imaging unit to track the target. Then, the video stream or the like at this time is transmitted to the video monitoring device 1100 (step S1703).
- step S1702 and step S1703 described above are repeated.
- automatic tracking The video stream or the like from the camera 1121 is always transmitted to the video monitoring device 1100 regardless of the presence or absence of a command from the video monitoring device 1100.
- the video surveillance system is configured to encode and compress a video in order to transmit the video from the automatic tracking camera to the video monitoring device via the communication network.
- a moving object is simultaneously detected and the result information can be notified to the video monitoring device. There is no need to detect moving objects. Thereby, the processing of the video monitoring device can be reduced.
- the automatic tracking camera in an image monitoring system that receives an image captured by an automatic tracking camera located in a remote place and monitors and tracks the image with an image monitoring device, the automatic tracking camera
- a video stream containing information on the horizontal, vertical, and diagonal components of the captured image and the motion vector generated by motion prediction compensation Since video encoding processing and moving object detection processing can be performed, high-precision moving object detection and video encoding can be performed simultaneously and at high speed, and the scale of the entire system Can also be reduced.
- the automatic tracking camera can control the panning / tilting / zoom imaging function in accordance with an instruction from the video monitoring device obtained based on the detection result of the moving object. Therefore, it is possible to efficiently monitor moving objects and eventually suspicious persons.
- the video monitoring device recognizes only the area of the moving object based on the detection result of the moving object input together with the video stream.
- the load can be reduced, and the accuracy of image recognition is improved.
- this makes it possible to provide a video monitoring system capable of controlling more automatic tracking cameras and monitoring efficiently.
- Embodiment 3 is a moving object detection method and apparatus according to the present invention.
- a video stream having a base layer and enhancement layer power is also provided.
- This section describes a method for detecting a moving object using only the video stream of the enhancement layer in the stream.
- the video stream of the enhancement layer handled in this embodiment is based on ISO / IEC 14496-2
- FIG. 19 is a block diagram showing a configuration of a moving object detection device 1900 according to Embodiment 1 to which the moving object detection method and device of the present invention are applied.
- moving object detection apparatus 1900 includes stream input section 1901, motion information extraction section 1902, edge information extraction section 1903, moving object detection section 1904, detection result output section 190
- stream input section 1901 inputs only the video stream of the enhancement layer.
- the motion information extraction unit 1902 corresponds to the motion information extraction means, and the edge information extraction unit 1902
- the moving object detection unit 1904 corresponds to moving object detection means.
- the motion information extracting means also extracts the motion information from the input video stream power of the enhancement layer and outputs it to the moving object detecting means.
- the edge information extracting means also extracts the edge information from the input video stream power of the extended layer and outputs it to the moving object detecting means.
- the moving object detecting means detects the input edge information and the motion information force and the moving object.
- FIG. 20 is a flowchart showing the operation of moving object apparatus 1900 of Embodiment 3 shown in FIG.
- the flowchart shown in FIG. 20 is executed by software by executing a control program stored in a storage device (not shown, such as a ROM or a flash memory) by a CPU (not shown). It is also possible to do so.
- the stream input unit 1901 inputs a video stream of the enhancement layer from outside the moving object detection device 1900, and outputs it to the motion information extraction unit 1902 and the edge information extraction unit 1903 (step S2001).
- motion information extracting section 1902 extracts motion information from the enhancement layer input from stream input section 1901, and outputs the motion information to moving object detecting section 1904 (step S2002).
- the edge information extraction unit 1903 also extracts edge information from the enhancement layer power input from the stream input unit 1902, and outputs it to the moving object detection unit 1904 (step S2003).
- the motion vector of the entire frame area is stored at the head of the enhancement layer of one frame, and information of the bit plane is stored subsequently. Therefore, the stream input unit 1901 inputs up to the motion vector video stream, the motion information extraction unit 1902 generates motion information, and inputs the bit plane video stream only when there is motion in the frame, and inputs the edge information extraction unit 1903. May be output.
- the stream input unit 1901 inputs up to the motion vector video stream
- the motion information extraction unit 1902 generates motion information
- the edge information extraction unit 1903. May be output.
- a moving object detection unit 1904 detects a moving object using the motion information input from the motion information extraction unit 1902 and the edge information input from the edge information extraction unit 1903, and implements the embodiment. As in the case of 1, a moving object detection result is generated and output to the detection result output unit 1905 (steps S2004 to S2006).
- the detection result output unit 1905 outputs the coordinates of the area of the moving object input from the moving object detection unit 1904 to the outside (step S2007).
- the stream input unit 1901 determines whether or not there is a subsequent video stream. If the moving object detection device 1900 does not detect any more moving objects, the process ends. If not, the process returns to step S2001. (Step S 2008).
- the moving object detection device of the present invention extracts motion information from a video stream that has been video-encoded using hierarchical coding and motion prediction compensation coding, which divide the video into a plurality of layers and encode it.
- Motion information extracting means for extracting edge information from the video stream It has a configuration including edge information extracting means, and moving object detecting means for detecting a moving object using the motion information and the edge information and outputting the detection result.
- the edge information extraction means may further include, among bit plane information obtained by bit plane encoding the image, the most significant bit plane color N (N is a natural number) bit bits The bit plane information up to the plane is also extracted as edge information of the video stream.
- an edge having a specific strength or more can be detected, and a contour of an object can be detected at high speed.
- the contour of an object can be detected only on a bit plane at or above a certain bit position, and a bit plane with less than a certain bit position is required to receive a required video stream via a communication network with a slow communication speed.
- high-precision detection can be performed at a low bit rate.
- the video stream is further divided into a plurality of regions, and the moving object detection means determines that the sum of the code lengths of the bit plane information in the region is smaller than If the value is equal to or greater than a predetermined first value, the area is determined as a contour area of the moving object.
- the moving object detection means may further include: when a total of code lengths of the bit plane information in the area is equal to or less than a predetermined second value, The region is determined as a contour region of the moving object.
- the contour of the object is a line
- a certain region includes too many horizontal components, vertical components, and diagonal components, for example, it is a region including a stripe pattern. Yes, it is possible to determine that it is not the contour of the moving object and prevent erroneous detection.
- the motion information extraction means may further include The motion vector detecting unit extracts a motion vector that is determined as a contour region of the object, and
- the area is determined to be a contour area of the moving object.
- the motion information extracting means further extracts a region force first motion vector determined to be a contour region of the moving object, and detects a position in the vicinity of the region.
- a moving object detecting means for extracting a second motion vector, and measuring a magnitude of a difference vector between the first motion vector and the second motion vector. If the measured value is equal to or less than a predetermined fourth value, the selected area is determined to be an internal area of the moving object.
- the motion information extracting means selects a plurality of areas, extracts a motion vector from each of the selected areas, and the moving object detecting means further comprises: Calculating the magnitude of the difference vector between the first motion vector and the motion vector of the selected area, and calculating the sum of the magnitudes of the difference vectors for all the selected areas as the measurement value. is there.
- the region of the outline of the moving object in the video has a different speed from the surrounding region, so that a plurality of regions other than the outline of the moving object are not regions of the moving object! / , And the detection accuracy of the moving object can be improved.
- the moving object detection means may further include a motion vector of the area determined to be an internal area of the moving object, and a motion vector of an area located near the area. If the magnitude of the difference vector with respect to the motion vector is equal to or smaller than a predetermined fifth value, it is determined that the moving object is inside the area.
- the area of the moving object moving at a certain speed that is not determined to be the moving object is determined.
- the area can be detected, and the accuracy of detection of the moving object can be improved.
- the moving object detection means may further include a region surrounded by the outline region of the moving object or an area determined to be an internal region of the moving object, and Is determined to be the internal area of
- the moving object detection means further includes a contour area or an inner area of the second moving object near the contour area or the inner area determined as the first moving object. If the number of regions determined to be regions is equal to or greater than a predetermined sixth value, the outline region or the inner region determined to be the first moving object is re-determined to be the first moving object. Things.
- the moving object detection method of the present invention is a method for detecting a moving object of a video stream, wherein the moving object detection device for detecting the moving object executes the processing by dividing the image into a plurality of layers and encoding the moving image. Extracting video stream power and motion information that have been video-encoded using hierarchical coding and motion prediction compensation coding; extracting the video stream power and edge information; and extracting the extracted motion information. And detecting the moving object using the edge information.
- the moving object detection program of the present invention uses a computer for detecting a moving object in a video stream using a hierarchical code and a motion prediction compensation code, which divide a video into a plurality of layers and encode the video. Extracting motion information from the video stream encoded in the video stream, extracting the video stream force edge information, and detecting a moving object using the extracted motion information and the edge information. And the steps to be performed. [0179] According to this program, it is possible to detect the outline of an object without decoding the video stream, and further, it is possible to detect a moving object from motion information, and to move at high speed, with high accuracy, and with a low processing load. Objects can be detected.
- the video decoding device of the present invention is a video decoding means for decoding a video stream coded by hierarchical coding and motion prediction compensation coding by dividing a video into a plurality of layers. And moving object detection means for detecting a moving object from the motion information and the edge information extracted when the video decoding means decodes the video stream.
- the video decoding device and the moving object detection device can share some processing and means, and can simultaneously perform video decoding and moving object detection at a high speed. In addition, it is possible to reduce the scale of the entire apparatus.
- the video stream is divided into a plurality of regions, and the moving object detecting means determines in advance that the sum of the code lengths of the bit plane information in the region is predetermined.
- the region is determined as a contour region of the moving object.
- the region including the horizontal direction component, the vertical direction component, and the diagonal direction component exists in the region only by checking the code amount of the bit plane up to the bit position of a certain threshold value.
- the number of edges can be determined, and the contour of the object can be detected at high speed.
- the moving object detecting means may further include a control unit that determines whether the sum of the code lengths of the bit plane information in the area is equal to or less than a predetermined second value.
- the area is determined as a contour area of a moving object.
- the contour of the object is a line, when a certain region includes too many horizontal components, vertical components, and diagonal components, for example, it is a region including a striped pattern. Yes, it is possible to determine that it is not the contour of the moving object and prevent erroneous detection.
- the video decoding device further generates a video in which the area of the moving object detected by the moving object detection means is emphasized.
- the observer can easily detect the moving object.
- the video decoding means further generates a video composed of edge components, and displays only the area of the moving object detected by the moving object detection means in an emphasized manner. It is something.
- the bit rate of the base layer is extremely low due to the limitation of the communication speed, etc., and the image quality is extremely poor. Even when only a video can be generated, the outline alone may be able to recognize the details. .
- the video encoding apparatus provides a video encoding scheme for generating a video stream encoded using hierarchical encoding and motion prediction compensation encoding that divides an image into a plurality of layers.
- the video encoding means and the moving object detection means can share some processing and means, and can simultaneously perform video encoding and detection of the moving object at a high speed. In addition, it is possible to reduce the scale of the entire apparatus.
- the imaging device of the present invention includes imaging means for inputting a video, a video encoding apparatus according to the present invention for encoding the video input by the imaging means, and detection of a moving object output from the moving object detection means. It has imaging control means for controlling an imaging function for the imaging means based on the result, and an output section for outputting a video stream and a detection result of a moving object.
- a moving object can be detected in the process of generating a video stream generated for transmitting a video to a remote location. Therefore, in video monitoring or the like, a suspicious person or the like moves at high speed. In addition to being able to continue detecting and photographing as an object, the video can be transmitted and video monitoring can be performed efficiently.
- the imaging control means may control the imaging means so that the area of the area of the moving object output by the moving object detection means is a fixed ratio to the entire area of the input video. Is controlled. [0195] With this configuration, the moving object and its surroundings can be included in the video, and the moving object of interest can be monitored efficiently.
- the video surveillance system provides an image capturing apparatus according to the present invention, a video stream received from the image capturing apparatus, a video stream received from the image capturing apparatus, and a detection result of the moving object. And a video monitoring device for performing image recognition.
- a moving object can be detected in the process of generating a video stream generated for transmitting a video to a remote place, and image recognition processing of an area other than the moving object is omitted. Since image recognition can be performed at high speed and with a low processing load, it is possible to detect a suspicious person or the like as a moving object at high speed and continue shooting in video monitoring.
- image recognition is not limited to detection of a moving object, but refers to automatic discrimination means using a machine image, including recognition of a person's face and authentication of a person.
- the video stream is further hierarchized and coded into a base layer and an enhancement layer, and the motion information extracting means includes a video stream power of the base layer.
- the motion information is extracted, and the edge information extracting means extracts the edge information of the video stream of the extended layer.
- the video stream is further encoded by being hierarchized into a base layer and an enhancement layer, and the motion information extracting means includes a video stream power of the enhancement layer.
- the motion information is extracted, and the edge information extracting means extracts the edge information of the video stream of the enhancement layer.
- the moving object detection process can be performed only with the video stream of the enhancement layer, and the contour of the object can be detected with a high speed and a small number of video streams.
- the present specification is based on Japanese Patent Application No. 2004-161053 filed on May 31, 2004 and Japanese Patent Application No. 2005-035627, filed on February 14, 2005. All of these details are included here. Industrial applicability
- the present invention is useful for a moving object detection device that detects a moving object from a video stream generated by encoding a video, and detects a moving object at high speed without decoding a video stream. Suitable to do.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Closed-Circuit Television Systems (AREA)
- Studio Devices (AREA)
Abstract
Description
Claims
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004-161053 | 2004-05-31 | ||
JP2004161053 | 2004-05-31 | ||
JP2005-035627 | 2005-02-14 | ||
JP2005035627A JP2007266652A (ja) | 2004-05-31 | 2005-02-14 | 移動物体検出装置、移動物体検出方法、移動物体検出プログラム、映像復号化装置、映像符号化装置、撮像装置及び映像管理システム |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005117448A1 true WO2005117448A1 (ja) | 2005-12-08 |
Family
ID=35451279
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/009665 WO2005117448A1 (ja) | 2004-05-31 | 2005-05-26 | 移動物体検出装置および移動物体検出方法 |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP2007266652A (ja) |
WO (1) | WO2005117448A1 (ja) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2013229806A (ja) * | 2012-04-26 | 2013-11-07 | Toshiba Corp | 遠隔点検装置および監視装置 |
JP2016031576A (ja) * | 2014-07-28 | 2016-03-07 | クラリオン株式会社 | 物体検出装置 |
CN105516650A (zh) * | 2014-10-14 | 2016-04-20 | 西门子公司 | 用于检测运动对象的设备和方法 |
CN113706573A (zh) * | 2020-05-08 | 2021-11-26 | 杭州海康威视数字技术股份有限公司 | 一种运动物体的检测方法、装置及存储介质 |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010113129A (ja) * | 2008-11-06 | 2010-05-20 | Nikon Corp | 画像追尾装置、焦点調節装置、および撮像装置 |
KR101416957B1 (ko) * | 2012-10-09 | 2014-07-09 | 주식회사 아이티엑스시큐리티 | 영상 기록 장치 및 svc 비디오 스트림을 이용한 모션 분석 방법 |
US9921397B2 (en) | 2012-12-11 | 2018-03-20 | Solatube International, Inc. | Daylight collectors with thermal control |
CA2980037C (en) | 2015-03-18 | 2018-08-28 | Solatube International, Inc. | Daylight collectors with diffuse and direct light collection |
US9816675B2 (en) | 2015-03-18 | 2017-11-14 | Solatube International, Inc. | Daylight collectors with diffuse and direct light collection |
JP6537396B2 (ja) * | 2015-08-03 | 2019-07-03 | キヤノン株式会社 | 画像処理装置、撮像装置および画像処理方法 |
WO2017094140A1 (ja) | 2015-12-02 | 2017-06-08 | 三菱電機株式会社 | 物体検出装置及び物体検出方法 |
JP6696083B2 (ja) * | 2016-05-20 | 2020-05-20 | 国際航業株式会社 | 領域変位算出システム、領域変位算出方法、及び領域変位算出プログラム |
WO2018037665A1 (ja) * | 2016-08-22 | 2018-03-01 | 日本電気株式会社 | 情報処理装置、情報処理システム、制御方法、及びプログラム |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH01318382A (ja) * | 1988-06-17 | 1989-12-22 | Matsushita Electric Ind Co Ltd | 動き検出装置 |
JPH1075457A (ja) * | 1996-08-29 | 1998-03-17 | Kokusai Denshin Denwa Co Ltd <Kdd> | 動画像内の移動物体検出装置 |
JP2001250118A (ja) * | 2000-03-06 | 2001-09-14 | Kddi Corp | 動画像内の移動物体検出追跡装置 |
JP2003032496A (ja) * | 2001-07-12 | 2003-01-31 | Sanyo Electric Co Ltd | 画像符号化装置および方法 |
-
2005
- 2005-02-14 JP JP2005035627A patent/JP2007266652A/ja active Pending
- 2005-05-26 WO PCT/JP2005/009665 patent/WO2005117448A1/ja not_active Application Discontinuation
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH01318382A (ja) * | 1988-06-17 | 1989-12-22 | Matsushita Electric Ind Co Ltd | 動き検出装置 |
JPH1075457A (ja) * | 1996-08-29 | 1998-03-17 | Kokusai Denshin Denwa Co Ltd <Kdd> | 動画像内の移動物体検出装置 |
JP2001250118A (ja) * | 2000-03-06 | 2001-09-14 | Kddi Corp | 動画像内の移動物体検出追跡装置 |
JP2003032496A (ja) * | 2001-07-12 | 2003-01-31 | Sanyo Electric Co Ltd | 画像符号化装置および方法 |
Non-Patent Citations (2)
Title |
---|
OKUMURA M. ET AL: "Ugoki Tokutyo to Iro Joho o Riyo shita Dobuttai Kenshutsu ni yoru Scene Bunkatsu Shuho ni Kansuru Kento", THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS GIJUTSU KENKYU HOKOKU, vol. 103, no. 585, 16 January 2004 (2004-01-16), pages 31 - 36, XP002997043 * |
YONEYAMA A. ET AL: "MPEG Video Stream kara no Idobuttai no Kenshutsu", THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS, vol. J81-D-II, no. 8, 25 August 1998 (1998-08-25), pages 1776 - 1786, XP002997044 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2013229806A (ja) * | 2012-04-26 | 2013-11-07 | Toshiba Corp | 遠隔点検装置および監視装置 |
JP2016031576A (ja) * | 2014-07-28 | 2016-03-07 | クラリオン株式会社 | 物体検出装置 |
CN105516650A (zh) * | 2014-10-14 | 2016-04-20 | 西门子公司 | 用于检测运动对象的设备和方法 |
CN113706573A (zh) * | 2020-05-08 | 2021-11-26 | 杭州海康威视数字技术股份有限公司 | 一种运动物体的检测方法、装置及存储介质 |
CN113706573B (zh) * | 2020-05-08 | 2024-06-11 | 杭州海康威视数字技术股份有限公司 | 一种运动物体的检测方法、装置及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
JP2007266652A (ja) | 2007-10-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8983121B2 (en) | Image processing apparatus and method thereof | |
US7616782B2 (en) | Mesh based frame processing and applications | |
US8315481B2 (en) | Image transmitting apparatus, image receiving apparatus, image transmitting and receiving system, recording medium recording image transmitting program, and recording medium recording image receiving program | |
US8331617B2 (en) | Robot vision system and detection method | |
JP3926572B2 (ja) | 画像監視方法、画像監視装置及び記憶媒体 | |
WO2019076503A1 (en) | APPARATUS, METHOD AND COMPUTER PROGRAM FOR ENCODING VOLUMETRIC VIDEO | |
WO2017221643A1 (ja) | 画像処理装置、画像処理システム、および画像処理方法、並びにプログラム | |
KR20080049063A (ko) | 모션 감지 디바이스 | |
CN108012155A (zh) | 预拼接图像的视频编码方法、视频解码方法和相关的装置 | |
WO2005117448A1 (ja) | 移動物体検出装置および移動物体検出方法 | |
US11503267B2 (en) | Image processing device, content processing device, content processing system, and image processing method | |
KR20120072351A (ko) | 디지털 이미지 안정화 | |
WO2003024116A1 (en) | Motion estimation and/or compensation | |
US9584806B2 (en) | Using depth information to assist motion compensation-based video coding | |
US5596370A (en) | Boundary matching motion estimation apparatus | |
WO2017221644A1 (ja) | 画像処理装置、画像処理システム、および画像処理方法、並びにプログラム | |
KR20110111106A (ko) | 객체추적 및 로이터링 장치 및 방법 | |
US11044399B2 (en) | Video surveillance system | |
CA2812890C (en) | Mesh based frame processing and applications | |
KR20030049804A (ko) | 카메라 움직임 판별 장치 및 방법 | |
Hofer et al. | Comparison of Analyze-Then-Compress Methods in Edge-Assisted Visual SLAM | |
JP3279354B2 (ja) | 3次元ボリュームデータの動き補償予測方式 | |
JP2009268065A (ja) | 画像処理システム、画像処理方法、およびプログラム | |
JP2701393B2 (ja) | 動画像符号化装置 | |
US6898244B1 (en) | Movement vector generating apparatus and method and image encoding apparatus and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200580000797.4 Country of ref document: CN |
|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2005743856 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2005743856 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: DE |
|
NENP | Non-entry into the national phase |
Ref country code: JP |