WO2020001216A1 - Abnormal event detection - Google Patents
Abnormal event detection Download PDFInfo
- Publication number
- WO2020001216A1 WO2020001216A1 PCT/CN2019/088703 CN2019088703W WO2020001216A1 WO 2020001216 A1 WO2020001216 A1 WO 2020001216A1 CN 2019088703 W CN2019088703 W CN 2019088703W WO 2020001216 A1 WO2020001216 A1 WO 2020001216A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- target
- foreground
- targets
- detection
- frame
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/246—Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30232—Surveillance
Definitions
- the present application relates to the field of image processing, and in particular, to a method, an apparatus, an electronic device, and a computer-readable storage medium for detecting an abnormal event.
- ATMs Automatic Teller Machines
- a protective cabin is installed outside the ATM.
- some public telephones are also equipped with a protective compartment.
- the protection cabin brings convenience to the depositor (or the user of the public telephone).
- illegal and criminal activities in the protection cabin also occur from time to time, mainly including trailing robbery and criminals staying for a long time to cause damage.
- users may leave their belongings in the protective cabin after completing the necessary business departure in the protective cabin.
- the monitoring personnel can learn the three types of abnormal events in the protection cabin in time, the safety of the protection cabin and the user experience can be effectively improved.
- embodiments of the present application provide a method, an apparatus, an electronic device, and a computer-readable storage medium for detecting an abnormal event, so as to accurately detect abnormal events such as a person's tail, a person's stay, and an item's left.
- an embodiment of the present application provides a method for detecting an abnormal event, including: using a trained convolutional neural network CNN to obtain one or more feature targets from a monitored video stream, wherein the video stream It is a video stream obtained by monitoring a specified area, and the characteristic target represents a target person appearing in the specified area; using a preset foreground model for foreground detection to obtain one or more foreground targets from the video stream; based on The feature target and the foreground target determine one or more detection targets; track each of the detection targets by recording the presence of the detection target in the video stream to obtain a tracking result; and according to the tracking result Identify abnormal events.
- an embodiment of the present application provides an abnormal event detection device, including: a first obtaining unit, configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN Wherein, the video stream is obtained by monitoring a specified area, and the characteristic target represents a target person appearing in the specified area; a second obtaining unit is configured to use a preset foreground model for foreground detection from the One or more foreground targets are obtained from the video stream; a first determining unit is configured to determine one or more detection targets based on the feature target and the foreground target; a second determining unit is configured to record the detection targets in the The existence of the video stream tracks each of the detection targets to obtain a tracking result, and determines an abnormal event based on the tracking result.
- an embodiment of the present application provides an electronic device, including a processor, and a memory for storing executable instructions of the processor.
- the processor is configured to execute the method for detecting an abnormal event according to the first aspect.
- an embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the method for detecting an abnormal event according to the first aspect is implemented.
- FIG. 1 is a flowchart of a method for detecting an abnormal event shown in the present application
- FIG. 2 is a block diagram of an embodiment of an abnormal event detection device shown in the present application.
- FIG. 3 is a hardware structural diagram of an electronic device shown in the present application.
- a camera on the top of the protective cabin is used to record the monitoring video in the protective cabin in real time, and then a foreground model of the protective cabin is established, and a foreground target (people entering the protective cabin) is extracted based on the foreground model. Further, the moving foreground target is tracked to judge the entry and exit of the personnel in the protective cabin, so as to realize the judgment of abnormal events such as the following of the personnel and the retention of the personnel, and to alert the monitoring personnel.
- the method of obtaining a foreground target by establishing a foreground model is susceptible to environmental influences and may cause misjudgment of abnormal events.
- the discrimination between the foreground object and the background is low, the probability of detecting the foreground object is low.
- the probability of detecting the foreground object is low. For example, when the light outside the protective cabin changes or the shadow of people outside the cabin is projected into the cabin, false foreground targets may be generated, causing misjudgment of anomalous events; and the color of the personnel ’s coat and the background in the protective cabin When the colors are similar, it is difficult to extract the foreground target.
- a flowchart of a method for detecting an abnormal event shown in this application includes the following steps:
- Step 101 Use the trained convolutional neural network CNN for feature detection to obtain one or more feature targets of the target person from the monitored video stream.
- the video stream is a video stream obtained by monitoring a specified area.
- the execution method of the above method may be an electronic device docked with a monitoring device (such as a monitoring camera).
- the electronic device may be a hard disk video recorder.
- the hard disk video recorder is taken as the main execution body in the following text.
- CNNs Convolutional Neural Network
- CNNs can identify the feature targets appearing in the video frames of the video stream through the training of human features in advance.
- CNNs can be trained with the head and shoulders of a person, so that the CNN can identify the head and shoulder targets appearing in the video frame.
- CNNs can also be trained with other personnel characteristics (for example, limbs and torso), which may be specifically based on the characteristics of personnel that can be easily monitored by the monitoring device.
- the specified area may be any area where an abnormal event may occur, and the specified area is monitored by a monitoring device to generate a video stream.
- the above-mentioned designated area may be the inside of the protection cabin.
- the trained CNN is used to obtain the person's characteristic target from the video stream to detect the persons appearing in the video stream.
- the hard disk video recorder can record the obtained coordinates of the upper-left corner of the target frame of the feature target in the feature target table, the width, the height, and the confidence that the feature target is a human feature.
- the feature target table includes a mapping relationship between coordinates, width, height, and confidence of an upper left corner of a target frame of the feature target.
- Step 102 Use a preset foreground model for foreground detection to obtain one or more foreground objects from the video stream.
- the foreground models used for foreground detection include Gaussian Mixture Model (Vius) and ViBe (visual background extractor) algorithms.
- the foregoing foreground model is established based on the RGB (Red Green Blue) information of the monitored specified area, and can be used to identify the foreground target appearing in the video frame of the video stream.
- the foreground target refers to the target relative to the established foreground model.
- the hard disk video recorder uses the trained foreground model to obtain a foreground target from the video stream.
- the foreground target may be a body part of a person, an item, or even a part of the scene in the protective cabin (for example, an ATM machine illuminated by light may be identified as a foreground target). Therefore, further analysis and determination are needed in the future. The specifics of the prospects.
- the hard disk video recorder can record the obtained coordinates and the width and height of the upper left corner of the target frame of the foreground target in the foreground target table.
- the foregoing foreground target table includes a mapping relationship of coordinates, width, and height of an upper left corner of the foreground target.
- Step 103 Determine one or more detection targets based on the feature target and the foreground target.
- the hard disk video recorder may determine a detection target to be tracked subsequently based on the feature target and the foreground target. Specifically, the hard disk video recorder may select a foreground object of interest that is not associated with each characteristic object from the obtained foreground objects, and then determine each characteristic object and the foreground object of interest as detection targets.
- the hard disk video recorder may first calculate, for each foreground target, the area of intersection between the target frame of the foreground target and the target frame of each characteristic target.
- the position and area of the target frame of each foreground target in the video frame may be determined by the coordinates and width and height of the upper left corner of the target frame of each foreground target recorded in the foreground target table. And, the position and area of the target frame of each feature target in the video frame are determined by the coordinates and width and height of the upper left corner of the target frame of each feature target recorded in the feature target table. Further, for each foreground target, an area of intersection between the target frame of the foreground target and the target frame of each feature target is determined.
- the hard disk video recorder can determine whether the area of the intersection reaches a preset area threshold.
- the focus on the foreground target is not the person's body part.
- the foreground target is associated with the feature target.
- the foreground target is the body part of the person indicated by the characteristic target.
- the hard disk video recorder may determine the foreground target of interest that is not associated with each characteristic target and each characteristic target as detection targets to track the above detection targets.
- Step 104 Track each detection target by recording the existence of the detection target in the video stream, and determine an abnormal event according to the tracking result.
- the hard disk video recorder can use multiple object tracking technology (Multiple Object Tracking / Multiple Target Tracking) to track the center point of the target frame of the detection target.
- the center point of the target frame of the detection target may be determined based on the coordinates and width and height of the upper left corner of the target frame.
- the hard disk video recorder can record the type of each detection target tracked, the historical coordinates of the center point of the target frame of each detection target, and the video frame identifier of the video frame where the detection target is located in the tracking table.
- the historical coordinates are coordinates of a center point of a target frame of the detection target in each video frame of a video stream in which the detection target exists.
- the hard disk video recorder will continuously record the coordinates of the center point of the target frame in the video frame of the detection target during the tracking process.
- the tracking table includes the type of the detection target, the historical coordinates of the center point of the target frame of the detection target, and the mapping relationship of the video frame identifier of the video frame where the detection target is located.
- the types of detection targets mentioned above include personnel and non-persons, where non-persons include objects and false foregrounds (such as silhouettes).
- the hard disk video recorder may indicate that the type of the characteristic target is a person by recording the confidence level of the person's characteristics, and the confidence level of the foreground target is temporarily recorded as zero.
- the video frame identifier may be a frame number of a video frame, and the frame number indicates a position of the video frame in the video stream, and frame numbers of two frames before and after the video frame are different by one. Therefore, in practical applications, the position of the video frame in which the detection target exists in the video stream can be determined by recording the frame number of the video frame in which the detection target is located.
- the DVR determines abnormal events based on the tracking results.
- the hard disk video recorder may convert a preset duration judgment threshold for several abnormal events into a count threshold for the number of video frames. For example, for a trailing event, the preset duration judgment threshold is 5 minutes. Since there are 25 frames per second, the converted trailing count threshold is 7500. For a detained event, the preset duration judgment threshold is 10 minutes. The retention count threshold value of 15,000 is 15,000; for the item leftover event, the preset duration judgment threshold value is 10 minutes, and the converted residual count threshold value is 15000.
- the hard disk video recorder may determine the first target video frame based on the above-mentioned tracking table and according to the type of each detection target in the tracking result and the video frame identifier of the video frame in which it is located. Wherein, there is at least one characteristic target in the first target video frame. It is determined whether the number of the first target video frames reaches a preset retention count threshold. If so, determine that a detention event exists.
- the hard disk video recorder tracks the detection target, as long as there is a tracking entry of at least one type of detection target in the tracking table, the number of video frames in which at least one person exists (the video frames described above) (That is, the first target video frame), and a retention count is obtained. Every time a new first target video frame is obtained, the hard disk video recorder may increase the above-mentioned retention count by one, and determine whether the retention count reaches the above-mentioned retention count threshold.
- the stay count is continuously updated.
- the hard disk video recorder can output stuck alarm information to the video surveillance personnel.
- each of the first target video frames corresponding to the staying count is continuous in time.
- the hard disk video recorder may determine the second target video frame based on the above tracking table, according to the type of each detection target in the tracking result, and the video frame identifier of the video frame in which it is located. There are at least two feature targets in the second target video frame. It is determined whether the number of the second target video frames reaches a preset trailing count threshold. If so, determine that a trailing event exists.
- the hard disk video recorder tracks the detection target, if there are at least two detection target types in the tracking table, the number of video frames in which there are at least two persons can be counted (the above)
- the video frame is the second target video frame), and the trailing count is obtained.
- the hard disk video recorder may increase the above-mentioned trailing count by one, and determine whether the trailing count reaches the above-mentioned trailing count threshold.
- the DVR can output a trailing alarm message to the video surveillance personnel.
- each second target video frame corresponding to the above-mentioned trailing count is temporally continuous.
- the retention count and the trailing count can be counted simultaneously without affecting each other.
- the hard disk video recorder can determine the third target video frame based on the above tracking table, according to the type of each detection target in the tracking result, the video frame identification of the video frame where it is located, and the historical coordinates of the center point of the target frame of each detection target . Wherein, there is no feature target but a foreground target in the third target video frame, and the coordinates of the center point of the target frame of the foreground target in the third target video frame where the foreground target exists are located in a preset detection area. .
- the hard disk video recorder tracks the above-mentioned detection target
- the target can be determined.
- the video frame is the third target video frame, it is counted to obtain a legacy count.
- the above-mentioned preset detection area may be an area where the user easily leaves an item in an actual application environment. For example, for an ATM protective cabin, the above detection area may be an area close to the ATM.
- the above-mentioned legacy count may be increased by one, and it is determined whether the legacy count reaches the aforementioned legacy count threshold.
- Each third target video frame corresponding to the above-mentioned legacy count is continuous in time.
- the above-mentioned legacy count does not reach the above-mentioned legacy count threshold, the above-mentioned legacy count is continuously updated.
- the foreground target may be extracted from at least one third target video frame.
- the hard disk video recorder may use the preset CNN classification model to classify the extracted foreground targets, and obtain the confidence that the foreground targets correspond to N different types of foreground targets.
- N is an integer greater than 1
- the N different foreground target types include at least items and non-items.
- the types of foreground targets can include people, items, and non-items.
- Non-items include false prospects.
- the above-mentioned CNN classification model is trained in advance through human characteristics, items that may appear in a specified area, and background content of the specified area.
- items include bank cards, keys, bags, umbrellas, luggage, etc.
- the background content of the designated area includes the ground, ATM, posters posted in the protective cabin, and changes in light outside the protective cabin or shadows Background content when arriving in the cabin, etc.
- the content of the foreground target can be more accurately identified, and the misjudgment of the event left by the item can be avoided.
- the hard disk video recorder may determine the actual content of the foreground target based on the confidence corresponding to the person, the item, and the non-item.
- the confidence level corresponding to the person is the largest, it means that although there are no feature targets in the current video frame, there are still people, and the above-mentioned legacy count can be cleared to zero;
- the confidence level corresponding to the non-item is the largest, it means that the current third target video frame does not exist or the item, and the above-mentioned legacy count can be cleared to zero;
- the hard disk video recorder can output alarms on items left to video surveillance personnel.
- the hard disk video recorder can extract the feature target of the target person in the video stream through the CNN used for feature detection, and obtain the foreground target from the video stream through the foreground model used for foreground detection, and based on
- the detection targets determined by the above characteristic targets and foreground targets can greatly reduce the impact of the environment, can more accurately identify the target person and item, improve the detection rate, and then by tracking the above detection targets, can effectively detect Anomalous events such as personnel tracking, detention of personnel and leftovers of items.
- this application further provides an embodiment of a device for detecting an abnormal event.
- FIG. 2 a block diagram of an embodiment of an abnormal event detection device is shown.
- the abnormal event detection device 20 includes:
- a first obtaining unit 210 is configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN, wherein the video stream is obtained by monitoring a specified area, and the feature target represents Target person appearing in the designated area.
- the second obtaining unit 220 is configured to obtain one or more foreground objects from the video stream by using a preset foreground model for foreground detection.
- the first determining unit 230 is configured to determine one or more detection targets based on the feature target and the foreground target.
- a second determining unit 240 is configured to track each of the detection targets by recording the presence of the detection targets in the video stream to obtain a tracking result, and determine an abnormal event according to the tracking result.
- the first determining unit 230 is further configured to: select a foreground object of interest that is not associated with each of the characteristic objects from the foreground objects; determine each of the characteristic object and the foreground object of interest Is the detection target.
- the first determining unit 230 is further configured to calculate, for each foreground target, an area of intersection between a target frame of the foreground target and a target frame of each characteristic target.
- the area of the intersection between the frame and the target frame of each feature target is less than a preset area threshold, then it is determined that the foreground target is not associated with each feature target.
- the second determining unit 240 is further configured to record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is located, and the A video frame identifier of a video frame of the detection target exists in the video stream.
- the second determining unit 240 is further configured to determine a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame; When the number of the first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists.
- the second determining unit 240 is further configured to determine a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame; When the number of consecutive second target video frames associated with the at least two feature targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
- the second determining unit 240 is further configured to determine a third target video frame in the video stream according to the tracking result; wherein the foreground target exists in the third target video frame, but no one exists.
- the coordinates of the center point of the target frame of the characteristic target in the foreground target in the third target video frame in which the foreground target is located are within a preset detection area; in the continuous association with the foreground target,
- the foreground target is extracted from at least one of the third target video frames;
- the preset foreground target is extracted using a preset CNN classification model Classify to obtain the confidence that the foreground target corresponds to N different foreground target types, where N is an integer greater than 1, and the N different foreground target types include at least items and non-items; and if the foreground target corresponds to the confidence of the item If the degree is the largest, it is determined that there is an item leftover event.
- the embodiment of the apparatus for detecting an abnormal event of the present application can be applied to an electronic device.
- the device embodiments can be implemented by software, or by hardware or a combination of software and hardware. Taking software implementation as an example, as a device in a logical sense, it is formed by reading the corresponding computer program instructions in the non-volatile memory into the memory through the processor of the electronic device where it is located.
- FIG. 3 it is a hardware structure diagram of the electronic device where the abnormal event detection device of this application is located, except for the processor, memory, network interface, and non-volatile memory shown in FIG. 3.
- the electronic equipment in which the device is located in the embodiment may generally include other hardware according to the actual function of the detection device of the abnormal event, and details are not described herein again.
- the memory and non-volatile memory of the electronic device are also equipped with machine-executable instructions corresponding to the first obtaining unit 210, machine-executable instructions corresponding to the second obtaining unit 220, and the first determining unit 230, respectively.
- the relevant part may refer to the description of the method embodiment.
- the device embodiments described above are only schematic, and the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located One place, or it can be distributed across multiple network elements. Some or all of these modules can be selected according to actual needs to achieve the purpose of the solution of this application. Those of ordinary skill in the art can understand and implement without creative efforts.
- An embodiment of the present application further provides a computer-readable storage medium on which a computer program is stored.
- the program is executed by a processor, the method for detecting an abnormal event according to the foregoing method embodiment is implemented.
- the computer-readable storage medium includes a non-transitory computer-readable storage medium.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
- Closed-Circuit Television Systems (AREA)
Abstract
Description
Claims (16)
- 一种异常事件的检测方法,包括:A method for detecting abnormal events, including:利用已训练的卷积神经网络CNN从视频流中获取一个或多个特征目标,其中,所述视频流是监控指定区域得到的,所述特征目标表示出现在所述指定区域的目标人员;Using a trained convolutional neural network CNN to obtain one or more feature targets from a video stream, wherein the video stream is obtained by monitoring a designated area, and the feature target represents a target person appearing in the designated area;利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标;Obtaining one or more foreground objects from the video stream by using a preset foreground model for foreground detection;基于所述特征目标和所述前景目标确定一个或多个检测目标;Determining one or more detection targets based on the feature target and the foreground target;通过记录所述检测目标在所述视频流中的存在情况来跟踪每个所述检测目标,以获取跟踪结果;Track each detection target by recording the existence of the detection target in the video stream to obtain a tracking result;依据所述跟踪结果确定异常事件。An abnormal event is determined according to the tracking result.
- 根据权利要求1所述的方法,其特征在于,基于所述特征目标和所述前景目标确定所述检测目标,包括:The method according to claim 1, wherein determining the detection target based on the characteristic target and the foreground target comprises:从所述前景目标中选择与各个所述特征目标无关联的关注前景目标;Selecting a foreground target of interest that is not associated with each of the characteristic targets from the foreground targets;将各个所述特征目标和所述关注前景目标确定为所述检测目标。Each of the feature target and the attention foreground target is determined as the detection target.
- 根据权利要求2所述的方法,其特征在于,从所述前景目标中选择与各个所述特征目标无关联的关注前景目标,包括:The method according to claim 2, wherein selecting the foreground object of interest that is not associated with each of the characteristic objects from the foreground objects comprises:针对每一所述前景目标,For each stated goal,计算该前景目标的目标框与每一个所述特征目标的目标框之间交集的面积;Calculating the area of intersection between the target frame of the foreground target and the target frame of each of the characteristic targets;若该前景目标的目标框与每一个所述特征目标的目标框之间的交集的面积均小于预设的面积阈值,则确定该前景目标与各个所述特征目标无关联。If the area of the intersection between the target frame of the foreground target and the target frame of each of the feature targets is less than a preset area threshold, it is determined that the foreground target is not associated with each of the feature targets.
- 根据权利要求1所述的方法,其特征在于,记录所述检测目标在所述视频流中的存在情况,包括:The method according to claim 1, wherein recording the existence of the detection target in the video stream comprises:记录所述检测目标的类型、所述检测目标的目标框的中心点在存在所述检测目标的各视频帧中的坐标以及所述视频流中存在所述检测目标的视频帧的视频帧标识。Record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is present, and the video frame identifier of the video frame in which the detection target is present in the video stream.
- 根据权利要求4所述的方法,其特征在于,依据所述跟踪结果确定异常事件,包括:The method according to claim 4, wherein determining an abnormal event according to the tracking result comprises:依据所述跟踪结果确定所述视频流中的第一目标视频帧,其中,所述第一目标视频帧中存在至少一个所述特征目标;Determining a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame;在连续的与所述至少一个特征目标关联的所述第一目标视频帧的个数达到预设的滞留计数阈值时,确定存在滞留事件。When the number of consecutive first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists.
- 根据权利要求4所述的方法,其特征在于,依据所述跟踪结果确定异常事件,包括:The method according to claim 4, wherein determining an abnormal event according to the tracking result comprises:依据所述跟踪结果确定所述视频流中的第二目标视频帧,其中,所述第二目标视频帧中存在至少两个所述特征目标;Determining a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame;在连续的与所述至少两个特征目标关联的所述第二目标视频帧的个数达到预设的尾随计数阈值时,确定存在尾随事件。When the number of consecutive second target video frames associated with the at least two characteristic targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
- 根据权利要求4所述的方法,其特征在于,依据所述跟踪结果确定异常事件,包括:The method according to claim 4, wherein determining an abnormal event according to the tracking result comprises:依据所述跟踪结果确定所述视频流中的第三目标视频帧,其中,所述第三目标视频帧中存在所述前景目标、但不存在任一所述特征目标,所述前景目标的目标框的中心点在存在所述前景目标的第三目标视频帧中的坐标位于预设的检测区域内;Determining a third target video frame in the video stream according to the tracking result, wherein the foreground target exists in the third target video frame, but there is no one of the characteristic targets, and the target of the foreground target The coordinates of the center point of the frame in the third target video frame where the foreground target is located are within a preset detection area;在连续的与所述前景目标关联的所述第三目标视频帧的个数达到预设的遗留计数阈值时,从至少一个所述第三目标视频帧中提取所述前景目标;Extracting the foreground target from at least one third target video frame when the number of consecutive third target video frames associated with the foreground target reaches a preset legacy count threshold;利用预设的CNN分类模型对提取出的所述前景目标进行分类,获得所述前景目标对应于N种不同前景目标类型的置信度,N是大于1的整数,N种不同前景目标类型至少包括物品、非物品;The preset CNN classification model is used to classify the extracted foreground targets to obtain the confidence that the foreground targets correspond to N different types of foreground targets, where N is an integer greater than 1, and the N different types of foreground targets include Articles and non-items;如果所述前景目标对应于物品的置信度最大,则确定存在物品遗留事件。If the foreground object corresponds to the item with the greatest confidence, it is determined that an item legacy event exists.
- 一种异常事件的检测装置,包括:An abnormal event detection device includes:第一获取单元,用于利用已训练的卷积神经网络CNN从监控到的视频流中获取一个或多个特征目标,其中,所述视频流是监控指定区域得到的,所述特征目标表示出现在所述指定区域的目标人员;A first obtaining unit is configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN, wherein the video stream is obtained by monitoring a specified area, and the feature targets indicate occurrence Target personnel in said designated area;第二获取单元,用于利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标;A second obtaining unit, configured to obtain one or more foreground objects from the video stream by using a preset foreground model for foreground detection;第一确定单元,用于基于所述特征目标和所述前景目标确定一个或多个检测目标;A first determining unit, configured to determine one or more detection targets based on the characteristic target and the foreground target;第二确定单元,用于通过记录所述检测目标在所述视频流中的存在情况跟踪每个所述检测目标,以获取跟踪结果,并依据所述跟踪结果确定异常事件。A second determining unit is configured to track each detection target by recording the presence of the detection target in the video stream to obtain a tracking result, and determine an abnormal event according to the tracking result.
- 根据权利要求8所述的装置,其特征在于,所述第一确定单元,进一步用于:The apparatus according to claim 8, wherein the first determining unit is further configured to:从所述前景目标中选择与各个所述特征目标无关联的关注前景目标;Selecting a foreground target of interest that is not associated with each of the characteristic targets from the foreground targets;将各个所述特征目标和所述关注前景目标确定为所述检测目标。Each of the feature target and the attention foreground target is determined as the detection target.
- 根据权利要求9所述的装置,其特征在于,所述第一确定单元,进一步用于:The apparatus according to claim 9, wherein the first determining unit is further configured to:针对每一所述前景目标,For each stated goal,计算该前景目标的目标框与每一个所述特征目标的目标框之间交集的面积;Calculating the area of intersection between the target frame of the foreground target and the target frame of each of the characteristic targets;若该前景目标的目标框与每一个所述特征目标的目标框之间的交集的面积均 小于预设的面积阈值,则确定该前景目标与各个所述特征目标无关联。If the area of the intersection between the target frame of the foreground target and the target frame of each of the characteristic targets is less than a preset area threshold, it is determined that the foreground target is not associated with each of the characteristic targets.
- 根据权利要求8所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 8, wherein the second determining unit is further configured to:记录所述检测目标的类型、所述检测目标的目标框的中心点在存在所述检测目标的各视频帧中的坐标以及所述视频流中存在所述检测目标的视频帧的视频帧标识。Record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is present, and the video frame identifier of the video frame in which the detection target is present in the video stream.
- 根据权利要求11所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 11, wherein the second determining unit is further configured to:依据所述跟踪结果确定所述视频流中的第一目标视频帧,其中,所述第一目标视频帧中存在至少一个所述特征目标;Determining a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame;在连续的与所述至少一个特征目标关联的所述第一目标视频帧的个数达到预设的滞留计数阈值时,确定存在滞留事件。When the number of consecutive first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists.
- 根据权利要求11所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 11, wherein the second determining unit is further configured to:依据所述跟踪结果确定所述视频流中的第二目标视频帧,其中,所述第二目标视频帧中存在至少两个所述特征目标;Determining a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame;在连续的与所述至少两个特征目标关联的所述第二目标视频帧的数量达到预设的尾随计数阈值时,确定存在尾随事件。When the number of consecutive second target video frames associated with the at least two feature targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
- 根据权利要求11所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 11, wherein the second determining unit is further configured to:依据所述跟踪结果确定所述视频流中的第三目标视频帧;其中,所述第三目标视频帧中存在所述前景目标、但不存在任一所述特征目标,所述前景目标的目标框的中心点在存在所述前景目标的第三目标视频帧中的坐标位于预设的检测区域内;Determining a third target video frame in the video stream according to the tracking result; wherein the foreground target exists in the third target video frame, but none of the characteristic targets exists, and the target of the foreground target The coordinates of the center point of the frame in the third target video frame where the foreground target is located are within a preset detection area;在连续的与所述前景目标关联的所述第三目标视频帧的个数达到预设的遗留计数阈值时,从至少一个所述第三目标视频帧中提取所述前景目标;Extracting the foreground target from at least one third target video frame when the number of consecutive third target video frames associated with the foreground target reaches a preset legacy count threshold;利用预设的CNN分类模型对提取出的所述前景目标进行分类,获得所述前景目标对应于N种不同前景目标类型的置信度,N是大于1的整数,N种不同前景目标类型至少包括物品、非物品;The preset CNN classification model is used to classify the extracted foreground targets to obtain the confidence that the foreground targets correspond to N different types of foreground targets, where N is an integer greater than 1, and the N different types of foreground targets include Articles and non-items;如果所述前景目标对应于物品的置信度最大,则确定存在物品遗留事件。If the foreground object corresponds to the item with the greatest confidence, it is determined that an item legacy event exists.
- 一种电子设备,包括:An electronic device includes:处理器,以及,Processors, and,用于存储所述处理器可执行指令的存储器;A memory for storing the processor-executable instructions;其中,所述处理器被配置为执行权利要求1-7中任一项所述的异常事件的检测方法。The processor is configured to execute the method for detecting an abnormal event according to any one of claims 1-7.
- 一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现权利要求1-7中任一项所述的异常事件的检测方法。A computer-readable storage medium having stored thereon a computer program that, when executed by a processor, implements the method for detecting an abnormal event according to any one of claims 1-7.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810672457.5 | 2018-06-26 | ||
CN201810672457.5A CN110648352B (en) | 2018-06-26 | 2018-06-26 | Abnormal event detection method and device and electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020001216A1 true WO2020001216A1 (en) | 2020-01-02 |
Family
ID=68985488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/088703 WO2020001216A1 (en) | 2018-06-26 | 2019-05-28 | Abnormal event detection |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN110648352B (en) |
WO (1) | WO2020001216A1 (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111259827A (en) * | 2020-01-19 | 2020-06-09 | 河海大学 | Automatic detection method and device for water surface floating objects for urban river supervision |
CN111860140A (en) * | 2020-06-10 | 2020-10-30 | 北京迈格威科技有限公司 | Target event detection method and device, computer equipment and storage medium |
CN112700657A (en) * | 2020-12-21 | 2021-04-23 | 北京百度网讯科技有限公司 | Method and device for generating detection information, road side equipment and cloud control platform |
CN113011290A (en) * | 2021-03-03 | 2021-06-22 | 上海商汤智能科技有限公司 | Event detection method and device, electronic equipment and storage medium |
CN114245200A (en) * | 2021-11-02 | 2022-03-25 | 浙江大华技术股份有限公司 | Monitoring scene change determination method, device and system and electronic device |
CN114640608A (en) * | 2022-04-01 | 2022-06-17 | 上海商汤智能科技有限公司 | Test method and device, electronic equipment and computer readable storage medium |
CN115690665A (en) * | 2023-01-03 | 2023-02-03 | 华东交通大学 | Video anomaly detection method and device based on cross U-Net network |
CN117541957A (en) * | 2023-11-08 | 2024-02-09 | 继善(广东)科技有限公司 | Method, system and medium for generating event solving strategy based on artificial intelligence |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113470013A (en) * | 2021-07-28 | 2021-10-01 | 浙江大华技术股份有限公司 | Method and device for detecting moved article |
CN117178301A (en) * | 2022-03-28 | 2023-12-05 | 京东方科技集团股份有限公司 | Method and device for detecting illegal behaviors |
CN117392585B (en) * | 2023-10-24 | 2024-06-18 | 广州广电运通智能科技有限公司 | Gate traffic detection method and device, electronic equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN201638305U (en) * | 2009-05-18 | 2010-11-17 | 上海银晨智能识别科技有限公司 | Intelligent monitoring device for self-service bank and automated teller machine |
US20140055610A1 (en) * | 2012-08-27 | 2014-02-27 | Korea University Research And Business Foundation | Indoor surveillance system and indoor surveillance method |
CN103679123A (en) * | 2012-09-17 | 2014-03-26 | 浙江大华技术股份有限公司 | Method and system for detecting remnant on ATM (Automatic Teller Machine) panel |
CN105718857A (en) * | 2016-01-13 | 2016-06-29 | 兴唐通信科技有限公司 | Human body abnormal behavior detection method and system |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8614744B2 (en) * | 2008-07-21 | 2013-12-24 | International Business Machines Corporation | Area monitoring using prototypical tracks |
CN103605983B (en) * | 2013-10-30 | 2017-01-25 | 天津大学 | Remnant detection and tracking method |
CN105976400B (en) * | 2016-05-10 | 2017-06-30 | 北京旷视科技有限公司 | Method for tracking target and device based on neural network model |
CN105872477B (en) * | 2016-05-27 | 2018-11-23 | 北京旷视科技有限公司 | video monitoring method and video monitoring system |
CN106203274B (en) * | 2016-06-29 | 2020-03-31 | 长沙慧联智能科技有限公司 | Real-time pedestrian detection system and method in video monitoring |
-
2018
- 2018-06-26 CN CN201810672457.5A patent/CN110648352B/en active Active
-
2019
- 2019-05-28 WO PCT/CN2019/088703 patent/WO2020001216A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN201638305U (en) * | 2009-05-18 | 2010-11-17 | 上海银晨智能识别科技有限公司 | Intelligent monitoring device for self-service bank and automated teller machine |
US20140055610A1 (en) * | 2012-08-27 | 2014-02-27 | Korea University Research And Business Foundation | Indoor surveillance system and indoor surveillance method |
CN103679123A (en) * | 2012-09-17 | 2014-03-26 | 浙江大华技术股份有限公司 | Method and system for detecting remnant on ATM (Automatic Teller Machine) panel |
CN105718857A (en) * | 2016-01-13 | 2016-06-29 | 兴唐通信科技有限公司 | Human body abnormal behavior detection method and system |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111259827B (en) * | 2020-01-19 | 2022-07-29 | 河海大学 | Automatic detection method and device for water surface floating objects for urban river supervision |
CN111259827A (en) * | 2020-01-19 | 2020-06-09 | 河海大学 | Automatic detection method and device for water surface floating objects for urban river supervision |
CN111860140A (en) * | 2020-06-10 | 2020-10-30 | 北京迈格威科技有限公司 | Target event detection method and device, computer equipment and storage medium |
CN111860140B (en) * | 2020-06-10 | 2024-05-17 | 北京迈格威科技有限公司 | Target event detection method, device, computer equipment and storage medium |
CN112700657B (en) * | 2020-12-21 | 2023-04-28 | 阿波罗智联(北京)科技有限公司 | Method and device for generating detection information, road side equipment and cloud control platform |
CN112700657A (en) * | 2020-12-21 | 2021-04-23 | 北京百度网讯科技有限公司 | Method and device for generating detection information, road side equipment and cloud control platform |
CN113011290A (en) * | 2021-03-03 | 2021-06-22 | 上海商汤智能科技有限公司 | Event detection method and device, electronic equipment and storage medium |
CN114245200A (en) * | 2021-11-02 | 2022-03-25 | 浙江大华技术股份有限公司 | Monitoring scene change determination method, device and system and electronic device |
CN114245200B (en) * | 2021-11-02 | 2024-05-07 | 浙江大华技术股份有限公司 | Monitoring scene change determining method, device, system and electronic device |
CN114640608A (en) * | 2022-04-01 | 2022-06-17 | 上海商汤智能科技有限公司 | Test method and device, electronic equipment and computer readable storage medium |
CN115690665A (en) * | 2023-01-03 | 2023-02-03 | 华东交通大学 | Video anomaly detection method and device based on cross U-Net network |
CN115690665B (en) * | 2023-01-03 | 2023-03-28 | 华东交通大学 | Video anomaly detection method and device based on cross U-Net network |
CN117541957A (en) * | 2023-11-08 | 2024-02-09 | 继善(广东)科技有限公司 | Method, system and medium for generating event solving strategy based on artificial intelligence |
CN117541957B (en) * | 2023-11-08 | 2024-05-24 | 继善(广东)科技有限公司 | Method, system and medium for generating event solving strategy based on artificial intelligence |
Also Published As
Publication number | Publication date |
---|---|
CN110648352A (en) | 2020-01-03 |
CN110648352B (en) | 2022-09-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020001216A1 (en) | Abnormal event detection | |
Tian et al. | Robust detection of abandoned and removed objects in complex surveillance videos | |
US10943204B2 (en) | Realtime video monitoring applied to reduce customer wait times | |
CN109376639B (en) | Accompanying personnel early warning system and method based on portrait recognition | |
Liao et al. | A localized approach to abandoned luggage detection with foreground-mask sampling | |
US20180115749A1 (en) | Surveillance system and surveillance method | |
JP6148480B2 (en) | Image processing apparatus and image processing method | |
Kamthe et al. | Suspicious activity recognition in video surveillance system | |
CN101556717A (en) | ATM intelligent security system and monitoring method | |
US10635908B2 (en) | Image processing system and image processing method | |
CN104424721B (en) | Recognition methods is blocked in a kind of face of combination ATM | |
JP7197485B2 (en) | Detection system, detection device and method | |
Tripathi et al. | A framework for abandoned object detection from video surveillance | |
Chang et al. | Localized detection of abandoned luggage | |
CN111353338A (en) | Energy efficiency improvement method based on business hall video monitoring | |
Patil et al. | Suspicious movement detection and tracking based on color histogram | |
Zaidi et al. | Video anomaly detection and classification for human activity recognition | |
CN114783037B (en) | Object re-recognition method, object re-recognition apparatus, and computer-readable storage medium | |
JP5752976B2 (en) | Image monitoring device | |
JP5752975B2 (en) | Image monitoring device | |
JP2012212968A (en) | Image monitoring apparatus | |
Islam et al. | Correlating belongings with passengers in a simulated airport security checkpoint | |
JP2021012657A (en) | Information processing apparatus, information processing method, and camera | |
Lin et al. | An abandoned objects management system based on the gaussian mixture model | |
Hongxing et al. | Facial area forecast and occluded face detection based on the YCbCr elliptical model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19825500 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19825500 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19825500 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 22.06.2021) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19825500 Country of ref document: EP Kind code of ref document: A1 |