WO2020001216A1 - Abnormal event detection - Google Patents

Abnormal event detection Download PDF

Info

Publication number
WO2020001216A1
WO2020001216A1 PCT/CN2019/088703 CN2019088703W WO2020001216A1 WO 2020001216 A1 WO2020001216 A1 WO 2020001216A1 CN 2019088703 W CN2019088703 W CN 2019088703W WO 2020001216 A1 WO2020001216 A1 WO 2020001216A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
foreground
targets
detection
frame
Prior art date
Application number
PCT/CN2019/088703
Other languages
French (fr)
Chinese (zh)
Inventor
邓亦梁
Original Assignee
杭州海康威视数字技术股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杭州海康威视数字技术股份有限公司 filed Critical 杭州海康威视数字技术股份有限公司
Publication of WO2020001216A1 publication Critical patent/WO2020001216A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30232Surveillance

Definitions

  • the present application relates to the field of image processing, and in particular, to a method, an apparatus, an electronic device, and a computer-readable storage medium for detecting an abnormal event.
  • ATMs Automatic Teller Machines
  • a protective cabin is installed outside the ATM.
  • some public telephones are also equipped with a protective compartment.
  • the protection cabin brings convenience to the depositor (or the user of the public telephone).
  • illegal and criminal activities in the protection cabin also occur from time to time, mainly including trailing robbery and criminals staying for a long time to cause damage.
  • users may leave their belongings in the protective cabin after completing the necessary business departure in the protective cabin.
  • the monitoring personnel can learn the three types of abnormal events in the protection cabin in time, the safety of the protection cabin and the user experience can be effectively improved.
  • embodiments of the present application provide a method, an apparatus, an electronic device, and a computer-readable storage medium for detecting an abnormal event, so as to accurately detect abnormal events such as a person's tail, a person's stay, and an item's left.
  • an embodiment of the present application provides a method for detecting an abnormal event, including: using a trained convolutional neural network CNN to obtain one or more feature targets from a monitored video stream, wherein the video stream It is a video stream obtained by monitoring a specified area, and the characteristic target represents a target person appearing in the specified area; using a preset foreground model for foreground detection to obtain one or more foreground targets from the video stream; based on The feature target and the foreground target determine one or more detection targets; track each of the detection targets by recording the presence of the detection target in the video stream to obtain a tracking result; and according to the tracking result Identify abnormal events.
  • an embodiment of the present application provides an abnormal event detection device, including: a first obtaining unit, configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN Wherein, the video stream is obtained by monitoring a specified area, and the characteristic target represents a target person appearing in the specified area; a second obtaining unit is configured to use a preset foreground model for foreground detection from the One or more foreground targets are obtained from the video stream; a first determining unit is configured to determine one or more detection targets based on the feature target and the foreground target; a second determining unit is configured to record the detection targets in the The existence of the video stream tracks each of the detection targets to obtain a tracking result, and determines an abnormal event based on the tracking result.
  • an embodiment of the present application provides an electronic device, including a processor, and a memory for storing executable instructions of the processor.
  • the processor is configured to execute the method for detecting an abnormal event according to the first aspect.
  • an embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the method for detecting an abnormal event according to the first aspect is implemented.
  • FIG. 1 is a flowchart of a method for detecting an abnormal event shown in the present application
  • FIG. 2 is a block diagram of an embodiment of an abnormal event detection device shown in the present application.
  • FIG. 3 is a hardware structural diagram of an electronic device shown in the present application.
  • a camera on the top of the protective cabin is used to record the monitoring video in the protective cabin in real time, and then a foreground model of the protective cabin is established, and a foreground target (people entering the protective cabin) is extracted based on the foreground model. Further, the moving foreground target is tracked to judge the entry and exit of the personnel in the protective cabin, so as to realize the judgment of abnormal events such as the following of the personnel and the retention of the personnel, and to alert the monitoring personnel.
  • the method of obtaining a foreground target by establishing a foreground model is susceptible to environmental influences and may cause misjudgment of abnormal events.
  • the discrimination between the foreground object and the background is low, the probability of detecting the foreground object is low.
  • the probability of detecting the foreground object is low. For example, when the light outside the protective cabin changes or the shadow of people outside the cabin is projected into the cabin, false foreground targets may be generated, causing misjudgment of anomalous events; and the color of the personnel ’s coat and the background in the protective cabin When the colors are similar, it is difficult to extract the foreground target.
  • a flowchart of a method for detecting an abnormal event shown in this application includes the following steps:
  • Step 101 Use the trained convolutional neural network CNN for feature detection to obtain one or more feature targets of the target person from the monitored video stream.
  • the video stream is a video stream obtained by monitoring a specified area.
  • the execution method of the above method may be an electronic device docked with a monitoring device (such as a monitoring camera).
  • the electronic device may be a hard disk video recorder.
  • the hard disk video recorder is taken as the main execution body in the following text.
  • CNNs Convolutional Neural Network
  • CNNs can identify the feature targets appearing in the video frames of the video stream through the training of human features in advance.
  • CNNs can be trained with the head and shoulders of a person, so that the CNN can identify the head and shoulder targets appearing in the video frame.
  • CNNs can also be trained with other personnel characteristics (for example, limbs and torso), which may be specifically based on the characteristics of personnel that can be easily monitored by the monitoring device.
  • the specified area may be any area where an abnormal event may occur, and the specified area is monitored by a monitoring device to generate a video stream.
  • the above-mentioned designated area may be the inside of the protection cabin.
  • the trained CNN is used to obtain the person's characteristic target from the video stream to detect the persons appearing in the video stream.
  • the hard disk video recorder can record the obtained coordinates of the upper-left corner of the target frame of the feature target in the feature target table, the width, the height, and the confidence that the feature target is a human feature.
  • the feature target table includes a mapping relationship between coordinates, width, height, and confidence of an upper left corner of a target frame of the feature target.
  • Step 102 Use a preset foreground model for foreground detection to obtain one or more foreground objects from the video stream.
  • the foreground models used for foreground detection include Gaussian Mixture Model (Vius) and ViBe (visual background extractor) algorithms.
  • the foregoing foreground model is established based on the RGB (Red Green Blue) information of the monitored specified area, and can be used to identify the foreground target appearing in the video frame of the video stream.
  • the foreground target refers to the target relative to the established foreground model.
  • the hard disk video recorder uses the trained foreground model to obtain a foreground target from the video stream.
  • the foreground target may be a body part of a person, an item, or even a part of the scene in the protective cabin (for example, an ATM machine illuminated by light may be identified as a foreground target). Therefore, further analysis and determination are needed in the future. The specifics of the prospects.
  • the hard disk video recorder can record the obtained coordinates and the width and height of the upper left corner of the target frame of the foreground target in the foreground target table.
  • the foregoing foreground target table includes a mapping relationship of coordinates, width, and height of an upper left corner of the foreground target.
  • Step 103 Determine one or more detection targets based on the feature target and the foreground target.
  • the hard disk video recorder may determine a detection target to be tracked subsequently based on the feature target and the foreground target. Specifically, the hard disk video recorder may select a foreground object of interest that is not associated with each characteristic object from the obtained foreground objects, and then determine each characteristic object and the foreground object of interest as detection targets.
  • the hard disk video recorder may first calculate, for each foreground target, the area of intersection between the target frame of the foreground target and the target frame of each characteristic target.
  • the position and area of the target frame of each foreground target in the video frame may be determined by the coordinates and width and height of the upper left corner of the target frame of each foreground target recorded in the foreground target table. And, the position and area of the target frame of each feature target in the video frame are determined by the coordinates and width and height of the upper left corner of the target frame of each feature target recorded in the feature target table. Further, for each foreground target, an area of intersection between the target frame of the foreground target and the target frame of each feature target is determined.
  • the hard disk video recorder can determine whether the area of the intersection reaches a preset area threshold.
  • the focus on the foreground target is not the person's body part.
  • the foreground target is associated with the feature target.
  • the foreground target is the body part of the person indicated by the characteristic target.
  • the hard disk video recorder may determine the foreground target of interest that is not associated with each characteristic target and each characteristic target as detection targets to track the above detection targets.
  • Step 104 Track each detection target by recording the existence of the detection target in the video stream, and determine an abnormal event according to the tracking result.
  • the hard disk video recorder can use multiple object tracking technology (Multiple Object Tracking / Multiple Target Tracking) to track the center point of the target frame of the detection target.
  • the center point of the target frame of the detection target may be determined based on the coordinates and width and height of the upper left corner of the target frame.
  • the hard disk video recorder can record the type of each detection target tracked, the historical coordinates of the center point of the target frame of each detection target, and the video frame identifier of the video frame where the detection target is located in the tracking table.
  • the historical coordinates are coordinates of a center point of a target frame of the detection target in each video frame of a video stream in which the detection target exists.
  • the hard disk video recorder will continuously record the coordinates of the center point of the target frame in the video frame of the detection target during the tracking process.
  • the tracking table includes the type of the detection target, the historical coordinates of the center point of the target frame of the detection target, and the mapping relationship of the video frame identifier of the video frame where the detection target is located.
  • the types of detection targets mentioned above include personnel and non-persons, where non-persons include objects and false foregrounds (such as silhouettes).
  • the hard disk video recorder may indicate that the type of the characteristic target is a person by recording the confidence level of the person's characteristics, and the confidence level of the foreground target is temporarily recorded as zero.
  • the video frame identifier may be a frame number of a video frame, and the frame number indicates a position of the video frame in the video stream, and frame numbers of two frames before and after the video frame are different by one. Therefore, in practical applications, the position of the video frame in which the detection target exists in the video stream can be determined by recording the frame number of the video frame in which the detection target is located.
  • the DVR determines abnormal events based on the tracking results.
  • the hard disk video recorder may convert a preset duration judgment threshold for several abnormal events into a count threshold for the number of video frames. For example, for a trailing event, the preset duration judgment threshold is 5 minutes. Since there are 25 frames per second, the converted trailing count threshold is 7500. For a detained event, the preset duration judgment threshold is 10 minutes. The retention count threshold value of 15,000 is 15,000; for the item leftover event, the preset duration judgment threshold value is 10 minutes, and the converted residual count threshold value is 15000.
  • the hard disk video recorder may determine the first target video frame based on the above-mentioned tracking table and according to the type of each detection target in the tracking result and the video frame identifier of the video frame in which it is located. Wherein, there is at least one characteristic target in the first target video frame. It is determined whether the number of the first target video frames reaches a preset retention count threshold. If so, determine that a detention event exists.
  • the hard disk video recorder tracks the detection target, as long as there is a tracking entry of at least one type of detection target in the tracking table, the number of video frames in which at least one person exists (the video frames described above) (That is, the first target video frame), and a retention count is obtained. Every time a new first target video frame is obtained, the hard disk video recorder may increase the above-mentioned retention count by one, and determine whether the retention count reaches the above-mentioned retention count threshold.
  • the stay count is continuously updated.
  • the hard disk video recorder can output stuck alarm information to the video surveillance personnel.
  • each of the first target video frames corresponding to the staying count is continuous in time.
  • the hard disk video recorder may determine the second target video frame based on the above tracking table, according to the type of each detection target in the tracking result, and the video frame identifier of the video frame in which it is located. There are at least two feature targets in the second target video frame. It is determined whether the number of the second target video frames reaches a preset trailing count threshold. If so, determine that a trailing event exists.
  • the hard disk video recorder tracks the detection target, if there are at least two detection target types in the tracking table, the number of video frames in which there are at least two persons can be counted (the above)
  • the video frame is the second target video frame), and the trailing count is obtained.
  • the hard disk video recorder may increase the above-mentioned trailing count by one, and determine whether the trailing count reaches the above-mentioned trailing count threshold.
  • the DVR can output a trailing alarm message to the video surveillance personnel.
  • each second target video frame corresponding to the above-mentioned trailing count is temporally continuous.
  • the retention count and the trailing count can be counted simultaneously without affecting each other.
  • the hard disk video recorder can determine the third target video frame based on the above tracking table, according to the type of each detection target in the tracking result, the video frame identification of the video frame where it is located, and the historical coordinates of the center point of the target frame of each detection target . Wherein, there is no feature target but a foreground target in the third target video frame, and the coordinates of the center point of the target frame of the foreground target in the third target video frame where the foreground target exists are located in a preset detection area. .
  • the hard disk video recorder tracks the above-mentioned detection target
  • the target can be determined.
  • the video frame is the third target video frame, it is counted to obtain a legacy count.
  • the above-mentioned preset detection area may be an area where the user easily leaves an item in an actual application environment. For example, for an ATM protective cabin, the above detection area may be an area close to the ATM.
  • the above-mentioned legacy count may be increased by one, and it is determined whether the legacy count reaches the aforementioned legacy count threshold.
  • Each third target video frame corresponding to the above-mentioned legacy count is continuous in time.
  • the above-mentioned legacy count does not reach the above-mentioned legacy count threshold, the above-mentioned legacy count is continuously updated.
  • the foreground target may be extracted from at least one third target video frame.
  • the hard disk video recorder may use the preset CNN classification model to classify the extracted foreground targets, and obtain the confidence that the foreground targets correspond to N different types of foreground targets.
  • N is an integer greater than 1
  • the N different foreground target types include at least items and non-items.
  • the types of foreground targets can include people, items, and non-items.
  • Non-items include false prospects.
  • the above-mentioned CNN classification model is trained in advance through human characteristics, items that may appear in a specified area, and background content of the specified area.
  • items include bank cards, keys, bags, umbrellas, luggage, etc.
  • the background content of the designated area includes the ground, ATM, posters posted in the protective cabin, and changes in light outside the protective cabin or shadows Background content when arriving in the cabin, etc.
  • the content of the foreground target can be more accurately identified, and the misjudgment of the event left by the item can be avoided.
  • the hard disk video recorder may determine the actual content of the foreground target based on the confidence corresponding to the person, the item, and the non-item.
  • the confidence level corresponding to the person is the largest, it means that although there are no feature targets in the current video frame, there are still people, and the above-mentioned legacy count can be cleared to zero;
  • the confidence level corresponding to the non-item is the largest, it means that the current third target video frame does not exist or the item, and the above-mentioned legacy count can be cleared to zero;
  • the hard disk video recorder can output alarms on items left to video surveillance personnel.
  • the hard disk video recorder can extract the feature target of the target person in the video stream through the CNN used for feature detection, and obtain the foreground target from the video stream through the foreground model used for foreground detection, and based on
  • the detection targets determined by the above characteristic targets and foreground targets can greatly reduce the impact of the environment, can more accurately identify the target person and item, improve the detection rate, and then by tracking the above detection targets, can effectively detect Anomalous events such as personnel tracking, detention of personnel and leftovers of items.
  • this application further provides an embodiment of a device for detecting an abnormal event.
  • FIG. 2 a block diagram of an embodiment of an abnormal event detection device is shown.
  • the abnormal event detection device 20 includes:
  • a first obtaining unit 210 is configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN, wherein the video stream is obtained by monitoring a specified area, and the feature target represents Target person appearing in the designated area.
  • the second obtaining unit 220 is configured to obtain one or more foreground objects from the video stream by using a preset foreground model for foreground detection.
  • the first determining unit 230 is configured to determine one or more detection targets based on the feature target and the foreground target.
  • a second determining unit 240 is configured to track each of the detection targets by recording the presence of the detection targets in the video stream to obtain a tracking result, and determine an abnormal event according to the tracking result.
  • the first determining unit 230 is further configured to: select a foreground object of interest that is not associated with each of the characteristic objects from the foreground objects; determine each of the characteristic object and the foreground object of interest Is the detection target.
  • the first determining unit 230 is further configured to calculate, for each foreground target, an area of intersection between a target frame of the foreground target and a target frame of each characteristic target.
  • the area of the intersection between the frame and the target frame of each feature target is less than a preset area threshold, then it is determined that the foreground target is not associated with each feature target.
  • the second determining unit 240 is further configured to record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is located, and the A video frame identifier of a video frame of the detection target exists in the video stream.
  • the second determining unit 240 is further configured to determine a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame; When the number of the first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists.
  • the second determining unit 240 is further configured to determine a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame; When the number of consecutive second target video frames associated with the at least two feature targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
  • the second determining unit 240 is further configured to determine a third target video frame in the video stream according to the tracking result; wherein the foreground target exists in the third target video frame, but no one exists.
  • the coordinates of the center point of the target frame of the characteristic target in the foreground target in the third target video frame in which the foreground target is located are within a preset detection area; in the continuous association with the foreground target,
  • the foreground target is extracted from at least one of the third target video frames;
  • the preset foreground target is extracted using a preset CNN classification model Classify to obtain the confidence that the foreground target corresponds to N different foreground target types, where N is an integer greater than 1, and the N different foreground target types include at least items and non-items; and if the foreground target corresponds to the confidence of the item If the degree is the largest, it is determined that there is an item leftover event.
  • the embodiment of the apparatus for detecting an abnormal event of the present application can be applied to an electronic device.
  • the device embodiments can be implemented by software, or by hardware or a combination of software and hardware. Taking software implementation as an example, as a device in a logical sense, it is formed by reading the corresponding computer program instructions in the non-volatile memory into the memory through the processor of the electronic device where it is located.
  • FIG. 3 it is a hardware structure diagram of the electronic device where the abnormal event detection device of this application is located, except for the processor, memory, network interface, and non-volatile memory shown in FIG. 3.
  • the electronic equipment in which the device is located in the embodiment may generally include other hardware according to the actual function of the detection device of the abnormal event, and details are not described herein again.
  • the memory and non-volatile memory of the electronic device are also equipped with machine-executable instructions corresponding to the first obtaining unit 210, machine-executable instructions corresponding to the second obtaining unit 220, and the first determining unit 230, respectively.
  • the relevant part may refer to the description of the method embodiment.
  • the device embodiments described above are only schematic, and the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located One place, or it can be distributed across multiple network elements. Some or all of these modules can be selected according to actual needs to achieve the purpose of the solution of this application. Those of ordinary skill in the art can understand and implement without creative efforts.
  • An embodiment of the present application further provides a computer-readable storage medium on which a computer program is stored.
  • the program is executed by a processor, the method for detecting an abnormal event according to the foregoing method embodiment is implemented.
  • the computer-readable storage medium includes a non-transitory computer-readable storage medium.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

The present application provides a method and apparatus for detecting an abnormal event, an electronic device, and a computer-readable storage medium. The method comprises: using a trained convolutional neural network (CNN) to acquire one or more feature targets from a monitored video stream, wherein the video stream is a video stream obtained by monitoring a specified region and the feature target represents a target person present in the specified region; using a preset foreground model for foreground detection to acquire one or more foreground targets from the video stream; identifying one or more detection targets on the basis of the feature target and the foreground target; tracking each detection target by recording the presence status of the detection target in the video stream, so as to obtain a tracking result; and identifying an abnormal event according to the tracking result.

Description

异常事件的检测Detection of abnormal events 技术领域Technical field
本申请涉及图像处理领域,特别涉及一种异常事件的检测方法、装置、电子设备及计算机可读存储介质。The present application relates to the field of image processing, and in particular, to a method, an apparatus, an electronic device, and a computer-readable storage medium for detecting an abnormal event.
背景技术Background technique
为方便储户存取款,各银行一般会安装ATM(Automated Teller Machine,自动柜员机)为储户提供24小时的自助服务。为保证储户能够在一个独立安全的操作空间内使用ATM,通常情况下,ATM外部会安装防护舱。此外,一些公用电话的外部也会安装防护舱。In order to facilitate depositors' deposits and withdrawals, banks generally install ATMs (Automated Teller Machines) to provide depositors with 24-hour self-service. To ensure that depositors can use ATM in an independent and safe operating space, usually, a protective cabin is installed outside the ATM. In addition, some public telephones are also equipped with a protective compartment.
防护舱给储户(或公用电话的用户)带来便利,然而,防护舱内的违法犯罪活动也时有发生,主要包括尾随抢劫、犯罪人员长时间滞留搞破坏等。此外,用户在防护舱内完成必要的业务离开后,可能会将自身携带的物品遗留在防护舱内。The protection cabin brings convenience to the depositor (or the user of the public telephone). However, illegal and criminal activities in the protection cabin also occur from time to time, mainly including trailing robbery and criminals staying for a long time to cause damage. In addition, users may leave their belongings in the protective cabin after completing the necessary business departure in the protective cabin.
如果监控人员能及时获知防护舱内发生的上述三类异常事件,可有效提升防护舱的安全性和用户体验。If the monitoring personnel can learn the three types of abnormal events in the protection cabin in time, the safety of the protection cabin and the user experience can be effectively improved.
发明内容Summary of the invention
有鉴于此,本申请实施例提供一种异常事件的检测方法、装置、电子设备及计算机可读存储介质,用以准确地检测出人员尾随、人员滞留和物品遗留等异常事件。In view of this, embodiments of the present application provide a method, an apparatus, an electronic device, and a computer-readable storage medium for detecting an abnormal event, so as to accurately detect abnormal events such as a person's tail, a person's stay, and an item's left.
第一方面,本申请实施例提供了一种异常事件的检测方法,包括:利用已训练的卷积神经网络CNN从监控到的视频流中获取一个或多个特征目标,其中,所述视频流是监控指定区域得到的视频流,所述特征目标表示出现在所述指定区域的目标人员;利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标;基于所述特征目标和所述前景目标确定一个或多个检测目标;通过记录所述检测目标在所述视频流中的存在情况跟踪每个所述检测目标,以获取跟踪结果;依据所述跟踪结果确定异常事件。In a first aspect, an embodiment of the present application provides a method for detecting an abnormal event, including: using a trained convolutional neural network CNN to obtain one or more feature targets from a monitored video stream, wherein the video stream It is a video stream obtained by monitoring a specified area, and the characteristic target represents a target person appearing in the specified area; using a preset foreground model for foreground detection to obtain one or more foreground targets from the video stream; based on The feature target and the foreground target determine one or more detection targets; track each of the detection targets by recording the presence of the detection target in the video stream to obtain a tracking result; and according to the tracking result Identify abnormal events.
第二方面,本申请实施例提供了一种异常事件的检测装置,包括:第一获取单元,用于利用已训练的卷积神经网络CNN从监控到的视频流中获取一个或多个特征目标, 其中,所述视频流是监控指定区域得到的,所述特征目标表示出现在所述指定区域的目标人员;第二获取单元,用于利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标;第一确定单元,用于基于所述特征目标和所述前景目标确定一个或多个检测目标;第二确定单元,用于通过记录所述检测目标在所述视频流中的存在情况跟踪每个所述检测目标,以获取跟踪结果,并依据所述跟踪结果确定异常事件。In a second aspect, an embodiment of the present application provides an abnormal event detection device, including: a first obtaining unit, configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN Wherein, the video stream is obtained by monitoring a specified area, and the characteristic target represents a target person appearing in the specified area; a second obtaining unit is configured to use a preset foreground model for foreground detection from the One or more foreground targets are obtained from the video stream; a first determining unit is configured to determine one or more detection targets based on the feature target and the foreground target; a second determining unit is configured to record the detection targets in the The existence of the video stream tracks each of the detection targets to obtain a tracking result, and determines an abnormal event based on the tracking result.
第三方面,本申请实施例提供了一种电子设备,包括处理器,以及用于存储所述处理器可执行指令的存储器。其中,所述处理器被配置为执行第一方面所述的异常事件的检测方法。According to a third aspect, an embodiment of the present application provides an electronic device, including a processor, and a memory for storing executable instructions of the processor. The processor is configured to execute the method for detecting an abnormal event according to the first aspect.
第四方面,本申请实施例提供了一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现第一方面所述的异常事件的检测方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the method for detecting an abnormal event according to the first aspect is implemented.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本申请示出的一种异常事件的检测方法的流程图;FIG. 1 is a flowchart of a method for detecting an abnormal event shown in the present application;
图2是本申请示出的一种异常事件的检测装置的实施例框图;FIG. 2 is a block diagram of an embodiment of an abnormal event detection device shown in the present application; FIG.
图3是本申请示出的一种电子设备的硬件结构图。FIG. 3 is a hardware structural diagram of an electronic device shown in the present application.
具体实施方式detailed description
为了使本技术领域的人员更好地理解本发明实施例中的技术方案,并使本发明实施例的上述目的、特征和优点能够更加明显易懂,下面结合附图对现有技术方案和本发明实施例中的技术方案作进一步详细的说明。In order to enable those skilled in the art to better understand the technical solutions in the embodiments of the present invention, and to make the foregoing objectives, features, and advantages of the embodiments of the present invention more comprehensible, the prior art solutions and the present invention are described below with reference to the accompanying drawings. The technical solutions in the embodiments of the invention will be further described in detail.
在一些实施例中,利用防护舱顶部的摄像头实时记录防护舱内的监控视频,然后建立防护舱的前景模型,并基于该前景模型提取前景目标(进入防护舱的人员)。进一步地,对移动的前景目标进行跟踪以判断防护舱内的人员进出情况,从而实现人员尾随和人员滞留等异常事件的判断,并向监控人员报警。In some embodiments, a camera on the top of the protective cabin is used to record the monitoring video in the protective cabin in real time, and then a foreground model of the protective cabin is established, and a foreground target (people entering the protective cabin) is extracted based on the foreground model. Further, the moving foreground target is tracked to judge the entry and exit of the personnel in the protective cabin, so as to realize the judgment of abnormal events such as the following of the personnel and the retention of the personnel, and to alert the monitoring personnel.
然而,通过建立前景模型获取前景目标的方法,容易受环境影响,有可能产生异常事件的误判。并且,在前景目标与背景的区分度低时,检出前景目标的概率较低。比如,当防护舱外光线变化或舱外人员的影子投射到舱内时,可能产生虚假的前景目标,造成异常事件的误判;而在舱内人员的衣帽的颜色与防护舱内的背景颜色近似时,难以提取出前景目标。However, the method of obtaining a foreground target by establishing a foreground model is susceptible to environmental influences and may cause misjudgment of abnormal events. In addition, when the discrimination between the foreground object and the background is low, the probability of detecting the foreground object is low. For example, when the light outside the protective cabin changes or the shadow of people outside the cabin is projected into the cabin, false foreground targets may be generated, causing misjudgment of anomalous events; and the color of the personnel ’s coat and the background in the protective cabin When the colors are similar, it is difficult to extract the foreground target.
参见图1,为本申请示出的一种异常事件的检测方法的流程图,包括以下步骤:Referring to FIG. 1, a flowchart of a method for detecting an abnormal event shown in this application includes the following steps:
步骤101:利用已训练的用于特征检测的卷积神经网络CNN从监控到的视频流中获取目标人员的一个或多个特征目标。其中,所述视频流是监控指定区域得到的视频流。Step 101: Use the trained convolutional neural network CNN for feature detection to obtain one or more feature targets of the target person from the monitored video stream. The video stream is a video stream obtained by monitoring a specified area.
上述方法的执行主体可以是与监控设备(比如:监控摄像机)对接的电子设备。在示出的一种实施方式中,该电子设备可以是硬盘录像机。为方便描述本申请方案,后文以硬盘录像机为执行主体。The execution method of the above method may be an electronic device docked with a monitoring device (such as a monitoring camera). In one embodiment shown, the electronic device may be a hard disk video recorder. In order to facilitate the description of the solution of the present application, the hard disk video recorder is taken as the main execution body in the following text.
上述用于特征检测的CNN(Convolutional Neural Network,卷积神经网络)预先通过人员特征的训练,可以识别出视频流的视频帧中出现的特征目标。在实际应用中,可以用人员的头肩对CNN进行训练,使得CNN可以识别视频帧中出现的头肩目标。当然,也可以用其它人员特征(比如,四肢和躯干)训练CNN,具体可基于监控设备容易监控到的人员特征而定。The above-mentioned CNN (Convolutional Neural Network) for feature detection can identify the feature targets appearing in the video frames of the video stream through the training of human features in advance. In practical applications, CNNs can be trained with the head and shoulders of a person, so that the CNN can identify the head and shoulder targets appearing in the video frame. Of course, CNNs can also be trained with other personnel characteristics (for example, limbs and torso), which may be specifically based on the characteristics of personnel that can be easily monitored by the monitoring device.
上述指定区域可以是任意可能发生异常事件的区域,上述指定区域由监控设备监控而产生视频流。对于防护舱场景而言,上述指定区域可以是防护舱的内部。The specified area may be any area where an abnormal event may occur, and the specified area is monitored by a monitoring device to generate a video stream. For the protection cabin scenario, the above-mentioned designated area may be the inside of the protection cabin.
硬盘录像机从监控设备获取视频流后,利用上述已训练的CNN从上述视频流中获取人员的特征目标,用以检测视频流中出现的人员。After the hard disk video recorder obtains the video stream from the monitoring device, the trained CNN is used to obtain the person's characteristic target from the video stream to detect the persons appearing in the video stream.
在实际应用中,硬盘录像机可以在特征目标表中记录所获取到的特征目标的目标框的左上角的坐标、宽、高和上述特征目标为人员特征的置信度。其中,上述特征目标表包括特征目标的目标框的左上角的坐标、宽、高和置信度的映射关系。In practical applications, the hard disk video recorder can record the obtained coordinates of the upper-left corner of the target frame of the feature target in the feature target table, the width, the height, and the confidence that the feature target is a human feature. The feature target table includes a mapping relationship between coordinates, width, height, and confidence of an upper left corner of a target frame of the feature target.
步骤102:利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标。Step 102: Use a preset foreground model for foreground detection to obtain one or more foreground objects from the video stream.
用于前景检测的前景模型包括高斯混合模型(Gaussian Mixture Model)和ViBe(visual background extractor,视觉背景提取)算法等。上述前景模型基于所监控的指定区域的RGB(Red Green Blue,红绿蓝)信息建立,可以用于识别视频流的视频帧中出现的前景目标。其中,前景目标指的是相对于已建立的前景模型而言的目标。The foreground models used for foreground detection include Gaussian Mixture Model (Vius) and ViBe (visual background extractor) algorithms. The foregoing foreground model is established based on the RGB (Red Green Blue) information of the monitored specified area, and can be used to identify the foreground target appearing in the video frame of the video stream. Among them, the foreground target refers to the target relative to the established foreground model.
硬盘录像机利用上述已训练的前景模型从上述视频流中获取前景目标。前景目标可能人员的某一身体部位、也可能是物品,甚至是防护舱内的场景的一部分(比如:被光线照射的ATM机可能会被识别为前景目标),因此,后续还需进一步分析确定前景目标的具体内容。The hard disk video recorder uses the trained foreground model to obtain a foreground target from the video stream. The foreground target may be a body part of a person, an item, or even a part of the scene in the protective cabin (for example, an ATM machine illuminated by light may be identified as a foreground target). Therefore, further analysis and determination are needed in the future. The specifics of the prospects.
在实际应用中,硬盘录像机可以在前景目标表中记录所获取到的前景目标的目标框的左上角的坐标和宽高。其中,上述前景目标表包括前景目标的左上角的坐标、宽、高的映射关系。In practical applications, the hard disk video recorder can record the obtained coordinates and the width and height of the upper left corner of the target frame of the foreground target in the foreground target table. The foregoing foreground target table includes a mapping relationship of coordinates, width, and height of an upper left corner of the foreground target.
步骤103:基于所述特征目标和所述前景目标确定一个或多个检测目标。Step 103: Determine one or more detection targets based on the feature target and the foreground target.
硬盘录像机可以基于上述特征目标和上述前景目标确定后续要跟踪的检测目标。具体地,硬盘录像机可以从获取的前景目标中选择与各个特征目标无关联的关注前景目标,然后将各个特征目标和关注前景目标确定为检测目标。The hard disk video recorder may determine a detection target to be tracked subsequently based on the feature target and the foreground target. Specifically, the hard disk video recorder may select a foreground object of interest that is not associated with each characteristic object from the obtained foreground objects, and then determine each characteristic object and the foreground object of interest as detection targets.
在示出的一种实施方式中,硬盘录像机可以首先针对每一前景目标,计算该前景目标的目标框与每一特征目标的目标框之间交集的面积。In an embodiment shown, the hard disk video recorder may first calculate, for each foreground target, the area of intersection between the target frame of the foreground target and the target frame of each characteristic target.
具体地,可以通过上述前景目标表中已记录的各前景目标的目标框的左上角的坐标和宽高,确定各前景目标的目标框在视频帧中的位置和面积。以及,通过上述特征目标表中已记录的各特征目标的目标框的左上角的坐标和宽高,确定各特征目标的目标框在视频帧中的位置和面积。进一步地,针对每一前景目标,确定该前景目标的目标框与每一特征目标的目标框之间交集的面积。Specifically, the position and area of the target frame of each foreground target in the video frame may be determined by the coordinates and width and height of the upper left corner of the target frame of each foreground target recorded in the foreground target table. And, the position and area of the target frame of each feature target in the video frame are determined by the coordinates and width and height of the upper left corner of the target frame of each feature target recorded in the feature target table. Further, for each foreground target, an area of intersection between the target frame of the foreground target and the target frame of each feature target is determined.
硬盘录像机可以判断交集的面积是否达到预设的面积阈值。The hard disk video recorder can determine whether the area of the intersection reaches a preset area threshold.
如果前景目标的目标框与每一特征目标的目标框之间的交集的面积均小于上述面积阈值,则确定该前景目标与各个特征目标无关联并且将该前景目标确定为关注前景目标。换而言之,该关注前景目标不是人员的身体部位。If the area of the intersection between the target frame of the foreground target and the target frame of each feature target is less than the above-mentioned area threshold, it is determined that the foreground target is not associated with each feature target and the foreground target is determined as the foreground target of interest. In other words, the focus on the foreground target is not the person's body part.
如果前景目标的目标框与某一特征目标的目标框之间的交集的面积不小于上述面积阈值,则确定该前景目标与上述特征目标有关联。换而言之,该前景目标为上述特征目标指示的人员的身体部位。If the area of the intersection between the target frame of the foreground target and the target frame of a feature target is not less than the area threshold, it is determined that the foreground target is associated with the feature target. In other words, the foreground target is the body part of the person indicated by the characteristic target.
硬盘录像机可以将与各个特征目标无关联的关注前景目标以及各个特征目标确定为检测目标,以对上述检测目标进行跟踪。The hard disk video recorder may determine the foreground target of interest that is not associated with each characteristic target and each characteristic target as detection targets to track the above detection targets.
步骤104:通过记录所述检测目标在所述视频流中的存在情况来跟踪每个检测目标,依据跟踪结果确定异常事件。Step 104: Track each detection target by recording the existence of the detection target in the video stream, and determine an abnormal event according to the tracking result.
硬盘录像机可以利用多目标跟踪技术(Multiple Object Tracking/Multiple Target Tracking)对上述检测目标的目标框的中心点进行跟踪。其中,上述检测目标的目标框的中心点可以基于目标框的左上角的坐标和宽高来确定。The hard disk video recorder can use multiple object tracking technology (Multiple Object Tracking / Multiple Target Tracking) to track the center point of the target frame of the detection target. The center point of the target frame of the detection target may be determined based on the coordinates and width and height of the upper left corner of the target frame.
硬盘录像机可以在跟踪表中记录所跟踪的各检测目标的类型、各检测目标的目标框的中心点的历史坐标和检测目标所处的视频帧的视频帧标识。其中,上述历史坐标为上述检测目标的目标框的中心点在存在上述检测目标的视频流的各视频帧中的坐标。对于每一检测目标而言,硬盘录像机在跟踪过程中,会持续将存在该检测目标的视频帧中的目标框的中心点的坐标记录下来。The hard disk video recorder can record the type of each detection target tracked, the historical coordinates of the center point of the target frame of each detection target, and the video frame identifier of the video frame where the detection target is located in the tracking table. The historical coordinates are coordinates of a center point of a target frame of the detection target in each video frame of a video stream in which the detection target exists. For each detection target, the hard disk video recorder will continuously record the coordinates of the center point of the target frame in the video frame of the detection target during the tracking process.
上述跟踪表包括检测目标的类型、检测目标的目标框的中心点的历史坐标、检测目标所处的视频帧的视频帧标识的映射关系。The tracking table includes the type of the detection target, the historical coordinates of the center point of the target frame of the detection target, and the mapping relationship of the video frame identifier of the video frame where the detection target is located.
上述检测目标的类型包括人员和非人员,其中,非人员包括物品和虚假前景(比如:人影)。由于前景目标的具体内容需进一步分析确定,作为一种实施例,硬盘录像机可以通过记录人员特征的置信度来表明特征目标的类型为人员,而前景目标的置信度暂记为零。The types of detection targets mentioned above include personnel and non-persons, where non-persons include objects and false foregrounds (such as silhouettes). As the specific content of the foreground target needs to be further analyzed and determined, as an embodiment, the hard disk video recorder may indicate that the type of the characteristic target is a person by recording the confidence level of the person's characteristics, and the confidence level of the foreground target is temporarily recorded as zero.
上述视频帧标识可以是视频帧的帧号,帧号表示视频帧在视频流中的位置,前后两帧视频帧的帧号相差一。因此,在实际应用中,可以通过记录检测目标所在的视频帧在视频流中的帧号,来确定存在上述检测目标所在的视频帧在视频流中的位置。The video frame identifier may be a frame number of a video frame, and the frame number indicates a position of the video frame in the video stream, and frame numbers of two frames before and after the video frame are different by one. Therefore, in practical applications, the position of the video frame in which the detection target exists in the video stream can be determined by recording the frame number of the video frame in which the detection target is located.
硬盘录像机依据跟踪结果确定异常事件。The DVR determines abnormal events based on the tracking results.
在示出的一种实施方式中,硬盘录像机可以将预设的针对几种异常事件的时长判断阈值换算成视频帧的数量的计数阈值。比如:对于尾随事件,预设的时长判断阈值为5分钟,由于每秒有25帧,因此,换算出的尾随计数阈值为7500;对于滞留事件,预设的时长判断阈值为10分钟,换算出的滞留计数阈值为15000;对于物品遗留事件,预设的时长判断阈值为10分钟,换算出的遗留计数阈值为15000。In one embodiment shown, the hard disk video recorder may convert a preset duration judgment threshold for several abnormal events into a count threshold for the number of video frames. For example, for a trailing event, the preset duration judgment threshold is 5 minutes. Since there are 25 frames per second, the converted trailing count threshold is 7500. For a detained event, the preset duration judgment threshold is 10 minutes. The retention count threshold value of 15,000 is 15,000; for the item leftover event, the preset duration judgment threshold value is 10 minutes, and the converted residual count threshold value is 15000.
针对滞留事件,硬盘录像机可以基于上述跟踪表,依据跟踪结果中各检测目标的类型、所处视频帧的视频帧标识确定第一目标视频帧。其中,上述第一目标视频帧中存在至少一个特征目标。确定上述第一目标视频帧的数量是否达到预设的滞留计数阈值。如果是,确定存在滞留事件。For the stuck event, the hard disk video recorder may determine the first target video frame based on the above-mentioned tracking table and according to the type of each detection target in the tracking result and the video frame identifier of the video frame in which it is located. Wherein, there is at least one characteristic target in the first target video frame. It is determined whether the number of the first target video frames reaches a preset retention count threshold. If so, determine that a detention event exists.
具体地,硬盘录像机在对上述检测目标进行跟踪时,只要上述跟踪表存在至少一个检测目标的类型为人员的跟踪表项,即可以对存在至少一个人员的视频帧的数量进行计数(上述视频帧即为第一目标视频帧),获得滞留计数。每获取到新的第一目标视频帧,硬盘录像机可以将上述滞留计数加一,并确定该滞留计数是否达到上述滞留计数阈值。Specifically, when the hard disk video recorder tracks the detection target, as long as there is a tracking entry of at least one type of detection target in the tracking table, the number of video frames in which at least one person exists (the video frames described above) (That is, the first target video frame), and a retention count is obtained. Every time a new first target video frame is obtained, the hard disk video recorder may increase the above-mentioned retention count by one, and determine whether the retention count reaches the above-mentioned retention count threshold.
如果上述滞留计数未达到上述滞留计数阈值,则继续更新上述滞留计数。If the stay count does not reach the stay count threshold, the stay count is continuously updated.
如果上述滞留计数达到上述滞留计数阈值,则确定存在滞留事件。在这种情况下,硬盘录像机可以向视频监控人员输出滞留报警信息。If the retention count reaches the retention count threshold, it is determined that a retention event exists. In this case, the hard disk video recorder can output stuck alarm information to the video surveillance personnel.
当然,若在上述滞留计数达到上述滞留计数阈值以前,上述特征目标从视频帧中消失,则可以将上述滞留计数清零。换而言之,上述滞留计数对应的各第一目标视频帧在时间上是连续的。Of course, if the characteristic target disappears from the video frame before the retention count reaches the retention count threshold, the retention count may be cleared to zero. In other words, each of the first target video frames corresponding to the staying count is continuous in time.
针对尾随事件,硬盘录像机可以基于上述跟踪表,依据跟踪结果中各检测目标的类型、所处视频帧的视频帧标识确定第二目标视频帧。其中,上述第二目标视频帧中存在至少两个特征目标。确定上述第二目标视频帧的数量是否达到预设的尾随计数阈值。如果是,确定存在尾随事件。For a trailing event, the hard disk video recorder may determine the second target video frame based on the above tracking table, according to the type of each detection target in the tracking result, and the video frame identifier of the video frame in which it is located. There are at least two feature targets in the second target video frame. It is determined whether the number of the second target video frames reaches a preset trailing count threshold. If so, determine that a trailing event exists.
具体地,硬盘录像机在对上述检测目标进行跟踪时,如果上述跟踪表存在至少两个检测目标的类型为人员的跟踪表项,则可以对存在至少两个人员的视频帧的数量进行计数(上述视频帧即为第二目标视频帧),获得尾随计数。每获取到新的第二目标视频帧,硬盘录像机可以将上述尾随计数加一,并确定该尾随计数是否达到上述尾随计数阈值。Specifically, when the hard disk video recorder tracks the detection target, if there are at least two detection target types in the tracking table, the number of video frames in which there are at least two persons can be counted (the above) The video frame is the second target video frame), and the trailing count is obtained. Each time a new second target video frame is obtained, the hard disk video recorder may increase the above-mentioned trailing count by one, and determine whether the trailing count reaches the above-mentioned trailing count threshold.
如果上述尾随计数未达到上述尾随计数阈值,则继续更新上述尾随计数。If the above-mentioned trailing count does not reach the above-mentioned trailing count threshold, then the above-mentioned trailing count is continuously updated.
如果上述尾随计数达到上述尾随计数阈值,则确定存在尾随事件。在这种情况下,硬盘录像机可以向视频监控人员输出尾随报警信息。If the aforementioned trailing count reaches the aforementioned trailing count threshold, it is determined that a trailing event exists. In this case, the DVR can output a trailing alarm message to the video surveillance personnel.
当然,若在上述尾随计数达到上述尾随计数阈值以前,上述特征目标从视频帧中消失,则可以将上述尾随计数清零。换而言之,上述尾随计数对应的各第二目标视频帧在时间上是连续的。Of course, if the feature target disappears from the video frame before the trailing count reaches the threshold of the trailing count, the trailing count can be cleared to zero. In other words, each second target video frame corresponding to the above-mentioned trailing count is temporally continuous.
滞留计数和尾随计数可以同时统计,互相无影响。The retention count and the trailing count can be counted simultaneously without affecting each other.
针对物品遗留事件,硬盘录像机可以基于上述跟踪表,依据跟踪结果中各检测目标的类型、所处视频帧的视频帧标识和各检测目标的目标框的中心点的历史坐标确定第三目标视频帧。其中,上述第三目标视频帧中不存在特征目标但存在前景目标,其上述前景目标的目标框的中心点在存在所述前景目标的第三目标视频帧中的坐标位于预设的检测区域内。For the item leftover event, the hard disk video recorder can determine the third target video frame based on the above tracking table, according to the type of each detection target in the tracking result, the video frame identification of the video frame where it is located, and the historical coordinates of the center point of the target frame of each detection target . Wherein, there is no feature target but a foreground target in the third target video frame, and the coordinates of the center point of the target frame of the foreground target in the third target video frame where the foreground target exists are located in a preset detection area. .
确定上述第三目标视频帧的个数是否达到预设的遗留计数阈值。如果是,从至少一个第三目标视频帧中提取前景目标。It is determined whether the number of the third target video frames reaches a preset legacy count threshold. If so, extract a foreground target from at least one third target video frame.
具体地,硬盘录像机在对上述检测目标进行跟踪时,如果获取到的视频帧中失去特 征目标,且该视频帧中存在目标框的中心点位于预设的检测区域内的前景目标,可确定该视频帧为第三目标视频帧,则进行计数,获得遗留计数。其中,上述预设的检测区域内可以是实际应用环境中,用户容易留下物品的区域。比如,对于ATM防护舱而言,上述检测区域可以是靠近ATM的区域。每获取新的第三目标视频帧,可以将上述遗留计数加一,并确定该遗留计数是否达到上述遗留计数阈值。上述遗留计数对应的各第三目标视频帧在时间上是连续的。Specifically, when the hard disk video recorder tracks the above-mentioned detection target, if the acquired video frame loses a characteristic target, and there is a foreground target in the video frame whose center point is located in a preset detection area, the target can be determined. If the video frame is the third target video frame, it is counted to obtain a legacy count. The above-mentioned preset detection area may be an area where the user easily leaves an item in an actual application environment. For example, for an ATM protective cabin, the above detection area may be an area close to the ATM. Each time a new third target video frame is acquired, the above-mentioned legacy count may be increased by one, and it is determined whether the legacy count reaches the aforementioned legacy count threshold. Each third target video frame corresponding to the above-mentioned legacy count is continuous in time.
如果上述遗留计数未达到上述遗留计数阈值,则继续更新上述遗留计数。If the above-mentioned legacy count does not reach the above-mentioned legacy count threshold, the above-mentioned legacy count is continuously updated.
如果上述遗留计数达到上述遗留计数阈值,则可以将上述前景目标从至少一个第三目标视频帧中抠出。If the legacy count reaches the threshold of the legacy count, the foreground target may be extracted from at least one third target video frame.
进一步地,硬盘录像机可以利用预设的CNN分类模型对提取出的上述前景目标进行分类,获得上述前景目标对应于N种不同前景目标类型的置信度。其中,N是大于1的整数,N种不同前景目标类型至少包括物品和非物品。Further, the hard disk video recorder may use the preset CNN classification model to classify the extracted foreground targets, and obtain the confidence that the foreground targets correspond to N different types of foreground targets. Among them, N is an integer greater than 1, and the N different foreground target types include at least items and non-items.
如果所述前景目标对应于物品的置信度最大,则确定存在物品遗留事件。If the foreground object corresponds to the item with the greatest confidence, it is determined that an item legacy event exists.
在实际应用中,前景目标类型可以包括人员、物品和非物品。非物品包括虚假前景。在这种情况下,上述CNN分类模型预先通过人员特征、指定区域内可能出现的物品和指定区域的背景内容进行训练。比如:对于防护舱场景而言,物品包括银行卡、钥匙、包、雨伞、行李箱等,指定区域的背景内容包括地面、ATM、防护舱内张贴的海报以及防护舱外光线变化或有影子投射到舱内时的背景内容等。In practical applications, the types of foreground targets can include people, items, and non-items. Non-items include false prospects. In this case, the above-mentioned CNN classification model is trained in advance through human characteristics, items that may appear in a specified area, and background content of the specified area. For example, in the case of a protective cabin, items include bank cards, keys, bags, umbrellas, luggage, etc. The background content of the designated area includes the ground, ATM, posters posted in the protective cabin, and changes in light outside the protective cabin or shadows Background content when arriving in the cabin, etc.
通过上述CNN分类模型对前景目标的进一步区分,可以更准确地识别前景目标的内容,避免了物品遗留事件的误判。By further distinguishing the foreground target by the above-mentioned CNN classification model, the content of the foreground target can be more accurately identified, and the misjudgment of the event left by the item can be avoided.
具体地,硬盘录像机可以基于对应于人员、物品和非物品的置信度,确定上述前景目标实际的内容。Specifically, the hard disk video recorder may determine the actual content of the foreground target based on the confidence corresponding to the person, the item, and the non-item.
如果对应于人员的置信度最大,则说明虽然当前视频帧中不存在特征目标,仍存在人员,可以将上述遗留计数清零;If the confidence level corresponding to the person is the largest, it means that although there are no feature targets in the current video frame, there are still people, and the above-mentioned legacy count can be cleared to zero;
如果对应于非物品的置信度最大,则说明当前第三目标视频帧中不存在或物品,可以将上述遗留计数清零;If the confidence level corresponding to the non-item is the largest, it means that the current third target video frame does not exist or the item, and the above-mentioned legacy count can be cleared to zero;
如果对应于物品的置信度最大,则确定存在物品遗留事件。在这种情况下,硬盘录像机可以向视频监控人员输出物品遗留报警信息。If the confidence level corresponding to the item is greatest, it is determined that there is an item legacy event. In this case, the hard disk video recorder can output alarms on items left to video surveillance personnel.
综上所述,在本申请实施例中,硬盘录像机可以通过用于特征检测的CNN提取视频流中目标人员的特征目标,通过用于前景检测的前景模型从视频流中获取前景目标,并基于上述特征目标和前景目标确定检测目标,可极大地减小受环境的影响,可以更准确地识别出目标人员和物品,提高了检出率,进而通过跟踪上述检测目标,可以有效地检测出包括人员尾随、人员滞留和物品遗留等异常事件。To sum up, in the embodiment of the present application, the hard disk video recorder can extract the feature target of the target person in the video stream through the CNN used for feature detection, and obtain the foreground target from the video stream through the foreground model used for foreground detection, and based on The detection targets determined by the above characteristic targets and foreground targets can greatly reduce the impact of the environment, can more accurately identify the target person and item, improve the detection rate, and then by tracking the above detection targets, can effectively detect Anomalous events such as personnel tracking, detention of personnel and leftovers of items.
与前述异常事件的检测方法的实施例相对应,本申请还提供了异常事件的检测装置的实施例。Corresponding to the foregoing embodiment of the method for detecting an abnormal event, this application further provides an embodiment of a device for detecting an abnormal event.
参见图2,为本申请示出的一种异常事件的检测装置的实施例框图。Referring to FIG. 2, a block diagram of an embodiment of an abnormal event detection device is shown.
如图2所示,该异常事件的检测装置20,包括:As shown in FIG. 2, the abnormal event detection device 20 includes:
第一获取单元210,用于利用已训练的卷积神经网络CNN从监控到的视频流中获取一个或多个特征目标,其中,所述视频流是监控指定区域得到的,所述特征目标表示出现在所述指定区域的目标人员。A first obtaining unit 210 is configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN, wherein the video stream is obtained by monitoring a specified area, and the feature target represents Target person appearing in the designated area.
第二获取单元220,用于利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标。The second obtaining unit 220 is configured to obtain one or more foreground objects from the video stream by using a preset foreground model for foreground detection.
第一确定单元230,用于基于所述特征目标和所述前景目标确定一个或多个检测目标。The first determining unit 230 is configured to determine one or more detection targets based on the feature target and the foreground target.
第二确定单元240,用于通过记录所述检测目标在所述视频流中的存在情况跟踪每个所述检测目标,以获取跟踪结果,并依据所述跟踪结果确定异常事件。A second determining unit 240 is configured to track each of the detection targets by recording the presence of the detection targets in the video stream to obtain a tracking result, and determine an abnormal event according to the tracking result.
在本例中,所述第一确定单元230,进一步用于:从所述前景目标中选择与各个所述特征目标无关联的关注前景目标;将各个所述特征目标和所述关注前景目标确定为所述检测目标。In this example, the first determining unit 230 is further configured to: select a foreground object of interest that is not associated with each of the characteristic objects from the foreground objects; determine each of the characteristic object and the foreground object of interest Is the detection target.
在本例中,所述第一确定单元230,进一步用于:针对每一前景目标,计算该前景目标的目标框与每一特征目标的目标框之间交集的面积,若该前景目标的目标框与每一特征目标的目标框之间的交集的面积均小于预设的面积阈值,则确定该前景目标与各个特征目标无关联。In this example, the first determining unit 230 is further configured to calculate, for each foreground target, an area of intersection between a target frame of the foreground target and a target frame of each characteristic target. The area of the intersection between the frame and the target frame of each feature target is less than a preset area threshold, then it is determined that the foreground target is not associated with each feature target.
在本例中,所述第二确定单元240,进一步用于:记录所述检测目标的类型、所述检测目标的目标框的中心点在存在所述检测目标的各视频帧中的坐标以及所述视频流中存在所述检测目标的视频帧的视频帧标识。In this example, the second determining unit 240 is further configured to record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is located, and the A video frame identifier of a video frame of the detection target exists in the video stream.
所述第二确定单元240进一步用于:依据所述跟踪结果确定所述视频流中的第一目标视频帧,其中,所述第一目标视频帧中存在至少一个所述特征目标;在连续的与所述至少一个特征目标关联的所述第一目标视频帧的个数达到预设的滞留计数阈值时,确定存在滞留事件。所述第二确定单元240还进一步用于:依据所述跟踪结果确定所述视频流中的第二目标视频帧,其中,所述第二目标视频帧中存在至少两个所述特征目标;在连续的与所述至少两个特征目标关联的所述第二目标视频帧的数量达到预设的尾随计数阈值时,确定存在尾随事件。The second determining unit 240 is further configured to determine a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame; When the number of the first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists. The second determining unit 240 is further configured to determine a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame; When the number of consecutive second target video frames associated with the at least two feature targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
所述第二确定单元240进一步用于:依据所述跟踪结果确定所述视频流中的第三目标视频帧;其中,所述第三目标视频帧中存在所述前景目标、但不存在任一所述特征目标,所述前景目标的目标框的中心点在存在所述前景目标的第三目标视频帧中的坐标位于预设的检测区域内;在连续的与所述前景目标关联的所述第三目标视频帧的个数达到预设的遗留计数阈值时,从至少一个所述第三目标视频帧中提取所述前景目标;利用预设的CNN分类模型对提取出的所述前景目标进行分类,获得所述前景目标对应于N种不同前景目标类型的置信度,N是大于1的整数,N种不同前景目标类型至少包括物品、非物品;和如果所述前景目标对应于物品的置信度最大,则确定存在物品遗留事件。The second determining unit 240 is further configured to determine a third target video frame in the video stream according to the tracking result; wherein the foreground target exists in the third target video frame, but no one exists. The coordinates of the center point of the target frame of the characteristic target in the foreground target in the third target video frame in which the foreground target is located are within a preset detection area; in the continuous association with the foreground target, When the number of the third target video frames reaches a preset legacy count threshold, the foreground target is extracted from at least one of the third target video frames; the preset foreground target is extracted using a preset CNN classification model Classify to obtain the confidence that the foreground target corresponds to N different foreground target types, where N is an integer greater than 1, and the N different foreground target types include at least items and non-items; and if the foreground target corresponds to the confidence of the item If the degree is the largest, it is determined that there is an item leftover event.
本申请异常事件的检测装置的实施例可以应用在电子设备上。装置实施例可以通过软件实现,也可以通过硬件或者软硬件结合的方式实现。以软件实现为例,作为一个逻辑意义上的装置,是通过其所在电子设备的处理器将非易失性存储器中对应的计算机程序指令读取到内存中运行形成的。The embodiment of the apparatus for detecting an abnormal event of the present application can be applied to an electronic device. The device embodiments can be implemented by software, or by hardware or a combination of software and hardware. Taking software implementation as an example, as a device in a logical sense, it is formed by reading the corresponding computer program instructions in the non-volatile memory into the memory through the processor of the electronic device where it is located.
从硬件层面而言,如图3所示,为本申请异常事件的检测装置所在电子设备的一种硬件结构图,除了图3所示的处理器、内存、网络接口、以及非易失性存储器之外,实施例中装置所在的电子设备通常根据该异常事件的检测装置的实际功能,还可以包括其他硬件,对此不再赘述。其中,该电子设备的内存和非易失性存储器中还分别搭载了上述第一获取单元210对应的机器可执行指令、上述第二获取单元220对应的机器可执行指令、上述第一确定单元230对应的机器可执行指令和上述第二确定单元240对应的机器可执行指令。In terms of hardware, as shown in FIG. 3, it is a hardware structure diagram of the electronic device where the abnormal event detection device of this application is located, except for the processor, memory, network interface, and non-volatile memory shown in FIG. 3. In addition, the electronic equipment in which the device is located in the embodiment may generally include other hardware according to the actual function of the detection device of the abnormal event, and details are not described herein again. The memory and non-volatile memory of the electronic device are also equipped with machine-executable instructions corresponding to the first obtaining unit 210, machine-executable instructions corresponding to the second obtaining unit 220, and the first determining unit 230, respectively. The corresponding machine-executable instructions and the machine-executable instructions corresponding to the second determining unit 240 described above.
上述装置中各个单元的功能和作用的实现过程具体详见上述方法中对应步骤的实现过程,在此不再赘述。For details about the implementation process of the functions and functions of the units in the above device, refer to the implementation process of the corresponding steps in the foregoing method for details, and details are not described herein again.
对于装置实施例而言,由于其基本对应于方法实施例,所以相关之处参见方法实施例的部分说明即可。以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件 说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本申请方案的目的。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。As for the device embodiment, since it basically corresponds to the method embodiment, the relevant part may refer to the description of the method embodiment. The device embodiments described above are only schematic, and the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located One place, or it can be distributed across multiple network elements. Some or all of these modules can be selected according to actual needs to achieve the purpose of the solution of this application. Those of ordinary skill in the art can understand and implement without creative efforts.
本申请实施例还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述方法实施例所述的异常事件的检测方法。在一实施例中,该计算机可读存储介质包括非暂态计算机可读存储介质。An embodiment of the present application further provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the method for detecting an abnormal event according to the foregoing method embodiment is implemented. In one embodiment, the computer-readable storage medium includes a non-transitory computer-readable storage medium.
以上所述仅为本申请的较佳实施例而已,并不用以限制本申请,凡在本申请的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本申请保护的范围之内。The above are only preferred embodiments of this application, and are not intended to limit this application. Any modification, equivalent replacement, or improvement made within the spirit and principle of this application shall be included in this application Within the scope of protection.

Claims (16)

  1. 一种异常事件的检测方法,包括:A method for detecting abnormal events, including:
    利用已训练的卷积神经网络CNN从视频流中获取一个或多个特征目标,其中,所述视频流是监控指定区域得到的,所述特征目标表示出现在所述指定区域的目标人员;Using a trained convolutional neural network CNN to obtain one or more feature targets from a video stream, wherein the video stream is obtained by monitoring a designated area, and the feature target represents a target person appearing in the designated area;
    利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标;Obtaining one or more foreground objects from the video stream by using a preset foreground model for foreground detection;
    基于所述特征目标和所述前景目标确定一个或多个检测目标;Determining one or more detection targets based on the feature target and the foreground target;
    通过记录所述检测目标在所述视频流中的存在情况来跟踪每个所述检测目标,以获取跟踪结果;Track each detection target by recording the existence of the detection target in the video stream to obtain a tracking result;
    依据所述跟踪结果确定异常事件。An abnormal event is determined according to the tracking result.
  2. 根据权利要求1所述的方法,其特征在于,基于所述特征目标和所述前景目标确定所述检测目标,包括:The method according to claim 1, wherein determining the detection target based on the characteristic target and the foreground target comprises:
    从所述前景目标中选择与各个所述特征目标无关联的关注前景目标;Selecting a foreground target of interest that is not associated with each of the characteristic targets from the foreground targets;
    将各个所述特征目标和所述关注前景目标确定为所述检测目标。Each of the feature target and the attention foreground target is determined as the detection target.
  3. 根据权利要求2所述的方法,其特征在于,从所述前景目标中选择与各个所述特征目标无关联的关注前景目标,包括:The method according to claim 2, wherein selecting the foreground object of interest that is not associated with each of the characteristic objects from the foreground objects comprises:
    针对每一所述前景目标,For each stated goal,
    计算该前景目标的目标框与每一个所述特征目标的目标框之间交集的面积;Calculating the area of intersection between the target frame of the foreground target and the target frame of each of the characteristic targets;
    若该前景目标的目标框与每一个所述特征目标的目标框之间的交集的面积均小于预设的面积阈值,则确定该前景目标与各个所述特征目标无关联。If the area of the intersection between the target frame of the foreground target and the target frame of each of the feature targets is less than a preset area threshold, it is determined that the foreground target is not associated with each of the feature targets.
  4. 根据权利要求1所述的方法,其特征在于,记录所述检测目标在所述视频流中的存在情况,包括:The method according to claim 1, wherein recording the existence of the detection target in the video stream comprises:
    记录所述检测目标的类型、所述检测目标的目标框的中心点在存在所述检测目标的各视频帧中的坐标以及所述视频流中存在所述检测目标的视频帧的视频帧标识。Record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is present, and the video frame identifier of the video frame in which the detection target is present in the video stream.
  5. 根据权利要求4所述的方法,其特征在于,依据所述跟踪结果确定异常事件,包括:The method according to claim 4, wherein determining an abnormal event according to the tracking result comprises:
    依据所述跟踪结果确定所述视频流中的第一目标视频帧,其中,所述第一目标视频帧中存在至少一个所述特征目标;Determining a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame;
    在连续的与所述至少一个特征目标关联的所述第一目标视频帧的个数达到预设的滞留计数阈值时,确定存在滞留事件。When the number of consecutive first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists.
  6. 根据权利要求4所述的方法,其特征在于,依据所述跟踪结果确定异常事件,包括:The method according to claim 4, wherein determining an abnormal event according to the tracking result comprises:
    依据所述跟踪结果确定所述视频流中的第二目标视频帧,其中,所述第二目标视频帧中存在至少两个所述特征目标;Determining a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame;
    在连续的与所述至少两个特征目标关联的所述第二目标视频帧的个数达到预设的尾随计数阈值时,确定存在尾随事件。When the number of consecutive second target video frames associated with the at least two characteristic targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
  7. 根据权利要求4所述的方法,其特征在于,依据所述跟踪结果确定异常事件,包括:The method according to claim 4, wherein determining an abnormal event according to the tracking result comprises:
    依据所述跟踪结果确定所述视频流中的第三目标视频帧,其中,所述第三目标视频帧中存在所述前景目标、但不存在任一所述特征目标,所述前景目标的目标框的中心点在存在所述前景目标的第三目标视频帧中的坐标位于预设的检测区域内;Determining a third target video frame in the video stream according to the tracking result, wherein the foreground target exists in the third target video frame, but there is no one of the characteristic targets, and the target of the foreground target The coordinates of the center point of the frame in the third target video frame where the foreground target is located are within a preset detection area;
    在连续的与所述前景目标关联的所述第三目标视频帧的个数达到预设的遗留计数阈值时,从至少一个所述第三目标视频帧中提取所述前景目标;Extracting the foreground target from at least one third target video frame when the number of consecutive third target video frames associated with the foreground target reaches a preset legacy count threshold;
    利用预设的CNN分类模型对提取出的所述前景目标进行分类,获得所述前景目标对应于N种不同前景目标类型的置信度,N是大于1的整数,N种不同前景目标类型至少包括物品、非物品;The preset CNN classification model is used to classify the extracted foreground targets to obtain the confidence that the foreground targets correspond to N different types of foreground targets, where N is an integer greater than 1, and the N different types of foreground targets include Articles and non-items;
    如果所述前景目标对应于物品的置信度最大,则确定存在物品遗留事件。If the foreground object corresponds to the item with the greatest confidence, it is determined that an item legacy event exists.
  8. 一种异常事件的检测装置,包括:An abnormal event detection device includes:
    第一获取单元,用于利用已训练的卷积神经网络CNN从监控到的视频流中获取一个或多个特征目标,其中,所述视频流是监控指定区域得到的,所述特征目标表示出现在所述指定区域的目标人员;A first obtaining unit is configured to obtain one or more feature targets from a monitored video stream by using a trained convolutional neural network CNN, wherein the video stream is obtained by monitoring a specified area, and the feature targets indicate occurrence Target personnel in said designated area;
    第二获取单元,用于利用预设的用于前景检测的前景模型从所述视频流中获取一个或多个前景目标;A second obtaining unit, configured to obtain one or more foreground objects from the video stream by using a preset foreground model for foreground detection;
    第一确定单元,用于基于所述特征目标和所述前景目标确定一个或多个检测目标;A first determining unit, configured to determine one or more detection targets based on the characteristic target and the foreground target;
    第二确定单元,用于通过记录所述检测目标在所述视频流中的存在情况跟踪每个所述检测目标,以获取跟踪结果,并依据所述跟踪结果确定异常事件。A second determining unit is configured to track each detection target by recording the presence of the detection target in the video stream to obtain a tracking result, and determine an abnormal event according to the tracking result.
  9. 根据权利要求8所述的装置,其特征在于,所述第一确定单元,进一步用于:The apparatus according to claim 8, wherein the first determining unit is further configured to:
    从所述前景目标中选择与各个所述特征目标无关联的关注前景目标;Selecting a foreground target of interest that is not associated with each of the characteristic targets from the foreground targets;
    将各个所述特征目标和所述关注前景目标确定为所述检测目标。Each of the feature target and the attention foreground target is determined as the detection target.
  10. 根据权利要求9所述的装置,其特征在于,所述第一确定单元,进一步用于:The apparatus according to claim 9, wherein the first determining unit is further configured to:
    针对每一所述前景目标,For each stated goal,
    计算该前景目标的目标框与每一个所述特征目标的目标框之间交集的面积;Calculating the area of intersection between the target frame of the foreground target and the target frame of each of the characteristic targets;
    若该前景目标的目标框与每一个所述特征目标的目标框之间的交集的面积均 小于预设的面积阈值,则确定该前景目标与各个所述特征目标无关联。If the area of the intersection between the target frame of the foreground target and the target frame of each of the characteristic targets is less than a preset area threshold, it is determined that the foreground target is not associated with each of the characteristic targets.
  11. 根据权利要求8所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 8, wherein the second determining unit is further configured to:
    记录所述检测目标的类型、所述检测目标的目标框的中心点在存在所述检测目标的各视频帧中的坐标以及所述视频流中存在所述检测目标的视频帧的视频帧标识。Record the type of the detection target, the coordinates of the center point of the target frame of the detection target in each video frame in which the detection target is present, and the video frame identifier of the video frame in which the detection target is present in the video stream.
  12. 根据权利要求11所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 11, wherein the second determining unit is further configured to:
    依据所述跟踪结果确定所述视频流中的第一目标视频帧,其中,所述第一目标视频帧中存在至少一个所述特征目标;Determining a first target video frame in the video stream according to the tracking result, wherein at least one of the characteristic targets exists in the first target video frame;
    在连续的与所述至少一个特征目标关联的所述第一目标视频帧的个数达到预设的滞留计数阈值时,确定存在滞留事件。When the number of consecutive first target video frames associated with the at least one characteristic target reaches a preset retention count threshold, it is determined that a retention event exists.
  13. 根据权利要求11所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 11, wherein the second determining unit is further configured to:
    依据所述跟踪结果确定所述视频流中的第二目标视频帧,其中,所述第二目标视频帧中存在至少两个所述特征目标;Determining a second target video frame in the video stream according to the tracking result, wherein at least two of the characteristic targets exist in the second target video frame;
    在连续的与所述至少两个特征目标关联的所述第二目标视频帧的数量达到预设的尾随计数阈值时,确定存在尾随事件。When the number of consecutive second target video frames associated with the at least two feature targets reaches a preset trailing count threshold, it is determined that a trailing event exists.
  14. 根据权利要求11所述的装置,其特征在于,所述第二确定单元,进一步用于:The apparatus according to claim 11, wherein the second determining unit is further configured to:
    依据所述跟踪结果确定所述视频流中的第三目标视频帧;其中,所述第三目标视频帧中存在所述前景目标、但不存在任一所述特征目标,所述前景目标的目标框的中心点在存在所述前景目标的第三目标视频帧中的坐标位于预设的检测区域内;Determining a third target video frame in the video stream according to the tracking result; wherein the foreground target exists in the third target video frame, but none of the characteristic targets exists, and the target of the foreground target The coordinates of the center point of the frame in the third target video frame where the foreground target is located are within a preset detection area;
    在连续的与所述前景目标关联的所述第三目标视频帧的个数达到预设的遗留计数阈值时,从至少一个所述第三目标视频帧中提取所述前景目标;Extracting the foreground target from at least one third target video frame when the number of consecutive third target video frames associated with the foreground target reaches a preset legacy count threshold;
    利用预设的CNN分类模型对提取出的所述前景目标进行分类,获得所述前景目标对应于N种不同前景目标类型的置信度,N是大于1的整数,N种不同前景目标类型至少包括物品、非物品;The preset CNN classification model is used to classify the extracted foreground targets to obtain the confidence that the foreground targets correspond to N different types of foreground targets, where N is an integer greater than 1, and the N different types of foreground targets include Articles and non-items;
    如果所述前景目标对应于物品的置信度最大,则确定存在物品遗留事件。If the foreground object corresponds to the item with the greatest confidence, it is determined that an item legacy event exists.
  15. 一种电子设备,包括:An electronic device includes:
    处理器,以及,Processors, and,
    用于存储所述处理器可执行指令的存储器;A memory for storing the processor-executable instructions;
    其中,所述处理器被配置为执行权利要求1-7中任一项所述的异常事件的检测方法。The processor is configured to execute the method for detecting an abnormal event according to any one of claims 1-7.
  16. 一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现权利要求1-7中任一项所述的异常事件的检测方法。A computer-readable storage medium having stored thereon a computer program that, when executed by a processor, implements the method for detecting an abnormal event according to any one of claims 1-7.
PCT/CN2019/088703 2018-06-26 2019-05-28 Abnormal event detection WO2020001216A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810672457.5 2018-06-26
CN201810672457.5A CN110648352B (en) 2018-06-26 2018-06-26 Abnormal event detection method and device and electronic equipment

Publications (1)

Publication Number Publication Date
WO2020001216A1 true WO2020001216A1 (en) 2020-01-02

Family

ID=68985488

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/088703 WO2020001216A1 (en) 2018-06-26 2019-05-28 Abnormal event detection

Country Status (2)

Country Link
CN (1) CN110648352B (en)
WO (1) WO2020001216A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259827A (en) * 2020-01-19 2020-06-09 河海大学 Automatic detection method and device for water surface floating objects for urban river supervision
CN111860140A (en) * 2020-06-10 2020-10-30 北京迈格威科技有限公司 Target event detection method and device, computer equipment and storage medium
CN112700657A (en) * 2020-12-21 2021-04-23 北京百度网讯科技有限公司 Method and device for generating detection information, road side equipment and cloud control platform
CN113011290A (en) * 2021-03-03 2021-06-22 上海商汤智能科技有限公司 Event detection method and device, electronic equipment and storage medium
CN114245200A (en) * 2021-11-02 2022-03-25 浙江大华技术股份有限公司 Monitoring scene change determination method, device and system and electronic device
CN114640608A (en) * 2022-04-01 2022-06-17 上海商汤智能科技有限公司 Test method and device, electronic equipment and computer readable storage medium
CN115690665A (en) * 2023-01-03 2023-02-03 华东交通大学 Video anomaly detection method and device based on cross U-Net network
CN117541957A (en) * 2023-11-08 2024-02-09 继善(广东)科技有限公司 Method, system and medium for generating event solving strategy based on artificial intelligence

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113470013A (en) * 2021-07-28 2021-10-01 浙江大华技术股份有限公司 Method and device for detecting moved article
CN117178301A (en) * 2022-03-28 2023-12-05 京东方科技集团股份有限公司 Method and device for detecting illegal behaviors
CN117392585B (en) * 2023-10-24 2024-06-18 广州广电运通智能科技有限公司 Gate traffic detection method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201638305U (en) * 2009-05-18 2010-11-17 上海银晨智能识别科技有限公司 Intelligent monitoring device for self-service bank and automated teller machine
US20140055610A1 (en) * 2012-08-27 2014-02-27 Korea University Research And Business Foundation Indoor surveillance system and indoor surveillance method
CN103679123A (en) * 2012-09-17 2014-03-26 浙江大华技术股份有限公司 Method and system for detecting remnant on ATM (Automatic Teller Machine) panel
CN105718857A (en) * 2016-01-13 2016-06-29 兴唐通信科技有限公司 Human body abnormal behavior detection method and system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8614744B2 (en) * 2008-07-21 2013-12-24 International Business Machines Corporation Area monitoring using prototypical tracks
CN103605983B (en) * 2013-10-30 2017-01-25 天津大学 Remnant detection and tracking method
CN105976400B (en) * 2016-05-10 2017-06-30 北京旷视科技有限公司 Method for tracking target and device based on neural network model
CN105872477B (en) * 2016-05-27 2018-11-23 北京旷视科技有限公司 video monitoring method and video monitoring system
CN106203274B (en) * 2016-06-29 2020-03-31 长沙慧联智能科技有限公司 Real-time pedestrian detection system and method in video monitoring

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201638305U (en) * 2009-05-18 2010-11-17 上海银晨智能识别科技有限公司 Intelligent monitoring device for self-service bank and automated teller machine
US20140055610A1 (en) * 2012-08-27 2014-02-27 Korea University Research And Business Foundation Indoor surveillance system and indoor surveillance method
CN103679123A (en) * 2012-09-17 2014-03-26 浙江大华技术股份有限公司 Method and system for detecting remnant on ATM (Automatic Teller Machine) panel
CN105718857A (en) * 2016-01-13 2016-06-29 兴唐通信科技有限公司 Human body abnormal behavior detection method and system

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259827B (en) * 2020-01-19 2022-07-29 河海大学 Automatic detection method and device for water surface floating objects for urban river supervision
CN111259827A (en) * 2020-01-19 2020-06-09 河海大学 Automatic detection method and device for water surface floating objects for urban river supervision
CN111860140A (en) * 2020-06-10 2020-10-30 北京迈格威科技有限公司 Target event detection method and device, computer equipment and storage medium
CN111860140B (en) * 2020-06-10 2024-05-17 北京迈格威科技有限公司 Target event detection method, device, computer equipment and storage medium
CN112700657B (en) * 2020-12-21 2023-04-28 阿波罗智联(北京)科技有限公司 Method and device for generating detection information, road side equipment and cloud control platform
CN112700657A (en) * 2020-12-21 2021-04-23 北京百度网讯科技有限公司 Method and device for generating detection information, road side equipment and cloud control platform
CN113011290A (en) * 2021-03-03 2021-06-22 上海商汤智能科技有限公司 Event detection method and device, electronic equipment and storage medium
CN114245200A (en) * 2021-11-02 2022-03-25 浙江大华技术股份有限公司 Monitoring scene change determination method, device and system and electronic device
CN114245200B (en) * 2021-11-02 2024-05-07 浙江大华技术股份有限公司 Monitoring scene change determining method, device, system and electronic device
CN114640608A (en) * 2022-04-01 2022-06-17 上海商汤智能科技有限公司 Test method and device, electronic equipment and computer readable storage medium
CN115690665A (en) * 2023-01-03 2023-02-03 华东交通大学 Video anomaly detection method and device based on cross U-Net network
CN115690665B (en) * 2023-01-03 2023-03-28 华东交通大学 Video anomaly detection method and device based on cross U-Net network
CN117541957A (en) * 2023-11-08 2024-02-09 继善(广东)科技有限公司 Method, system and medium for generating event solving strategy based on artificial intelligence
CN117541957B (en) * 2023-11-08 2024-05-24 继善(广东)科技有限公司 Method, system and medium for generating event solving strategy based on artificial intelligence

Also Published As

Publication number Publication date
CN110648352A (en) 2020-01-03
CN110648352B (en) 2022-09-30

Similar Documents

Publication Publication Date Title
WO2020001216A1 (en) Abnormal event detection
Tian et al. Robust detection of abandoned and removed objects in complex surveillance videos
US10943204B2 (en) Realtime video monitoring applied to reduce customer wait times
CN109376639B (en) Accompanying personnel early warning system and method based on portrait recognition
Liao et al. A localized approach to abandoned luggage detection with foreground-mask sampling
US20180115749A1 (en) Surveillance system and surveillance method
JP6148480B2 (en) Image processing apparatus and image processing method
Kamthe et al. Suspicious activity recognition in video surveillance system
CN101556717A (en) ATM intelligent security system and monitoring method
US10635908B2 (en) Image processing system and image processing method
CN104424721B (en) Recognition methods is blocked in a kind of face of combination ATM
JP7197485B2 (en) Detection system, detection device and method
Tripathi et al. A framework for abandoned object detection from video surveillance
Chang et al. Localized detection of abandoned luggage
CN111353338A (en) Energy efficiency improvement method based on business hall video monitoring
Patil et al. Suspicious movement detection and tracking based on color histogram
Zaidi et al. Video anomaly detection and classification for human activity recognition
CN114783037B (en) Object re-recognition method, object re-recognition apparatus, and computer-readable storage medium
JP5752976B2 (en) Image monitoring device
JP5752975B2 (en) Image monitoring device
JP2012212968A (en) Image monitoring apparatus
Islam et al. Correlating belongings with passengers in a simulated airport security checkpoint
JP2021012657A (en) Information processing apparatus, information processing method, and camera
Lin et al. An abandoned objects management system based on the gaussian mixture model
Hongxing et al. Facial area forecast and occluded face detection based on the YCbCr elliptical model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19825500

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19825500

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 19825500

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 22.06.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19825500

Country of ref document: EP

Kind code of ref document: A1