US20190188998A1 - Alert directives and focused alert directives in a behavioral recognition system - Google Patents
- Publication number
- US20190188998A1 (application US16/119,227)
- Authority
- US
- United States
- Prior art keywords
- alert
- directive
- matching
- scene
- directives
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/40—Software arrangements specially adapted for pattern recognition, e.g. user interfaces or toolboxes therefor
- G06F18/41—Interactive pattern learning with a human teacher
-
- G06K9/00369—
-
- G06K9/00771—
-
- G06K9/00778—
-
- G06K9/3241—
-
- G06K9/52—
-
- G06K9/6254—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/778—Active pattern-learning, e.g. online learning of image or video features
- G06V10/7784—Active pattern-learning, e.g. online learning of image or video features based on feedback from supervisors
- G06V10/7788—Active pattern-learning, e.g. online learning of image or video features based on feedback from supervisors the supervisor being a human, e.g. interactive learning with a human teacher
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
- G06V20/53—Recognition of crowd images, e.g. recognition of crowd congestion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/103—Static body considered as a whole, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B13/00—Burglar, theft or intruder alarms
- G08B13/18—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
- G08B13/189—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
- G08B13/194—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
- G08B13/196—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
- G08B13/19602—Image analysis to detect motion of the intruder, e.g. by frame subtraction
- G08B13/19613—Recognition of a predetermined image pattern or behaviour pattern indicating theft or intrusion
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B21/00—Alarms responsive to a single specified undesired or abnormal condition and not otherwise provided for
- G08B21/18—Status alarms
- G08B21/182—Level alarms, e.g. alarms responsive to variables exceeding a threshold
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B23/00—Alarms responsive to unspecified undesired or abnormal conditions
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B29/00—Checking or monitoring of signalling or alarm systems; Prevention or correction of operating errors, e.g. preventing unauthorised operation
- G08B29/18—Prevention or correction of operating errors
- G08B29/185—Signal analysis techniques for reducing or preventing false alarms or for enhancing the reliability of the system
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/002—Special television systems not provided for by H04N7/007 - H04N7/18
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0409—Adaptive resonance theory [ART] networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/088—Non-supervised learning, e.g. competitive learning
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B13/00—Burglar, theft or intruder alarms
- G08B13/18—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
- G08B13/189—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
- G08B13/194—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
- G08B13/196—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
- G08B13/19602—Image analysis to detect motion of the intruder, e.g. by frame subtraction
- G08B13/19608—Tracking movement of a target, e.g. by detecting an object predefined as a target, using target direction and or velocity to predict its new position
Definitions
- Embodiments of the present invention generally relate to configuring a behavioral recognition-based video surveillance system to generate alerts for certain events. More specifically, the embodiments provide techniques allowing a behavioral recognition system to identify events that should always or never result in an alert without impeding the unsupervised learning process of the surveillance system.
- a video surveillance system may be configured to classify a group of pixels (referred to as a “blob”) in a given frame as being a particular object (e.g., a person or vehicle). Once identified, a “blob” may be tracked from frame-to-frame in order to follow the “blob” moving through the scene over time, e.g., a person walking across the field of vision of a video surveillance camera. Further, such systems may be configured to determine when an object has engaged in certain predefined behaviors.
- the system may include definitions used to recognize the occurrence of a number of predefined events, e.g., the system may evaluate the appearance of an object classified as depicting a car (a vehicle-appear event) coming to a stop over a number of frames (a vehicle-stop event). Thereafter, a new foreground object may appear and be classified as a person (a person-appear event) and the person then walks out of frame (a person-disappear event). Further, the system may be able to recognize the combination of the first two events as a “parking-event.”
- Such surveillance systems typically require that the objects and/or behaviors which may be recognized by the system be defined in advance.
- these systems rely on predefined definitions for objects and/or behaviors to evaluate a video sequence. More generally, such systems rely on predefined rules and static patterns and are thus often unable to dynamically identify objects, events, behaviors, or patterns, much less even classify them as either normal or anomalous.
- a behavioral recognition system is a type of video surveillance system that may be configured to learn, identify, and recognize patterns of behavior by observing a sequence of individual frames, otherwise known as a video stream. Unlike rules-based video surveillance systems, a behavioral recognition system instead learns objects and behavioral patterns by generalizing video input and building memories of what is observed. Over time, a behavioral recognition system uses these memories to distinguish between normal and anomalous behavior captured in the field of view of a video stream. Upon detecting anomalous behavior, the behavioral recognition system publishes an alert to a user notifying the user of the behavior. After several recurrences of a particular event, the behavioral recognition system learns that the event is non-anomalous and ceases publishing subsequent alerts.
- A behavioral recognition system focused on a building corridor may initially publish alerts each time an individual appears in the corridor at a certain time of day within the field of view of the camera. If this event occurs a sufficient number of times, the behavioral recognition system may learn that this is non-anomalous behavior and stop alerting a user to this event.
- the user may want the behavioral recognition system to always publish an alert for a particular behavioral event.
- If the corridor were of limited access, security personnel may want to be notified each time someone appears in the corridor to ascertain that the only people in the corridor are those authorized to be there.
- The user may never want the behavioral recognition system to publish an alert for a particular behavior. This situation may arise where the event occurs often, but still infrequently enough to result in an alert.
- a behavioral recognition system focused on a room in a building that is next to a construction site may create alerts whenever construction vehicles pass through the field of view of the camera outside a window in the room. In this instance, security personnel may not want the behavioral recognition system to ever alert on these occurrences.
- One embodiment of the invention provides a method for alerting a user to behavior corresponding to an alert directive.
- This method may generally include obtaining characteristic values from an observed event in a scene.
- This method may also include parsing a list of alert directives for a matching alert directive having ranges of criteria values. If the characteristic values are within the ranges of the criteria values, then the observed event corresponds to a matching alert directive.
- This method may also include, upon identifying the matching alert directive, alerting the user to the observed event.
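The three steps above (obtain characteristic values, scan the directive list for ranges that contain them, alert on a match) can be illustrated with a minimal Python sketch. The `AlertDirective` class, its field names, and the dictionary keys are hypothetical and chosen only for illustration; the source does not specify a data layout.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Range = Tuple[float, float]  # inclusive (low, high) bounds


@dataclass
class AlertDirective:
    # Hypothetical structure; names are illustrative, not taken from the source.
    name: str
    criteria: Dict[str, Range]


def matches(directive: AlertDirective, event: Dict[str, float]) -> bool:
    """True when every criterion has a characteristic value inside its range."""
    for key, (low, high) in directive.criteria.items():
        value = event.get(key)
        if value is None or not (low <= value <= high):
            return False
    return True


def find_matching_directive(directives: List[AlertDirective],
                            event: Dict[str, float]) -> Optional[AlertDirective]:
    """Scan the directive list and return the first match, if any."""
    for directive in directives:
        if matches(directive, event):
            return directive
    return None


directives = [
    AlertDirective("person-at-door", {
        "height": (80, 120), "width": (30, 60),
        "center_x": (140, 180), "center_y": (100, 140),
    }),
]
event = {"height": 95, "width": 42, "center_x": 160, "center_y": 118}
match = find_matching_directive(directives, event)
print(match.name if match else "no match")  # person-at-door
```

Returning the first match is a simplification; a real system would need a policy for events that satisfy several directives at once.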
- the characteristic values of the observed event may be a pixel-height value, a pixel-width value, and an x- and y-coordinate center position of a foreground object.
- the characteristic values may also be a set of x- and y-coordinates corresponding to a trajectory of a foreground object.
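For trajectory-based characteristic values, one possible matching rule is sketched below. The source does not state how a trajectory is compared against a directive, so the nearest-point distance test and the single pixel `tolerance` parameter are assumptions made for illustration only.

```python
from math import hypot
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) in pixels


def trajectory_within_tolerance(observed: List[Point],
                                template: List[Point],
                                tolerance: float) -> bool:
    """True when every observed point lies within `tolerance` pixels of
    at least one point on the template trajectory (assumed rule)."""
    if not observed or not template:
        return False
    for ox, oy in observed:
        nearest = min(hypot(ox - tx, oy - ty) for tx, ty in template)
        if nearest > tolerance:
            return False
    return True


template = [(10, 100), (50, 100), (90, 100), (130, 100)]
close_path = [(12, 104), (48, 97), (95, 102)]
far_path = [(12, 104), (48, 160)]

print(trajectory_within_tolerance(close_path, template, tolerance=10))  # True
print(trajectory_within_tolerance(far_path, template, tolerance=10))    # False
```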
- the matching alert directive may have a focus mask that intersects with a region in the scene where the observed event occurred.
- Other embodiments include, without limitation, a computer-readable medium that includes instructions that enable a processing unit to implement one or more aspects of the disclosed methods, as well as a system having a processor, memory, and application programs configured to implement one or more aspects of the disclosed methods.
- FIG. 1 illustrates components of a video analysis system, according to one embodiment.
- FIG. 2 further illustrates components of the video analysis system shown in FIG. 1 , according to one embodiment.
- FIG. 3 illustrates an example of an alert database in a client device, according to one embodiment.
- FIG. 4 illustrates a method for publishing alerts in a behavioral recognition system configured with alert directives and focused alert directives, according to one embodiment.
- FIG. 5 illustrates an example graphical representation of a set of tolerances applied to a trajectory alert, according to one embodiment.
- FIG. 6 illustrates an example graphical representation of an alert directive and a focused alert directive applied to a particular alert, according to one embodiment.
- Embodiments of the invention disclosed herein provide techniques for creating alert directives and focused alert directives in a behavioral recognition-based video surveillance system. That is, the disclosed techniques allow a user of a behavioral recognition system to identify previously alerted behavior that should always or never result in a subsequent alert. Because alert directives override only the behavioral recognition system's normal alert publication procedures (which take place after the system has already performed its learning procedures), this approach does not disrupt the behavioral recognition system's unsupervised learning.
- a behavioral recognition system includes a computer vision engine and a machine learning engine.
- the computer vision engine may be configured to process a field of view captured within a video stream. This field of view is generally referred to as the “scene.”
- the computer vision engine separates foreground objects (e.g., objects resembling people, vehicles, etc.) from background objects (e.g., objects resembling pavement, the sky, etc.).
- the computer vision engine may generate information streams of observed activity (e.g., appearance features, kinematic features, etc.) and pass the streams to the machine learning engine.
- the machine learning engine may be configured to learn object behaviors in the scene using that information.
- a machine learning engine may be configured to build models of certain behaviors within a scene and determine whether observations indicate that the behavior of an object is anomalous, relative to the model. Upon detecting anomalous behavior, the machine learning engine generates an alert. After determining that the alert should be published, the behavioral recognition system publishes the alert to a user interface.
- the user interface may contain a database of previously issued alerts that are generally accessible to a user of the system. The user can view these alerts as a list, where each list item displays information of the alert and may include corresponding video or image data.
- the machine learning engine learns the event is a non-anomalous occurrence and ceases to publish subsequent alerts for the event.
- a user may create an alert directive to override the normal alert publication process.
- An alert directive allows a user to provide feedback to the machine learning engine to either always or never create an alert for a certain behavioral event.
- the machine learning engine consults a list of alert directive definitions after learning information streams relating to an event and before evaluating the event for anomalous behavior. Thus, alert directives do not hinder the machine learning engine's learning procedures.
- a user selects an event occurrence or an alert previously generated by the system to use as a template. For example, a user may parse through a database of previous alerts for a scene, characterized based on time, type, name, event, or otherwise, as well as view underlying video of the activity that caused the alert. In one embodiment, a user may do this via a dialog box in an alert browser on the user interface.
- the user defines alert directive matching criteria.
- the criteria may include whether the behavior should always or never result in an alert, how frequently the alert should be published (e.g., in situations where the behavior results in numerous alerts within a short time span), and whether the machine learning engine should match behaviors or object types (or both).
- the user interface creates an alert directive in the alert database with references pointing back to the original alert used to create it. Thereafter, the user interface sends information about the alert directive to the machine learning engine.
- matching an alert directive to alert behavior by the machine learning engine may depend on both the alert type and series of parameters specified by a user. For example, once a user selects an alert to use for an alert directive, the corresponding video of that alert may show a person in front of the security door, along with a bounding box indicating the pixels classified by the system as depicting that person. In such a case, a graphical editor may allow a user to adjust the bounding box around the person in the selected alert to adjust the tolerances for the alert directive—creating a range, e.g., for the center (x,y) position of a foreground object, etc.
- a user may specify another bounding box for the relative position of the person in front of the door, i.e., a tolerance for object position.
- By adjusting a tolerance for the size and pose of a person and for the position of such a person, an alert directive may be used to specify a region in front of the security door, so that whenever any person is observed to be present, the machine learning engine creates an alert.
- this approach allows for variation in height, position, width, speed, etc., of observed objects to still satisfy the alert directive definitions.
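As a rough illustration of how a template alert plus user-chosen tolerances could become the ranges an alert directive is matched against, consider the sketch below. The function name, the dictionary keys, and the symmetric plus-or-minus widening are assumptions; the source describes the tolerances only as adjustable bounding boxes in a graphical editor.

```python
from typing import Dict, Tuple

Range = Tuple[float, float]  # inclusive (low, high) bounds


def ranges_from_template(template: Dict[str, float],
                         tolerance: Dict[str, float]) -> Dict[str, Range]:
    """Widen each template value into a (low, high) range.

    A characteristic with no tolerance entry keeps a zero-width range.
    """
    return {
        key: (value - tolerance.get(key, 0), value + tolerance.get(key, 0))
        for key, value in template.items()
    }


# Values taken from a previously published alert (illustrative numbers).
template_alert = {"height": 100, "width": 40, "center_x": 160, "center_y": 120}
user_tolerance = {"height": 20, "width": 10, "center_x": 25, "center_y": 15}

criteria = ranges_from_template(template_alert, user_tolerance)
print(criteria["height"])    # (80, 120)
print(criteria["center_x"])  # (135, 185)
```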
- an alert directive may be expanded to provide a focused alert directive.
- This approach extends the cases where the machine learning engine can apply an alert directive to behavioral events in the scene. For example, a camera may focus on a building corridor with multiple security doors. In such a case, a user must create and tune a separate alert directive for each door to be alerted whenever someone appears in front of a door.
- a focused alert directive allows the user to create both an alert directive (e.g., for a person appearing in front of the security door) and a focus mask specifying different regions in the scene which should result in an alert.
- the user can extend the tolerance in position to the full field of view.
- the user defines one or more regions of the scene where an alert should be generated when a foreground object otherwise within the tolerances of the alert directive is observed.
- the alert of the person appearing in front of the first door (within the alert directive tolerances for height, width, and pose) defines an alert directive, but the position is extended to be the entire field of view of the camera, intersected with the user-defined regions.
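The focus-mask test can be sketched as a simple intersection check. Representing the mask as a list of axis-aligned rectangles is an assumption made here for brevity; the source only says the user defines one or more regions, which could equally be a per-pixel bitmap or arbitrary polygons.

```python
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max), inclusive pixels


def rects_intersect(a: Rect, b: Rect) -> bool:
    """True when two inclusive rectangles share at least one pixel."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def in_focus(event_box: Rect, focus_mask: List[Rect]) -> bool:
    """True when the event's bounding box touches any user-defined region."""
    return any(rects_intersect(event_box, region) for region in focus_mask)


# Two regions, e.g. in front of two security doors (illustrative coordinates).
focus_mask = [(20, 60, 80, 200), (240, 60, 300, 200)]

print(in_focus((250, 90, 290, 190), focus_mask))  # True: overlaps second region
print(in_focus((130, 90, 170, 190), focus_mask))  # False: between the doors
```

With this check in place, position no longer needs its own tolerance range: size and pose are matched against the directive, and location is decided by the mask alone.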
- the user interface sends information of the alert directive (and focus mask, if applicable) to the machine learning engine.
- When the machine learning engine processes information of subsequent events that match an alert directive's match criteria and tolerances, the machine learning engine bypasses the normal publication methods of the behavioral recognition system and immediately publishes an alert or discards the event (given the matching criteria), irrespective of whether the machine learning engine regards the observed behavior as anomalous. This approach does not change the learned state regarding a particular scene or influence the undirected learning of the machine learning engine. In all cases, the machine learning engine has already performed its learning procedures before applying the alert directive.
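The ordering described here (learn first, then consult alert directives, then fall back to normal anomaly evaluation) can be sketched as follows. The `Action` enum and the callback parameters `learn`, `match_directive`, and `evaluate_anomaly` are hypothetical stand-ins for the engine's internals, which the source does not detail.

```python
from enum import Enum
from typing import Callable, Optional


class Action(Enum):
    ALWAYS = "always"  # always publish an alert
    NEVER = "never"    # never publish an alert


def process_event(event: dict,
                  learn: Callable[[dict], None],
                  match_directive: Callable[[dict], Optional[Action]],
                  evaluate_anomaly: Callable[[dict], bool]) -> bool:
    """Return True when an alert should be published for this event."""
    learn(event)                     # learning runs regardless of directives
    action = match_directive(event)  # consulted after learning
    if action is Action.ALWAYS:
        return True                  # bypass the normal publication decision
    if action is Action.NEVER:
        return False                 # discard the event
    return evaluate_anomaly(event)   # no directive: normal path


learned = []
published = process_event(
    {"type": "person-appear"},
    learn=learned.append,
    match_directive=lambda e: Action.ALWAYS,
    evaluate_anomaly=lambda e: False,
)
print(published, len(learned))  # True 1
```

Because `learn` is called before the directive lookup, a directive changes only what is published, not what the model observes.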
- any reference to “the invention” or “the disclosure” shall not be construed as a generalization of any inventive subject matter disclosed herein and shall not be considered to be an element or limitation of the appended claims except where explicitly recited in a claim(s).
- One embodiment of the present invention is implemented as a program product for use with a computer system.
- the program(s) of the program product defines functions of the embodiments (including the methods described herein) and can be contained on a variety of computer-readable storage media.
- Examples of computer-readable storage media include (i) non-writable storage media (e.g., read-only memory devices within a computer such as CD-ROM or DVD-ROM disks readable by an optical media drive) on which information is permanently stored; and (ii) writable storage media (e.g., a hard-disk drive) on which alterable information is stored.
- Such computer-readable storage media, when carrying computer-readable instructions that direct the functions of the invention, are embodiments of the invention.
- Other example media include communications media through which information is conveyed to a computer, such as through a computer or telephone network, including wireless communications networks.
- routines executed to implement the embodiments of the invention may be part of an operating system or a specific application, component, program, module, object, or sequence of instructions.
- the computer program of the invention is comprised typically of a multitude of instructions that will be translated by the native computer into a machine-readable format and hence executable instructions.
- programs are comprised of variables and data structures that either reside locally to the program or are found in memory or on storage devices.
- various programs described herein may be identified based upon the application for which they are implemented in a specific embodiment of the disclosure. However, it should be appreciated that any particular program nomenclature that follows is used merely for convenience, and thus the present disclosure should not be limited to use solely in any specific application identified and/or implied by such nomenclature.
- FIG. 1 illustrates components of a video analysis and behavioral recognition system 100, according to one embodiment.
- the behavioral recognition system 100 includes a video input source 105, a network 110, a computer system 115, and input and output devices 118 (e.g., a monitor, a keyboard, a mouse, a printer, and the like).
- the network 110 may transmit video data recorded by the video input 105 to the computer system 115.
- the computer system 115 includes a CPU 120, storage 125 (e.g., a disk drive, optical disk drive, floppy disk drive, and the like), and a memory 130 containing both a computer vision engine 135 and a machine learning engine 140.
- the computer vision engine 135 and the machine learning engine 140 may provide software applications configured to analyze a sequence of video frames provided by the video input 105.
- the network 110 receives video data (e.g., video stream(s), video images, or the like) from the video input source 105.
- the video input source 105 may be a video camera, a VCR, DVR, DVD, computer, web-cam device, or the like.
- the video input source 105 may be a stationary video camera aimed at a certain area (e.g., a subway station, a parking lot, a building entry/exit, etc.), which records the events taking place therein.
- the area within the camera's field of view is referred to as the scene.
- the video input source 105 may be configured to record the scene as a sequence of individual video frames at a specified frame-rate (e.g., 24 frames per second), where each frame includes a fixed number of pixels (e.g., 320×240). Each pixel of each frame may specify a color value (e.g., an RGB value) or grayscale value (e.g., a radiance value between 0 and 255). Further, the video stream may be formatted using known formats, e.g., MPEG2, MJPEG, MPEG4, H.263, H.264, and the like.
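To give a sense of scale for the example figures above, the short calculation below works out the uncompressed data rate of a 320×240 stream at 24 frames per second. It assumes 8 bits per channel (consistent with the 0 to 255 grayscale range, but an assumption for RGB) and ignores any compression applied by the listed video formats.

```python
WIDTH, HEIGHT = 320, 240  # pixels per frame, from the example above
FPS = 24                  # frames per second, from the example above
RGB_CHANNELS = 3          # assumed: one byte each for R, G, B
GRAY_CHANNELS = 1         # one byte for a 0-255 radiance value

pixels_per_frame = WIDTH * HEIGHT
rgb_bytes_per_frame = pixels_per_frame * RGB_CHANNELS
gray_bytes_per_frame = pixels_per_frame * GRAY_CHANNELS

print(pixels_per_frame)            # 76800
print(rgb_bytes_per_frame)         # 230400
print(rgb_bytes_per_frame * FPS)   # 5529600 bytes per second
print(gray_bytes_per_frame * FPS)  # 1843200 bytes per second
```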
- the computer vision engine 135 may be configured to analyze this raw information to identify active objects in the video stream, identify a variety of appearance and kinematic features used by a machine learning engine 140 to derive object classifications, derive a variety of metadata regarding the actions and interactions of such objects, and supply this information to the machine learning engine 140 .
- the machine learning engine 140 may be configured to evaluate, observe, learn and remember details regarding events (and types of events) that transpire within the scene over time.
- the machine learning engine 140 receives the video frames and the data generated by the computer vision engine 135 .
- the machine learning engine 140 may be configured to analyze the received data, cluster objects having similar visual and/or kinematic features, and build semantic representations of events depicted in the video frames.
- the machine learning engine 140 learns expected patterns of behavior for objects that map to a given cluster.
- the machine learning engine learns from these observed patterns to identify normal and/or abnormal events. That is, rather than having patterns, objects, object types, or activities defined in advance, the machine learning engine 140 builds its own model of what different object types have been observed (e.g., based on clusters of kinematic and/or appearance features) as well as a model of expected behavior for a given object type. Thereafter, the machine learning engine can decide whether the behavior of an observed event is anomalous or not based on prior learning.
- Data describing whether a normal/abnormal behavior/event has been determined and/or what such behavior/event is may be provided to output devices 118 to issue alerts, for example, an alert message with corresponding video and image data presented on a GUI interface screen.
- Such output devices may also be configured with a database of previously issued alerts from which a user can create an alert directive.
- the computer vision engine 135 and the machine learning engine 140 both process video data in real-time.
- time scales for processing information by the computer vision engine 135 and the machine learning engine 140 may differ.
- the computer vision engine 135 processes the received video data frame-by-frame, while the machine learning engine 140 processes data every N-frames.
- While the computer vision engine 135 may analyze each frame in real-time to derive a set of kinematic and appearance data related to objects observed in the frame, the machine learning engine 140 is not constrained by the real-time frame rate of the video input.
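The difference in time scales can be sketched as a loop in which per-frame analysis feeds a learner that runs only once every N frames. The function names and the decision to flush a final partial batch are assumptions for illustration; the source states only that one engine works frame-by-frame and the other every N frames.

```python
from typing import Callable, Iterable, List


def run_pipeline(frames: Iterable[dict],
                 analyze_frame: Callable[[dict], dict],
                 learn_batch: Callable[[List[dict]], None],
                 n: int) -> None:
    """Analyze every frame, but hand results to the learner every n frames."""
    pending: List[dict] = []
    for frame in frames:
        pending.append(analyze_frame(frame))  # computer-vision step, per frame
        if len(pending) == n:
            learn_batch(pending)              # learning step, every n frames
            pending = []
    if pending:
        learn_batch(pending)                  # assumed: flush the remainder


batches: List[int] = []
run_pipeline(
    ({"index": i} for i in range(10)),
    analyze_frame=lambda f: {"index": f["index"], "objects": []},
    learn_batch=lambda batch: batches.append(len(batch)),
    n=4,
)
print(batches)  # [4, 4, 2]
```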
- FIG. 1 illustrates merely one possible arrangement of the behavior-recognition system 100 .
- Although the video input source 105 is shown connected to the computer system 115 via the network 110, the network 110 is not always present or needed (e.g., the video input source 105 may be directly connected to the computer system 115).
- various components and modules of the behavior-recognition system 100 may be implemented in other systems.
- the computer vision engine 135 may be implemented as a part of a video input device (e.g., as a firmware component wired directly into a video camera). In such a case, the output of the video camera may be provided to the machine learning engine 140 for analysis.
- the output from the computer vision engine 135 and machine learning engine 140 may be supplied over computer network 110 to other computer systems.
- the computer vision engine 135 and machine learning engine 140 may be installed on a server system and configured to process video from multiple input sources (i.e., from multiple cameras).
- a client application 250 running on another computer system may request (or receive) the results over network 110.
- FIG. 2 further illustrates components of the computer vision engine 135 and the machine learning engine 140 first illustrated in FIG. 1 , according to one embodiment of the invention.
- the computer vision engine 135 includes a data ingestor 205, a detector 210, a tracker 215, a context event generator 220, an alert generator 225, and an event bus 230.
- the components 205, 210, 215, and 220 provide a pipeline for processing an incoming sequence of video frames supplied by the video input source 105 (indicated by the solid arrows linking the components).
- the components 210, 215, and 220 may each provide a software module configured to provide the functions described herein.
- components 205, 210, 215, and 220 may be combined (or further subdivided) to suit the needs of a particular case, and additional components may be added to (or some may be removed from) a video surveillance system.
- the data ingestor 205 receives video input from the video input source 105 .
- the data ingestor 205 may be configured to preprocess the input data before sending it to the detector 210 .
- the detector 210 may be configured to separate each frame of video provided into a stationary or static part (the scene background) and a collection of volatile parts (the scene foreground).
- the frame itself may include a two-dimensional array of pixel values for multiple channels (e.g., RGB channels for color video or grayscale channel or radiance channel for black and white video).
- the detector 210 may model background states for each pixel using an adaptive resonance theory (ART) network. That is, each pixel may be classified as depicting scene foreground or scene background using an ART network modeling a given pixel.
- the detector 210 may be configured to generate a mask used to identify which pixels of the scene are classified as depicting foreground and, conversely, which pixels are classified as depicting scene background. The detector 210 then identifies regions of the scene that contain a portion of scene foreground (referred to as a foreground “blob” or “patch”) and supplies this information to subsequent stages of the pipeline. Additionally, pixels classified as depicting scene background may be used to generate a background image modeling the scene.
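The mask-then-blob step can be illustrated in a few lines of Python. Note that this sketch swaps in plain thresholded differencing against a fixed background image in place of the per-pixel ART network the source describes, and uses a 4-connected flood fill to group foreground pixels; both choices, and all function names, are assumptions for illustration.

```python
from typing import List, Tuple

Mask = List[List[int]]           # 1 = foreground, 0 = background
Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max)


def foreground_mask(frame: List[List[int]], background: List[List[int]],
                    threshold: int) -> Mask:
    """Mark a pixel as foreground when it differs enough from the background."""
    return [[1 if abs(p - b) > threshold else 0 for p, b in zip(f_row, b_row)]
            for f_row, b_row in zip(frame, background)]


def find_blobs(mask: Mask) -> List[Box]:
    """Group 4-connected foreground pixels and return their bounding boxes."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes: List[Box] = []
    for y in range(rows):
        for x in range(cols):
            if mask[y][x] != 1 or seen[y][x]:
                continue
            stack = [(x, y)]
            seen[y][x] = True
            x0 = x1 = x
            y0 = y1 = y
            while stack:
                cx, cy = stack.pop()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if (0 <= nx < cols and 0 <= ny < rows
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        stack.append((nx, ny))
            boxes.append((x0, y0, x1, y1))
    return boxes


background = [[10] * 6 for _ in range(4)]
frame = [
    [10, 10, 10, 10, 10, 10],
    [10, 200, 200, 10, 10, 10],
    [10, 200, 200, 10, 10, 90],
    [10, 10, 10, 10, 10, 90],
]
mask = foreground_mask(frame, background, threshold=50)
print(find_blobs(mask))  # [(1, 1, 2, 2), (5, 2, 5, 3)]
```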
- the detector 210 may be configured to detect the flow of a scene. Once the foreground patches have been separated, the detector 210 examines, from frame-to-frame, any edges and corners of all foreground patches. The detector 210 will identify foreground patches moving in a similar flow of motion as most likely belonging to a single object or a single association of motions and send this information to the tracker 215 .
- the tracker 215 may receive the foreground patches produced by the detector 210 and generate computational models for the patches.
- the tracker 215 may be configured to use this information, and each successive frame of raw-video, to attempt to track the motion of an object depicted by a given foreground patch as it moves about the scene. That is, the tracker 215 provides continuity to other elements of the system by tracking a given object from frame-to-frame. It further calculates a variety of kinematic and/or appearance features of a foreground object, e.g., size, height, width, and area (in pixels), reflectivity, shininess, rigidity, speed, velocity, etc.
- the context event generator 220 may receive the output from other stages of the pipeline. Using this information, the context event generator 220 may be configured to generate a stream of context events regarding objects tracked (by tracker component 215 ). For example, the context event generator 220 may package a stream of micro feature vectors and kinematic observations of an object and output this to the machine learning engine 140 , e.g., at a rate of 5 Hz. In one embodiment, the context events are packaged as a trajectory. As used herein, a trajectory generally refers to a vector packaging the kinematic data of a particular foreground object in successive frames or samples. Each element in the trajectory represents the kinematic data captured for that object at a particular point in time.
- a complete trajectory includes the kinematic data obtained when an object is first observed in a frame of video along with each successive observation of that object up to when it leaves the scene (or becomes stationary to the point of dissolving into the frame background). Accordingly, assuming computer vision engine 135 is operating at a rate of 5 Hz, a trajectory for an object is updated every 200 milliseconds, until complete.
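A trajectory of this kind can be pictured as a growing list of kinematic samples. The sketch below assumes the 5 Hz rate mentioned above, which gives a sample period of 1000 / 5 = 200 ms; the class and field names (`KinematicSample`, `Trajectory`, `t_ms`) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field

SAMPLE_RATE_HZ = 5
SAMPLE_PERIOD_MS = 1000 // SAMPLE_RATE_HZ  # 200 ms between updates at 5 Hz


@dataclass
class KinematicSample:
    t_ms: int      # time offset from the first observation of the object
    x: float       # center position, in pixels
    y: float
    width: float   # bounding-box size, in pixels
    height: float


@dataclass
class Trajectory:
    object_id: int
    samples: list = field(default_factory=list)
    complete: bool = False

    def update(self, x, y, width, height):
        """Append one observation; called once per sample period."""
        if self.complete:
            raise ValueError("trajectory is already complete")
        t_ms = len(self.samples) * SAMPLE_PERIOD_MS
        self.samples.append(KinematicSample(t_ms, x, y, width, height))

    def close(self):
        """Mark the trajectory complete, e.g. when the object leaves the scene."""
        self.complete = True

    def duration_ms(self):
        return (len(self.samples) - 1) * SAMPLE_PERIOD_MS if self.samples else 0
```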
- the context event generator 220 may also calculate and package the appearance data of every tracked object by evaluating the object for various appearance attributes such as shape, width, and other physical features and assigning each attribute a numerical score.
- the computer vision engine 135 may take the output from the components 205 , 210 , 215 , and 220 describing the motions and actions of the tracked objects in the scene and supply this information to the machine learning engine 140 through the event bus 230 .
- the machine learning engine 140 includes a classifier 235 , a mapper 240 , a semantic module 245 , a cognitive module 250 , and a normalization module 265 .
- the classifier 235 receives context events such as kinematic data and appearance data from the computer vision engine 135 and maps the data on a neural network.
- the neural network is a combination of a self-organizing map (SOM) and an ART network, shown in FIG. 2 as a SOM-ART classifier 236 .
- the data is clustered and combined by features occurring repeatedly in association with each other.
- the classifier 235 defines types of objects.
- the classifier 235 may define foreground patches that have, for example, a high shininess, rigidity, and reflectivity as a Type 1 object. These defined types then propagate throughout the rest of the system.
- the mapper 240 may use these types by searching for spatial and temporal correlations and behaviors across the system for patches to create maps of where and when events are likely or unlikely to happen.
- the mapper 240 includes a temporal memory ART network 241 , a spatial memory ART network 242 , and statistical engines 243 .
- the mapper 240 may look for patches of Type 1 objects.
- the spatial memory ART network 242 uses the statistical engines 243 to create statistical data of these objects, such as where in the scene do these patches appear, in what direction do these patches tend to go, how fast do these patches go, whether these patches change direction, and the like.
- the mapper 240 then builds a neural network of this information, which becomes a memory template against which to compare object behaviors.
- the temporal memory ART network 241 uses the statistical engines 243 to create statistical data based on samplings of time slices. In one embodiment, initial sampling occurs at every thirty minute interval. If many events occur within a time slice, then the time resolution may be dynamically changed to a finer resolution. Conversely, if fewer events occur within a time slice, then the time resolution may be dynamically changed to a coarser resolution.
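The dynamic time resolution described above can be illustrated with a small helper. The text gives only the thirty-minute starting interval; the halving and doubling policy, the `busy` and `quiet` thresholds, and the minimum and maximum bounds below are all assumptions made for the sake of a concrete sketch.

```python
def adjust_resolution(minutes, event_count, busy=50, quiet=5,
                      min_minutes=1, max_minutes=240):
    """Return the next time-slice length in minutes.

    Many events in the slice -> finer resolution (shorter slice).
    Few events in the slice  -> coarser resolution (longer slice).
    All thresholds and bounds are hypothetical.
    """
    if event_count > busy:
        return max(min_minutes, minutes // 2)
    if event_count < quiet:
        return min(max_minutes, minutes * 2)
    return minutes
```

Starting from `adjust_resolution(30, event_count)`, a busy slice would be followed by 15-minute slices and a quiet one by 60-minute slices.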
- the semantic module 245 includes a phase space partitioning component 246 .
- the semantic module 245 identifies patterns of motion or trajectories within a scene and analyzes the scene for anomalous behavior through generalization. By tessellating a scene and dividing the foreground patches into many different tessera, the semantic module 245 can trace an object's trajectory and learn patterns from the trajectory. The semantic module 245 analyzes these patterns and compares them with other patterns.
- the phase space partitioning component 246 builds an adaptive grid and maps the objects and their trajectories onto the grid. As more features and trajectories are populated onto the grid, the machine learning engine learns trajectories that are common to the scene and further distinguishes normal behavior from anomalous behavior.
- the cognitive module 250 includes a perceptual memory 251 , an episodic memory 252 , a long-term memory 253 , a workspace 254 , and codelets 255 .
- the workspace 254 provides a computational engine for the machine learning engine 140 .
- the workspace 254 may be configured to copy information from the perceptual memory 251 , retrieve relevant memories from the episodic memory 252 and the long-term memory 253 , and select which codelets 255 to execute.
- each codelet 255 is a software program configured to evaluate different sequences of events and to determine how one sequence may follow (or otherwise relate to) another (e.g., a finite state machine).
- the codelet may provide a software module configured to detect interesting patterns from the streams of data fed to the machine learning engine.
- the codelet 255 may create, retrieve, reinforce, or modify memories in the episodic memory 252 and the long-term memory 253 .
- the machine learning engine 140 performs a cognitive cycle used to observe, and learn, about patterns of behavior that occur within the scene.
- the perceptual memory 251 , the episodic memory 252 , and the long-term memory 253 are used to identify patterns of behavior, evaluate events that transpire in the scene, and encode and store observations.
- the perceptual memory 251 receives the output of the computer vision engine 135 (e.g., a stream of context events).
- the episodic memory 252 stores data representing observed events with details related to a particular episode, e.g., information describing time and space details related to an event.
- the episodic memory 252 may encode specific details of a particular event, i.e., “what and where” something occurred within a scene, such as a particular vehicle (car A) moved to a location believed to be a parking space (parking space 5) at 9:43 AM.
- the long-term memory 253 may store data generalizing events observed in the scene.
- the long-term memory 253 may encode information capturing observations and generalizations learned by an analysis of the behavior of objects in the scene such as “vehicles tend to park in a particular place in the scene,” “when parking vehicles tend to move a certain speed,” and “after a vehicle parks, people tend to appear in the scene proximate to the vehicle,” etc.
- the long-term memory 253 stores observations about what happens within a scene with much of the particular episodic details stripped away.
- memories from the episodic memory 252 and the long-term memory 253 may be used to relate and understand a current event, i.e., the new event may be compared with past experience, leading to reinforcement, decay, and adjustments to the information stored in the long-term memory 253 , over time.
- the long-term memory 253 may be implemented as an ART network and a sparse-distributed memory data structure. Importantly, however, this approach does not require the different object type classifications to be defined in advance.
- modules 235 , 240 , 245 , and 250 include an anomaly detection component, as depicted by components 237 , 244 , 247 , and 256 .
- Each anomaly detection component is configured to identify anomalous behavior, relative to past observations of the scene. Further, each component is configured to receive alert directive and focus mask information from alert database 270 . Generally, if any anomaly detection component identifies anomalous behavior, the component generates an alert and passes the alert through the normalization module 265 . For instance, anomaly detector 247 in the semantic module 245 detects unusual trajectories using learned patterns and models.
- anomaly detection component 247 evaluates the object trajectory using loitering models, subsequently generates an alert, and sends the alert to the normalization module 265 .
- the normalization module 265 evaluates whether the alert should be published based on the alert's rarity relative to previous alerts of that alert type. Once the normalization module 265 determines that the alert should be published, it passes the alert to the alert generator 225 (through event bus 230 ).
- an anomaly detection component identifies an event that matches an alert directive, then rather than evaluating the event for anomalous behavior, the anomaly detector component instead follows the match criteria of the alert directive. If the alert directive requires that an alert be published, the anomaly detection component sends an alert to the alert generator 225 (through event bus 230 ). Otherwise, the anomaly detection component discards the event. Note that in either case, the anomaly detection component does not send any information to the normalization module 265 if the event data matches an alert directive.
- the alert generator 225 resides in the computer vision engine 135 .
- the alert generator 225 receives alert information from the anomaly detection components 237 , 244 , 247 , and 256 and the normalization module 265 .
- the alert generator 225 publishes alert information to the GUI/client device 260 .
- the GUI/client device stores this alert information in the alert database 270 .
- the alert database 270 contains previously issued alerts and may be accessible to a user of the GUI/client device 260 .
- FIG. 3 illustrates an example of an alert database 300 in a client device, according to one embodiment.
- the alert database 300 stores previously issued alerts that a user may parse through to create an alert directive.
- the alert database 300 includes a plurality of alerts and an alert directive list 305 .
- Each alert 310 includes an identifier 311 , a directive identifier 312 , and a summary 313 .
- the identifier 311 is a unique numerical value assigned to the alert 310 .
- the directive identifier 312 is a numerical field that indicates whether the alert 310 has been assigned an alert directive.
- the summary 313 is a data-payload that contains a concise description of the data characterizing the alert.
- the summary 313 may include information about the type of anomaly, what time the anomaly occurred, height and width values and an x- and y-coordinate of an object (if the anomaly occurred at a point in time), a set of x- and y-coordinates corresponding to a trajectory (if the anomaly occurred over a series of frames), and the like.
- Alert directives evaluate object behaviors or object types (or both) that match the information provided in the summary 313 .
- the alert directive list 305 includes a plurality of alert directives.
- Each alert directive 320 has an identifier 321 , an alert pointer 322 , match criteria 323 , and an epilog 324 .
- the identifier 321 of the alert directive is a unique numerical value assigned to an alert directive.
- Alert pointer 322 is a pointer to the original alert to which the alert directive corresponds. By pointing to the original alert, the alert directive 320 can access the data provided by summary 313 .
- the information contained in summary 313 may be stored as a data packet in a corresponding alert directive 320 .
- Match criteria 323 contains user-specified information of how the alert directive should process a certain event, such as whether the machine learning engine should publish an alert or discard the behavior, and whether to match an alert directive to a behavior or to an object type (or both). For example, if a user chooses to disregard matching behavior for an “unusual location” alert, the machine learning engine may create alerts for an object at rest at the location specified by the alert directive, and it may create alerts for an object moving rapidly through the same location.
- the machine learning engine may create alerts for an object corresponding to a learning based classification type 1 (e.g., a car) positioned at the location, and the machine learning engine may also create alerts for an object corresponding to a learning based classification type 2 (e.g., a person) positioned at the location.
- the epilog 324 is an array of tolerance values of each corresponding alert characteristic in the data provided by summary 313 . Tolerances provide the machine learning engine with flexibility in matching object behaviors and types to an alert directive, as the likelihood of matching two objects having the same characteristics (height, width and the center (x,y) position) in a scene is very low.
- a user defines these tolerances by using a graphical editor on a selected alert. By drawing a bounding box around the object that triggered the alert, the user can adjust the tolerances for the alert directive, creating a range for several characteristics of the selected alert (e.g., for the heights and widths of the object).
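The alert and alert directive records of FIG. 3, together with the way tolerances turn a single observation into a range, can be sketched as follows. This is a sketch under assumptions rather than the disclosed implementation: the epilog is modeled as a dictionary keyed by characteristic name instead of an array, tolerances are applied symmetrically around the original value, a directive identifier of 0 is taken to mean that no directive is assigned, and the `"publish"` key in the match criteria is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Alert:
    identifier: int            # identifier 311
    directive_identifier: int  # directive identifier 312 (0 = none, an assumption)
    summary: dict              # summary 313, e.g. {"height": 80, "width": 30, ...}


@dataclass
class AlertDirective:
    identifier: int       # identifier 321
    alert: Alert          # stands in for alert pointer 322
    match_criteria: dict  # match criteria 323, e.g. {"publish": True}
    epilog: dict          # epilog 324: one tolerance per characteristic

    def ranges(self):
        """Turn each summary value into a (low, high) range using its tolerance."""
        return {
            name: (self.alert.summary[name] - tol, self.alert.summary[name] + tol)
            for name, tol in self.epilog.items()
        }

    def matches(self, observed):
        """True if every toleranced characteristic of the observation is in range."""
        return all(
            name in observed and low <= observed[name] <= high
            for name, (low, high) in self.ranges().items()
        )
```

With a summary of `{"height": 80, "width": 30, "x": 100, "y": 200}` and tolerances of 10, 5, 20, and 20 pixels, an observed object of height 85 at (110, 190) falls inside every range and matches, while an object of height 95 does not.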
- FIG. 4 is a method 400 for publishing alerts in a behavioral recognition system configured with alert directives, according to one embodiment.
- the method 400 begins at step 405 , where the machine learning engine loads an alert directives list (and a focus mask, if applicable).
- the machine learning engine loads the alert directives list at system startup.
- the user interface sends information of the alert directive to the machine learning engine.
- the machine learning engine processes a behavioral event. For instance, the machine learning engine may process information generated by the computer vision engine corresponding to a person standing at a point in the scene. By this point, the machine learning engine has completed its learning procedures.
- the machine learning engine searches the alert directives list to determine whether the behavior corresponds to an alert directive based on matching criteria. If there is a matching alert directive (step 425 ), then the machine learning engine bypasses the normal publication process and publishes an alert to the user interface.
- the alert directives list may include a directive to always issue an “unusual location” alert for any person (i.e., an object model corresponding to a person) standing in a certain position of a scene, given tolerances for height, width, and the person's central (x,y) position. If the observed person's height, width, and location coordinates match the alert directive, the behavioral recognition system immediately publishes an “unusual location” alert. However, if there is no matching alert directive (step 430 ), the machine learning engine proceeds with the normal publication process and evaluates the event for anomalous behavior.
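The control flow of method 400 can be summarized in a few lines. In this sketch, each directive is a plain dictionary with a `matches` callable, an `always_alert` flag, and an `alert_type`; `normal_process` stands in for the usual anomaly evaluation and normalization path, and `publish` for the alert generator. All of these names are hypothetical.

```python
def handle_event(event, directives, normal_process, publish):
    """Apply the first matching alert directive, else fall back to normal handling.

    A matching directive bypasses the normal publication process entirely:
    the event is either published at once or discarded, depending on the
    directive's match criteria.
    """
    for directive in directives:
        if directive["matches"](event):
            if directive["always_alert"]:
                publish(directive["alert_type"], event)
                return "published_by_directive"
            return "discarded_by_directive"
    # No directive matched: evaluate the event for anomalous behavior as usual.
    return normal_process(event)
```

Because the directive lookup happens only after the event has been processed for learning, neither branch changes what the system has learned about the scene.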
- FIG. 5 is an example graphical representation of a set of tolerances applied to a trajectory alert, according to one embodiment. Because a trajectory takes place over a series of video frames, the machine learning engine matches trajectory-based events to an alert directive differently from behavioral events that happen at a point in the scene, and thus tolerances are also created differently.
- the original trajectory 515 represents a trajectory that resulted in an alert. As shown, the original trajectory 515 includes a starting point 505 and an ending point 510 , with a distance 525 between the two points.
- an alert directive for the original trajectory 515 includes a set of coordinates corresponding to the path.
- a user may, through a graphical interface, assign tolerances to the trajectory so that future occurrences of the trajectory are not required to strictly adhere to the coordinates of original trajectory 515 .
- the user specifies a tolerance region (represented by the region enclosed by dotted lines) for a trajectory to occur to trigger the alert directive.
- an object traveling on an alternate trajectory 520 triggers the alert directive because the trajectory is within the set of tolerances (shown by being within the region enclosed by the dotted lines).
- FIG. 6 is an example graphical representation of an alert directive and a focused alert directive applied to a particular alert within a scene, according to one embodiment.
- a behavioral recognition system is focused on a train platform.
- Images 605 , 610 , 615 , and 620 all represent an image of the same alert provided to a user.
- Image 605 represents the original alert, with a bounding box 606 around a person (i.e. pixels classified by the machine learning engine as a person) who triggered the alert.
- the alert is an “unusual location” alert.
- this alert data may include height and width pixel values of the object as well as the object's center (x,y) position.
- Image 610 represents a user creating an alert directive by drawing a wider bounding box 607 around the original alert.
- the user sets larger tolerances for the machine learning engine to match when processing similar occurrences within that area.
- a person appearing in the shaded part of the scene depicted in the wider bounding box 607 triggers an alert directive for an “unusual location” alert.
- Images 615 and 620 represent a user creating a focused alert directive for the “unusual location” alert depicted in image 605 .
- To create a focused alert directive from an existing alert, a user first creates a bounding box 616 over a portion where the user would like to apply a focus mask. Within that bounding box, a user can select a region (or regions), and a focus mask results from the intersection of the bounding box and the selected region(s). Thereafter, if a person wanders onto the railroad tracks in the scene, the machine learning engine processes this behavior using the focused alert directive and publishes an alert.
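The focus mask construction can be sketched with axis-aligned rectangles. Treating both the bounding box and the selected regions as `(x0, y0, x1, y1)` rectangles is an assumption made for brevity (a real mask could be an arbitrary pixel set), and the function names are hypothetical.

```python
def intersect(a, b):
    """Intersect two rectangles (x0, y0, x1, y1); return None if they do not overlap."""
    x0, y0 = max(a[0], b[0]), max(a[1], b[1])
    x1, y1 = min(a[2], b[2]), min(a[3], b[3])
    return (x0, y0, x1, y1) if x0 < x1 and y0 < y1 else None


def build_focus_mask(bounding_box, regions):
    """The mask is the part of each selected region that lies inside the bounding box."""
    clipped = (intersect(bounding_box, region) for region in regions)
    return [rect for rect in clipped if rect is not None]


def in_focus_mask(mask, x, y):
    """True if the point (x, y) falls inside any rectangle of the mask."""
    return any(x0 <= x < x1 and y0 <= y < y1 for x0, y0, x1, y1 in mask)
```

An object whose center position falls inside the mask, and whose other characteristics are within the directive's tolerances, would then trigger the focused alert directive.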
- embodiments of the present invention provide techniques of configuring a behavioral recognition system to generate an alert. More specifically, by creating alert directives (or focused alert directives) for a machine learning engine to follow, certain events always or never result in an alert.
- this approach does not impede the unsupervised learning process of the behavioral recognition system because when a behavioral event triggers an alert directive, the machine learning engine has already completed its learning process.
Landscapes
- Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Medical Informatics (AREA)
- Human Computer Interaction (AREA)
- Emergency Management (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Computer Security & Cryptography (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Image Analysis (AREA)
- Alarm Systems (AREA)
- Burglar Alarm Systems (AREA)
- Closed-Circuit Television Systems (AREA)
- Image Processing (AREA)
Abstract
Description
- This application is a continuation of co-pending U.S. patent application Ser. No. 13/839,587, filed on Mar. 15, 2013, which in turn claims priority to and benefit of U.S. Provisional Application Ser. No. 61/611,284, filed on Mar. 15, 2012; the entire contents of each aforementioned application are herein expressly incorporated by reference for all purposes.
- Embodiments of the present invention generally relate to configuring a behavioral recognition-based video surveillance system to generate alerts for certain events. More specifically, the embodiments provide techniques allowing a behavioral recognition system to identify events that should always or never result in an alert without impeding the unsupervised learning process of the surveillance system.
- Some currently available video surveillance systems provide simple object recognition capabilities. For example, a video surveillance system may be configured to classify a group of pixels (referred to as a “blob”) in a given frame as being a particular object (e.g., a person or vehicle). Once identified, a “blob” may be tracked from frame-to-frame in order to follow the “blob” moving through the scene over time, e.g., a person walking across the field of vision of a video surveillance camera. Further, such systems may be configured to determine when an object has engaged in certain predefined behaviors. For example, the system may include definitions used to recognize the occurrence of a number of predefined events, e.g., the system may evaluate the appearance of an object classified as depicting a car (a vehicle-appear event) coming to a stop over a number of frames (a vehicle-stop event). Thereafter, a new foreground object may appear and be classified as a person (a person-appear event) and the person then walks out of frame (a person-disappear event). Further, the system may be able to recognize the combination of the first two events as a “parking-event.” Such surveillance systems typically require that the objects and/or behaviors which may be recognized by the system be defined in advance. Thus, in practice, these systems rely on predefined definitions for objects and/or behaviors to evaluate a video sequence. More generally, such systems rely on predefined rules and static patterns and are thus often unable to dynamically identify objects, events, behaviors, or patterns, much less even classify them as either normal or anomalous.
- On the other hand, a behavioral recognition system is a type of video surveillance system that may be configured to learn, identify, and recognize patterns of behavior by observing a sequence of individual frames, otherwise known as a video stream. Unlike rules-based video surveillance systems, a behavioral recognition system instead learns objects and behavioral patterns by generalizing video input and building memories of what is observed. Over time, a behavioral recognition system uses these memories to distinguish between normal and anomalous behavior captured in the field of view of a video stream. Upon detecting anomalous behavior, the behavioral recognition system publishes an alert to a user notifying the user of the behavior. After several recurrences of a particular event, the behavioral recognition system learns that the event is non-anomalous and ceases publishing subsequent alerts. For example, a behavioral recognition system focused on a building corridor may initially publish alerts each time an individual appears in the corridor at a certain time of day within the field of view of the camera. If this event occurs a sufficient number of times, the behavioral recognition system may learn that this is non-anomalous behavior and stop alerting a user to this event.
- However, although in many cases this is how a user expects such a system to work, in some instances the user may want the behavioral recognition system to always publish an alert for a particular behavioral event. Returning to the previous example, if the corridor were of limited access, security personnel may want to be notified each time someone appears in the corridor to ascertain that the only people in the corridor are those authorized to be there. Conversely, the user may never want the behavioral recognition system to publish an alert for a particular behavior. This situation may arise where the event occurs often but infrequently enough to result in an alert. For example, a behavioral recognition system focused on a room in a building that is next to a construction site may create alerts whenever construction vehicles pass through the field of view of the camera outside a window in the room. In this instance, security personnel may not want the behavioral recognition system to ever alert on these occurrences.
- Behavioral recognition systems by their very nature avoid the use of predefined rules wherever possible in favor of unsupervised learning. Thus, approaching a solution for these issues requires a natural method for providing feedback to a behavioral recognition system regarding which behaviors should always or never result in an alert.
- One embodiment of the invention provides a method for alerting a user to behavior corresponding to an alert directive. This method may generally include obtaining characteristic values from an observed event in a scene. This method may also include parsing a list of alert directives for a matching alert directive having ranges of criteria values. If the characteristic values are within the ranges of the criteria values, then the observed event corresponds to a matching alert directive. This method may also include, upon identifying the matching alert directive, alerting the user to the observed event.
- Additionally, the characteristic values of the observed event may be a pixel-height value, a pixel-width value, and an x- and y-coordinate center position of a foreground object. The characteristic values may also be a set of x- and y-coordinates corresponding to a trajectory of a foreground object. Further, the matching alert directive may have a focus mask that intersects with a region in the scene where the observed event occurred.
- Other embodiments include, without limitation, a computer-readable medium that includes instructions that enable a processing unit to implement one or more aspects of the disclosed methods as well as a system having a processor, memory, and application programs configured to implement one or more aspects of the disclosed methods.
- So that the manner in which the above recited features, advantages, and objects of the present invention are attained and can be understood in detail, a more particular description, briefly summarized above, may be had by reference to the embodiments illustrated in the appended drawings.
- It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the disclosure may admit to other equally effective embodiments.
- FIG. 1 illustrates components of a video analysis system, according to one embodiment.
- FIG. 2 further illustrates components of the video analysis system shown in FIG. 1, according to one embodiment.
- FIG. 3 illustrates an example of an alert database in a client device, according to one embodiment.
- FIG. 4 illustrates a method for publishing alerts in a behavioral recognition system configured with alert directives and focused alert directives, according to one embodiment.
- FIG. 5 illustrates an example graphical representation of a set of tolerances applied to a trajectory alert, according to one embodiment.
- FIG. 6 illustrates an example graphical representation of an alert directive and a focused alert directive applied to a particular alert, according to one embodiment.
- Embodiments of the invention disclosed herein provide techniques for creating alert directives and focused alert directives in a behavioral recognition-based video surveillance system. That is, the disclosed techniques allow a user of a behavioral recognition system to identify previously alerted behavior that should always or never result in a subsequent alert. Because alert directives override only the behavioral recognition system's normal alert publication procedures (which take place after the system has already performed its learning procedures), this approach does not disrupt the behavioral recognition system's unsupervised learning.
- In one embodiment, a behavioral recognition system includes a computer vision engine and a machine learning engine. The computer vision engine may be configured to process a field of view captured within a video stream. This field of view is generally referred to as the “scene.” In processing, the computer vision engine separates foreground objects (e.g., objects resembling people, vehicles, etc.) from background objects (e.g., objects resembling pavement, the sky, etc.). After processing the scene, the computer vision engine may generate information streams of observed activity (e.g., appearance features, kinematic features, etc.) and pass the streams to the machine learning engine. In turn, the machine learning engine may be configured to learn object behaviors in the scene using that information. In addition to learning-based behavior, a machine learning engine may be configured to build models of certain behaviors within a scene and determine whether observations indicate that the behavior of an object is anomalous, relative to the model. Upon detecting anomalous behavior, the machine learning engine generates an alert. After determining that the alert should be published, the behavioral recognition system publishes the alert to a user interface. The user interface may contain a database of previously issued alerts that are generally accessible to a user of the system. The user can view these alerts as a list, where each list item displays information of the alert and may include corresponding video or image data.
- After publishing a sufficient number of alerts for a particular behavioral event, the machine learning engine learns the event is a non-anomalous occurrence and ceases to publish subsequent alerts for the event. In one embodiment, a user may create an alert directive to override the normal alert publication process. An alert directive allows a user to provide feedback to the machine learning engine to either always or never create an alert for a certain behavioral event. The machine learning engine consults a list of alert directive definitions after learning information streams relating to an event and before evaluating the event for anomalous behavior. Thus, alert directives do not hinder the machine learning engine's learning procedures.
- To create an alert directive, a user selects an event occurrence or an alert previously generated by the system to use as a template. For example, a user may parse through a database of previous alerts for a scene, characterized based on time, type, name, event, or otherwise, as well as view underlying video of the activity that caused the alert. In one embodiment, a user may do this via a dialog box in an alert browser on the user interface. After selecting an alert, the user defines alert directive matching criteria. In one embodiment, the criteria may include whether the behavior should always or never result in an alert, how frequently the alert should be published (e.g., in situations where the behavior results in numerous alerts within a short time span), and whether the machine learning engine should match behaviors or object types (or both). Once the user has defined the matching criteria, the user interface creates an alert directive in the alert database with references pointing back to the original alert used to create it. Thereafter, the user interface sends information about the alert directive to the machine learning engine.
- Note that matching an alert directive to alert behavior by the machine learning engine may depend on both the alert type and a series of parameters specified by a user. For example, once a user selects an alert to use for an alert directive, the corresponding video of that alert may show a person in front of a security door, along with a bounding box indicating the pixels classified by the system as depicting that person. In such a case, a graphical editor may allow a user to adjust the bounding box around the person in the selected alert to adjust the tolerances for the alert directive, creating a range, e.g., for the center (x,y) position of a foreground object, etc. Similarly, a user may specify another bounding box for the relative position of the person in front of the door, i.e., a tolerance for object position. By adjusting tolerances for the size and pose of a person and for the position of such a person, an alert directive may be used to specify a region in front of the security door, so that whenever any person is observed to be present, the machine learning engine creates an alert. Thus, this approach allows for variation in height, position, width, speed, etc., of observed objects to still satisfy the alert directive definitions.
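As an illustration of how such tolerances might be checked, the sketch below treats each tolerance as a closed numeric range and tests an observed object against every range. The names `Range`, `Tolerances`, and `within_tolerances`, and the dictionary layout of the observed object, are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    low: float
    high: float

    def contains(self, value):
        return self.low <= value <= self.high


@dataclass(frozen=True)
class Tolerances:
    """Ranges derived, hypothetically, from the user-adjusted bounding boxes."""
    center_x: Range
    center_y: Range
    height: Range
    width: Range


def within_tolerances(obj, tol):
    """obj is a dict with keys center_x, center_y, height, width (in pixels)."""
    return all(
        getattr(tol, name).contains(obj[name])
        for name in ("center_x", "center_y", "height", "width")
    )
```

An object whose center, height, and width all fall inside the ranges satisfies the directive; an object outside any one range does not.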
- In a further embodiment, an alert directive may be expanded to provide a focused alert directive. This approach extends the cases where the machine learning engine can apply an alert directive to behavioral events in the scene. For example, a camera may focus on a building corridor with multiple security doors. In such a case, using ordinary alert directives, a user must create and tune a separate alert directive for each door in order to be alerted whenever someone appears in front of a door. As an alternative, a focused alert directive allows the user to create both an alert directive (e.g., for a person appearing in front of the security door) and a focus mask specifying different regions in the scene which should result in an alert.
- That is, rather than specifying a tolerance for the position of a person in front of a single security door, the user can extend the position tolerance to the full field of view. The user defines one or more regions of the scene where an alert should be generated when a foreground object otherwise within the tolerances of the alert directive is observed. For example, the alert of the person appearing in front of the first door (within the alert directive tolerances for height, width, and pose) defines an alert directive, but the position is extended to be the entire field of view of the camera, intersected with the user-defined regions. So, to create a focused alert directive in the given example, where a camera is focused on an area with multiple security doors, the user would select a “person appears” alert for a person in front of any one of the doors, specify tolerances around the appearance of that person using a graphical editor, and then create a mask for the position of the “person appears” alert that includes the regions generally in front of each door.
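One way to picture a focused alert directive is as an appearance check combined with a position check against any of the user-defined regions. The sketch below assumes rectangular regions and uses hypothetical names (`Region`, `matches_focused`); the disclosure does not limit a focus mask to rectangles.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Axis-aligned region of the scene in pixel coordinates (an assumption)."""
    x_min: int
    y_min: int
    x_max: int
    y_max: int

    def contains(self, x, y):
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max


def matches_focused(obj, height_range, width_range, focus_regions):
    """Appearance must be within tolerance; position may be in any focus region."""
    h_lo, h_hi = height_range
    w_lo, w_hi = width_range
    appearance_ok = (
        h_lo <= obj["height"] <= h_hi and w_lo <= obj["width"] <= w_hi
    )
    in_focus = any(
        region.contains(obj["center_x"], obj["center_y"])
        for region in focus_regions
    )
    return appearance_ok and in_focus
```

With one region per door, a single directive covers every door in the corridor, which is the saving the focused alert directive is meant to provide.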
- Once a user creates an alert directive (or focused alert directive), the user interface sends information of the alert directive (and focus mask, if applicable) to the machine learning engine. When the machine learning engine processes information of subsequent events that matches an alert directive's match criteria and tolerances, the machine learning engine bypasses the normal publication methods of the behavioral recognition system and immediately publishes an alert or discards the event (given the matching criteria), irrespective of whether the machine learning engine regards the observed behavior as anomalous. This approach does not change the learned state regarding a particular scene or influence the undirected learning of the machine learning engine. In all cases, the machine learning engine has already performed its learning procedures before applying the alert directive.
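The ordering described above (learn first, then consult alert directives, then fall back to normal evaluation) can be sketched as a single function. This is a simplified illustration under stated assumptions: directives are modeled as `(matches, always_alert)` pairs, `learn` and `is_anomalous` are caller-supplied callables, and the normal publication path is reduced to one anomaly test. All of these names are hypothetical.

```python
def process_event(event, directives, learn, is_anomalous):
    """Return 'publish', 'discard', or 'ignore' for one behavioral event.

    directives: iterable of (matches, always_alert) pairs, where `matches`
    is a callable taking the event and `always_alert` is a bool.
    """
    # Learning happens before any directive is consulted, so directives
    # never change what the engine learns about the scene.
    learn(event)

    for matches, always_alert in directives:
        if matches(event):
            # A matching directive bypasses the normal publication path.
            return "publish" if always_alert else "discard"

    # No directive matched: fall back to the normal anomaly evaluation.
    return "publish" if is_anomalous(event) else "ignore"
```

Because `learn` is called unconditionally at the top, an event that is discarded by a "never" directive still contributes to the learned model.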
- In the following, reference is made to embodiments of the disclosure. However, it should be understood that the disclosure is not limited to any specifically described embodiment. Instead, any combination of the following features and elements, whether related to different embodiments or not, is contemplated to implement and practice what is disclosed. Furthermore, in various embodiments the present invention provides numerous advantages over the prior art. However, although embodiments may achieve advantages over other possible solutions and/or over the prior art, whether or not a particular advantage is achieved by a given embodiment is not limiting. Thus, the following aspects, features, embodiments and advantages are merely illustrative and are not considered elements or limitations of the appended claims except where explicitly recited in a claim(s). Likewise, any reference to “the invention” or “the disclosure” shall not be construed as a generalization of any inventive subject matter disclosed herein and shall not be considered to be an element or limitation of the appended claims except where explicitly recited in a claim(s).
- One embodiment of the present invention is implemented as a program product for use with a computer system. The program(s) of the program product defines functions of the embodiments (including the methods described herein) and can be contained on a variety of computer-readable storage media. Examples of computer-readable storage media include (i) non-writable storage media (e.g., read-only memory devices within a computer such as CD-ROM or DVD-ROM disks readable by an optical media drive) on which information is permanently stored; and (ii) writable storage media (e.g., a hard-disk drive) on which alterable information is stored. Such computer-readable storage media, when carrying computer-readable instructions that direct the functions of the invention, are embodiments of the invention. Other example media include communications media through which information is conveyed to a computer, such as through a computer or telephone network, including wireless communications networks.
- In general, the routines executed to implement the embodiments of the invention may be part of an operating system or a specific application, component, program, module, object, or sequence of instructions. The computer program of the invention is comprised typically of a multitude of instructions that will be translated by the native computer into a machine-readable format and hence executable instructions. Also, programs are comprised of variables and data structures that either reside locally to the program or are found in memory or on storage devices. In addition, various programs described herein may be identified based upon the application for which they are implemented in a specific embodiment of the disclosure. However, it should be appreciated that any particular program nomenclature that follows is used merely for convenience, and thus the present disclosure should not be limited to use solely in any specific application identified and/or implied by such nomenclature.
-
FIG. 1 illustrates components of a video analysis and behavioral recognition system 100, according to one embodiment. As shown, the behavioral recognition system 100 includes a video input source 105, a network 110, a computer system 115, and input and output devices 118 (e.g., a monitor, a keyboard, a mouse, a printer, and the like). The network 110 may transmit video data recorded by the video input 105 to the computer system 115. Illustratively, the computer system 115 includes a CPU 120, storage 125 (e.g., a disk drive, optical disk drive, floppy disk drive, and the like), and a memory 130 containing both a computer vision engine 135 and a machine learning engine 140. As described in greater detail below, the computer vision engine 135 and the machine learning engine 140 may provide software applications configured to analyze a sequence of video frames provided by the video input 105. -
Network 110 receives video data (e.g., video stream(s), video images, or the like) from the video input source 105. The video input source 105 may be a video camera, a VCR, DVR, DVD, computer, web-cam device, or the like. For example, the video input source 105 may be a stationary video camera aimed at a certain area (e.g., a subway station, a parking lot, a building entry/exit, etc.), which records the events taking place therein. Generally, the area within the camera's field of view is referred to as the scene. The video input source 105 may be configured to record the scene as a sequence of individual video frames at a specified frame-rate (e.g., 24 frames per second), where each frame includes a fixed number of pixels (e.g., 320×240). Each pixel of each frame may specify a color value (e.g., an RGB value) or grayscale value (e.g., a radiance value between 0-255). Further, the video stream may be formatted using known formats, e.g., MPEG2, MJPEG, MPEG4, H.263, H.264, and the like. - As noted above, the
computer vision engine 135 may be configured to analyze this raw information to identify active objects in the video stream, identify a variety of appearance and kinematic features used by a machine learning engine 140 to derive object classifications, derive a variety of metadata regarding the actions and interactions of such objects, and supply this information to the machine learning engine 140. And in turn, the machine learning engine 140 may be configured to evaluate, observe, learn and remember details regarding events (and types of events) that transpire within the scene over time. - In one embodiment, the
machine learning engine 140 receives the video frames and the data generated by the computer vision engine 135. The machine learning engine 140 may be configured to analyze the received data, cluster objects having similar visual and/or kinematic features, and build semantic representations of events depicted in the video frames. The machine learning engine 140 learns expected patterns of behavior for objects that map to a given cluster. Thus, over time, the machine learning engine learns from these observed patterns to identify normal and/or abnormal events. That is, rather than having patterns, objects, object types, or activities defined in advance, the machine learning engine 140 builds its own model of what different object types have been observed (e.g., based on clusters of kinematic and/or appearance features) as well as a model of expected behavior for a given object type. Thereafter, the machine learning engine can decide whether the behavior of an observed event is anomalous or not based on prior learning. - Data describing whether a normal/abnormal behavior/event has been determined and/or what such behavior/event is may be provided to
output devices 118 to issue alerts, for example, an alert message with corresponding video and image data presented on a GUI interface screen. Such output devices may also be configured with a database of previously issued alerts from which a user can create an alert directive. - In general, the
computer vision engine 135 and the machine learning engine 140 both process video data in real-time. However, time scales for processing information by the computer vision engine 135 and the machine learning engine 140 may differ. For example, in one embodiment, the computer vision engine 135 processes the received video data frame-by-frame, while the machine learning engine 140 processes data every N-frames. In other words, while the computer vision engine 135 may analyze each frame in real-time to derive a set of kinematic and appearance data related to objects observed in the frame, the machine learning engine 140 is not constrained by the real-time frame rate of the video input. - Note, however,
FIG. 1 illustrates merely one possible arrangement of the behavior-recognition system 100. For example, although the video input source 105 is shown connected to the computer system 115 via the network 110, the network 110 is not always present or needed (e.g., the video input source 105 may be directly connected to the computer system 115). Further, various components and modules of the behavior-recognition system 100 may be implemented in other systems. For example, in one embodiment, the computer vision engine 135 may be implemented as a part of a video input device (e.g., as a firmware component wired directly into a video camera). In such a case, the output of the video camera may be provided to the machine learning engine 140 for analysis. Similarly, the output from the computer vision engine 135 and machine learning engine 140 may be supplied over computer network 110 to other computer systems. For example, the computer vision engine 135 and machine learning engine 140 may be installed on a server system and configured to process video from multiple input sources (i.e., from multiple cameras). In such a case, a client application 250 running on another computer system may request (or receive) the results over network 110. -
FIG. 2 further illustrates components of the computer vision engine 135 and the machine learning engine 140 first illustrated in FIG. 1, according to one embodiment of the invention. As shown, the computer vision engine 135 includes a data ingestor 205, a detector 210, a tracker 215, a context event generator 220, an alert generator 225, and an event bus 230. - In one embodiment, the data ingestor 205 receives video input from the
video input source 105. The data ingestor 205 may be configured to preprocess the input data before sending it to the detector 210. The detector 210 may be configured to separate each frame of video provided into a stationary or static part (the scene background) and a collection of volatile parts (the scene foreground). The frame itself may include a two-dimensional array of pixel values for multiple channels (e.g., RGB channels for color video or grayscale channel or radiance channel for black and white video). In one embodiment, the detector 210 may model background states for each pixel using an adaptive resonance theory (ART) network. That is, each pixel may be classified as depicting scene foreground or scene background using an ART network modeling a given pixel. Of course, other approaches to distinguish between scene foreground and background may be used. - Additionally, the
detector 210 may be configured to generate a mask used to identify which pixels of the scene are classified as depicting foreground and, conversely, which pixels are classified as depicting scene background. The detector 210 then identifies regions of the scene that contain a portion of scene foreground (referred to as a foreground “blob” or “patch”) and supplies this information to subsequent stages of the pipeline. Additionally, pixels classified as depicting scene background may be used to generate a background image modeling the scene. - In one embodiment, the
detector 210 may be configured to detect the flow of a scene. Once the foreground patches have been separated, the detector 210 examines, from frame-to-frame, any edges and corners of all foreground patches. The detector 210 will identify foreground patches moving in a similar flow of motion as most likely belonging to a single object or a single association of motions and send this information to the tracker 215. - The
tracker 215 may receive the foreground patches produced by the detector 210 and generate computational models for the patches. The tracker 215 may be configured to use this information, and each successive frame of raw-video, to attempt to track the motion of an object depicted by a given foreground patch as it moves about the scene. That is, the tracker 215 provides continuity to other elements of the system by tracking a given object from frame-to-frame. It further calculates a variety of kinematic and/or appearance features of a foreground object, e.g., size, height, width, and area (in pixels), reflectivity, shininess, rigidity, speed, velocity, etc. - The
context event generator 220 may receive the output from other stages of the pipeline. Using this information, the context event generator 220 may be configured to generate a stream of context events regarding objects tracked (by tracker component 215). For example, the context event generator 220 may package a stream of micro feature vectors and kinematic observations of an object and output this to the machine learning engine 140, e.g., at a rate of 5 Hz. In one embodiment, the context events are packaged as a trajectory. As used herein, a trajectory generally refers to a vector packaging the kinematic data of a particular foreground object in successive frames or samples. Each element in the trajectory represents the kinematic data captured for that object at a particular point in time. Typically, a complete trajectory includes the kinematic data obtained when an object is first observed in a frame of video along with each successive observation of that object up to when it leaves the scene (or becomes stationary to the point of dissolving into the frame background). Accordingly, assuming computer vision engine 135 is operating at a rate of 5 Hz, a trajectory for an object is updated every 200 milliseconds, until complete. The context event generator 220 may also calculate and package the appearance data of every tracked object by evaluating the object for various appearance attributes such as shape, width, and other physical features and assigning each attribute a numerical score. - The
computer vision engine 135 may take the output from the pipeline components and supply it to the machine learning engine 140 through the event bus 230. Illustratively, the machine learning engine 140 includes a classifier 235, a semantic module 245, a mapper 240, a cognitive module 250, and a normalization module 265. - The
classifier 235 receives context events such as kinematic data and appearance data from the computer vision engine 135 and maps the data on a neural network. In one embodiment, the neural network is a combination of a self-organizing map (SOM) and an ART network, shown in FIG. 2 as a SOM-ART classifier 236. The data is clustered and combined by features occurring repeatedly in association with each other. Then, based on those recurring types, the classifier 235 defines types of objects. For example, the classifier 235 may define foreground patches that have, for example, a high shininess, rigidity, and reflectivity as a Type 1 object. These defined types then propagate throughout the rest of the system. - The
mapper 240 may use these types by searching for spatial and temporal correlations and behaviors across the system for patches to create maps of where and when events are likely or unlikely to happen. In one embodiment, the mapper 240 includes a temporal memory ART network 241, a spatial memory ART network 242, and statistical engines 243. For example, the mapper 240 may look for patches of Type 1 objects. The spatial memory ART network 242 uses the statistical engines 243 to create statistical data of these objects, such as where in the scene do these patches appear, in what direction do these patches tend to go, how fast do these patches go, whether these patches change direction, and the like. The mapper 240 then builds a neural network of this information, which becomes a memory template against which to compare object behaviors. The temporal memory ART network 241 uses the statistical engines 243 to create statistical data based on samplings of time slices. In one embodiment, initial sampling occurs at every thirty minute interval. If many events occur within a time slice, then the time resolution may be dynamically changed to a finer resolution. Conversely, if fewer events occur within a time slice, then the time resolution may be dynamically changed to a coarser resolution. - In one embodiment, the
semantic module 245 includes a phase space partitioning component 246. The semantic module 245 identifies patterns of motion or trajectories within a scene and analyzes the scene for anomalous behavior through generalization. By tessellating a scene and dividing the foreground patches into many different tesserae, the semantic module 245 can trace an object's trajectory and learn patterns from the trajectory. The semantic module 245 analyzes these patterns and compares them with other patterns. As objects enter a scene, the phase space partitioning component 246 builds an adaptive grid and maps the objects and their trajectories onto the grid. As more features and trajectories are populated onto the grid, the machine learning engine learns trajectories that are common to the scene and further distinguishes normal behavior from anomalous behavior. - In one embodiment, the
cognitive module 250 includes a perceptual memory 251, an episodic memory 252, a long-term memory 253, a workspace 254, and codelets 255. Generally, the workspace 254 provides a computational engine for the machine learning engine 140. For example, the workspace 254 may be configured to copy information from the perceptual memory 251, retrieve relevant memories from the episodic memory 252 and the long-term memory 253, and select which codelets 255 to execute. In one embodiment, each codelet 255 is a software program configured to evaluate different sequences of events and to determine how one sequence may follow (or otherwise relate to) another (e.g., a finite state machine). More generally, the codelet may provide a software module configured to detect interesting patterns from the streams of data fed to the machine learning engine. In turn, the codelet 255 may create, retrieve, reinforce, or modify memories in the episodic memory 252 and the long-term memory 253. By repeatedly scheduling codelets 255 for execution, copying memories and percepts to/from the workspace 254, the machine learning engine 140 performs a cognitive cycle used to observe, and learn, about patterns of behavior that occur within the scene. - In one embodiment, the
perceptual memory 251, the episodic memory 252, and the long-term memory 253 are used to identify patterns of behavior, evaluate events that transpire in the scene, and encode and store observations. Generally, the perceptual memory 251 receives the output of the computer vision engine 135 (e.g., a stream of context events). The episodic memory 252 stores data representing observed events with details related to a particular episode, e.g., information describing time and space details related to an event. That is, the episodic memory 252 may encode specific details of a particular event, i.e., “what and where” something occurred within a scene, such as a particular vehicle (car A) moved to a location believed to be a parking space (parking space 5) at 9:43 AM. - In contrast, the long-
term memory 253 may store data generalizing events observed in the scene. To continue with the example of a vehicle parking, the long-term memory 253 may encode information capturing observations and generalizations learned by an analysis of the behavior of objects in the scene such as “vehicles tend to park in a particular place in the scene,” “when parking, vehicles tend to move at a certain speed,” and “after a vehicle parks, people tend to appear in the scene proximate to the vehicle,” etc. Thus, the long-term memory 253 stores observations about what happens within a scene with much of the particular episodic details stripped away. In this way, when a new event occurs, memories from the episodic memory 252 and the long-term memory 253 may be used to relate and understand a current event, i.e., the new event may be compared with past experience, leading to reinforcement, decay, and adjustments to the information stored in the long-term memory 253, over time. In a particular embodiment, the long-term memory 253 may be implemented as an ART network and a sparse-distributed memory data structure. Importantly, however, this approach does not require the different object type classifications to be defined in advance. - In one embodiment,
modules of the machine learning engine 140 include anomaly detection components, and published alerts are stored in an alert database 270. Generally, if any anomaly detection component identifies anomalous behavior, the component generates an alert and passes the alert through the normalization module 265. For instance, an anomaly detection component 247 in the semantic module 245 detects unusual trajectories using learned patterns and models. If a foreground object exhibits loitering behavior, for example, the anomaly detection component 247 evaluates the object trajectory using loitering models, subsequently generates an alert, and sends the alert to the normalization module 265. Upon receiving an alert, the normalization module 265 evaluates whether the alert should be published based on the alert's rarity relative to previous alerts of that alert type. Once the normalization module 265 determines that the alert should be published, it passes the alert to the alert generator 225 (through event bus 230). - However, if an anomaly detection component identifies an event that matches an alert directive, then rather than evaluating the event for anomalous behavior, the anomaly detection component instead follows the match criteria of the alert directive. If the alert directive requires that an alert be published, the anomaly detection component sends an alert to the alert generator 225 (through event bus 230). Otherwise, the anomaly detection component discards the event. Note that in either case, the anomaly detection component does not send any information to the
normalization module 265 if the event data matches an alert directive. - In one embodiment, the
alert generator 225 resides in the computer vision engine 135. The alert generator 225 receives alert information from the anomaly detection components and the normalization module 265. The alert generator 225 publishes alert information to the GUI/client device 260. The GUI/client device stores this alert information in the alert database 270. The alert database 270 contains previously issued alerts and may be accessible to a user of the GUI/client device 260. -
FIG. 3 illustrates an example of an alert database 300 in a client device, according to one embodiment. The alert database 300 stores previously issued alerts that a user may parse through to create an alert directive. As shown, the alert database 300 includes a plurality of alerts and an alert directive list 305. Each alert 310 includes an identifier 311, a directive identifier 312, and a summary 313. The identifier 311 is a unique numerical value assigned to the alert 310. The directive identifier 312 is a numerical field that indicates whether the alert 310 has been assigned an alert directive. - The
summary 313 is a data-payload that contains a concise description of the data characterizing the alert. The summary 313 may include information about the type of anomaly, what time the anomaly occurred, height and width values and an x- and y-coordinate of an object (if the anomaly occurred at a point in time), a set of x- and y-coordinates corresponding to a trajectory (if the anomaly occurred over a series of frames), and the like. Alert directives evaluate object behaviors or object types (or both) that match the information provided in the summary 313. - The
alert directive list 305 includes a plurality of alert directives. Each alert directive 320 has an identifier 321, an alert pointer 322, match criteria 323, and an epilog 324. The identifier 321 of the alert directive is a unique numerical value assigned to an alert directive. Alert pointer 322 is a pointer to the original alert to which the alert directive corresponds. By pointing to the original alert, the alert directive 320 can access the data provided by summary 313. In one embodiment, the information contained in summary 313 may be stored as a data packet in a corresponding alert directive 320. -
Match criteria 323 contains user-specified information of how the alert directive should process a certain event, such as whether the machine learning engine should publish an alert or discard the behavior, and whether to match an alert directive to a behavior or to an object type (or both). For example, if a user chooses to disregard matching behavior for an “unusual location” alert, the machine learning engine may create alerts for an object at rest at the location specified by the alert directive, and it may create alerts for an object moving rapidly through the same location. As another example, if a user chooses to disregard types in matching for an “unusual location” alert, the machine learning engine may create alerts for an object corresponding to a learning-based classification type 1 (e.g., a car) positioned at the location, and the machine learning engine may also create alerts for an object corresponding to a learning-based classification type 2 (e.g., a person) positioned at the location. - The
epilog 324 is an array of tolerance values of each corresponding alert characteristic in the data provided by summary 313. Tolerances provide the machine learning engine with flexibility in matching object behaviors and types to an alert directive, as the likelihood of matching two objects having the same characteristics (height, width and the center (x,y) position) in a scene is very low. In one embodiment, a user defines these tolerances by using a graphical editor on a selected alert. By drawing a bounding box around the object that triggered the alert, the user can adjust the tolerances for the alert directive, creating a range for several characteristics of the selected alert (e.g., for the heights and widths of the object). -
FIG. 4 is a method 400 for publishing alerts in a behavioral recognition system configured with alert directives, according to one embodiment. The method 400 begins at step 405, where the machine learning engine loads an alert directives list (and a focus mask, if applicable). In one embodiment, the machine learning engine loads the alert directives list at system startup. Additionally, when a user creates an alert directive after startup has occurred, the user interface sends information of the alert directive to the machine learning engine. At step 410, the machine learning engine processes a behavioral event. For instance, the machine learning engine may process information generated by the computer vision engine corresponding to a person standing at a point in the scene. By this point, the machine learning engine has completed its learning procedures. At step 415, the machine learning engine searches the alert directives list to determine whether the behavior corresponds to an alert directive based on matching criteria. If there is a matching alert directive (step 425), then the machine learning engine bypasses the normal publication process and publishes an alert to the user interface. In the ongoing example, the alert directives list may include a directive to always issue an “unusual location” alert for any person (i.e., an object model corresponding to a person) standing in a certain position of a scene, given tolerances for height, width, and the person's central (x,y) position. If the observed person's height, width, and location coordinates match the alert directive, the behavioral recognition system immediately publishes an “unusual location” alert. However, if there is no matching alert directive (step 430), the machine learning engine proceeds with the normal publication process and evaluates the event for anomalous behavior. -
FIG. 5 is an example graphical representation of a set of tolerances applied to a trajectory alert, according to one embodiment. Because a trajectory takes place over a series of video frames, the machine learning engine matches trajectory-based events to an alert directive differently from behavioral events that happen at a point in the scene, and thus tolerances are also created differently. The original trajectory 515 represents a trajectory that resulted in an alert. As shown, the original trajectory 515 includes a starting point 505 and an ending point 510, with a distance 525 between the two points. In addition to these components, an alert directive for the original trajectory 515 includes a set of coordinates corresponding to the path. In one embodiment, a user may, through a graphical interface, assign tolerances to the trajectory so that future occurrences of the trajectory are not required to strictly adhere to the coordinates of original trajectory 515. By creating bounding boxes around both starting point 505 and ending point 510, the user specifies a tolerance region (represented by the region enclosed by dotted lines) for a trajectory to occur to trigger the alert directive. Thus, an object traveling on an alternate trajectory 520 triggers the alert directive because the trajectory is within the set of tolerances (shown by being within the region enclosed by the dotted lines). -
FIG. 6 is an example graphical representation of an alert directive and a focused alert directive applied to a particular alert within a scene, according to one embodiment. In this example, a behavioral recognition system is focused on a train platform. Image 605 represents the original alert, with a bounding box 606 around a person (i.e., pixels classified by the machine learning engine as a person) who triggered the alert. For the purposes of this example, assume that the alert is an “unusual location” alert. In the behavioral recognition system, this alert data may include height and width pixel values of the object as well as the object's center (x,y) position. Image 610 represents a user creating an alert directive by drawing a wider bounding box 607 around the original alert. By creating a wider bounding box 607, the user sets larger tolerances for the machine learning engine to match when processing similar occurrences within that area. Thus, a person appearing in the shaded part of the scene depicted in the wider bounding box 607 triggers an alert directive for an “unusual location” alert.
The user may want the same “unusual location” alert directive to apply to objects appearing within the area of the scene corresponding to railroad tracks. Accordingly, the user may create a focused alert directive to accomplish this.
Images image 605. To create a focused alert directive from an existing alert, a user first creates a bounding box 616 over a portion of the scene where the user would like to apply a focus mask. Within that bounding box, a user can select a region (or regions), and a focus mask results from the intersection of the bounding box and the selected region(s). Thereafter, if a person wanders onto the railroad tracks in the scene, the machine learning engine processes this behavior using the focused alert directive and publishes an alert.
As described, embodiments of the present invention provide techniques for configuring a behavioral recognition system to generate an alert. More specifically, by creating alert directives (or focused alert directives) for a machine learning engine to follow, certain events always or never result in an alert. Advantageously, this approach does not impede the unsupervised learning process of the behavioral recognition system because when a behavioral event triggers an alert directive, the machine learning engine has already completed its learning process.
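The focus mask described for FIG. 6, formed from the intersection of a bounding box and one or more selected regions, can be illustrated with a short Python sketch. The representation is an assumption made for the example: regions are modeled as sets of integer pixel coordinates, and the function names `box_pixels`, `build_focus_mask`, and `in_focus` are hypothetical. The disclosure does not specify how regions are stored or selected.

```python
from typing import Iterable, Set, Tuple

Pixel = Tuple[int, int]  # (x, y) pixel coordinate


def box_pixels(x_min: int, y_min: int, x_max: int, y_max: int) -> Set[Pixel]:
    """Return every pixel inside an inclusive axis-aligned box."""
    return {
        (x, y)
        for x in range(x_min, x_max + 1)
        for y in range(y_min, y_max + 1)
    }


def build_focus_mask(
    bounding_box: Set[Pixel], selected_regions: Iterable[Set[Pixel]]
) -> Set[Pixel]:
    """Intersect the bounding box with the union of the selected regions."""
    selected: Set[Pixel] = set()
    for region in selected_regions:
        selected |= region
    return bounding_box & selected


def in_focus(mask: Set[Pixel], center: Pixel) -> bool:
    """Return True if an object's center pixel lies inside the focus mask."""
    return center in mask
```

With a bounding box covering pixels 0 through 9 on each axis and a selected region covering pixels 5 through 14, the resulting mask covers pixels 5 through 9, so only objects centered in that overlap would be evaluated against the focused alert directive.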
While the foregoing is directed to embodiments of the present disclosure, other and further embodiments of the disclosure may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.
Claims (21)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/119,227 US20190188998A1 (en) | 2012-03-15 | 2018-08-31 | Alert directives and focused alert directives in a behavioral recognition system |
US17/378,530 US11727689B2 (en) | 2012-03-15 | 2021-07-16 | Alert directives and focused alert directives in a behavioral recognition system |
US18/213,516 US12094212B2 (en) | 2012-03-15 | 2023-06-23 | Alert directives and focused alert directives in a behavioral recognition system |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261611284P | 2012-03-15 | 2012-03-15 | |
US13/839,587 US10096235B2 (en) | 2012-03-15 | 2013-03-15 | Alert directives and focused alert directives in a behavioral recognition system |
US16/119,227 US20190188998A1 (en) | 2012-03-15 | 2018-08-31 | Alert directives and focused alert directives in a behavioral recognition system |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/839,587 Continuation US10096235B2 (en) | 2012-03-15 | 2013-03-15 | Alert directives and focused alert directives in a behavioral recognition system |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/378,530 Continuation US11727689B2 (en) | 2012-03-15 | 2021-07-16 | Alert directives and focused alert directives in a behavioral recognition system |
Publications (1)
Publication Number | Publication Date |
---|---|
US20190188998A1 (en) | 2019-06-20 |
Family
ID=49157102
Family Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/836,372 Active - Reinstated US9208675B2 (en) | 2012-03-15 | 2013-03-15 | Loitering detection in a video surveillance system |
US13/836,730 Active 2034-03-08 US9349275B2 (en) | 2012-03-15 | 2013-03-15 | Alert volume normalization in a video surveillance system |
US13/839,587 Active 2036-09-25 US10096235B2 (en) | 2012-03-15 | 2013-03-15 | Alert directives and focused alert directives in a behavioral recognition system |
US15/163,461 Abandoned US20160267777A1 (en) | 2012-03-15 | 2016-05-24 | Alert volume normalization in a video surveillance system |
US15/938,759 Active 2034-01-18 US11217088B2 (en) | 2012-03-15 | 2018-03-28 | Alert volume normalization in a video surveillance system |
US16/119,227 Abandoned US20190188998A1 (en) | 2012-03-15 | 2018-08-31 | Alert directives and focused alert directives in a behavioral recognition system |
US17/378,530 Active US11727689B2 (en) | 2012-03-15 | 2021-07-16 | Alert directives and focused alert directives in a behavioral recognition system |
US18/213,516 Active US12094212B2 (en) | 2012-03-15 | 2023-06-23 | Alert directives and focused alert directives in a behavioral recognition system |
Family Applications Before (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/836,372 Active - Reinstated US9208675B2 (en) | 2012-03-15 | 2013-03-15 | Loitering detection in a video surveillance system |
US13/836,730 Active 2034-03-08 US9349275B2 (en) | 2012-03-15 | 2013-03-15 | Alert volume normalization in a video surveillance system |
US13/839,587 Active 2036-09-25 US10096235B2 (en) | 2012-03-15 | 2013-03-15 | Alert directives and focused alert directives in a behavioral recognition system |
US15/163,461 Abandoned US20160267777A1 (en) | 2012-03-15 | 2016-05-24 | Alert volume normalization in a video surveillance system |
US15/938,759 Active 2034-01-18 US11217088B2 (en) | 2012-03-15 | 2018-03-28 | Alert volume normalization in a video surveillance system |
Family Applications After (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/378,530 Active US11727689B2 (en) | 2012-03-15 | 2021-07-16 | Alert directives and focused alert directives in a behavioral recognition system |
US18/213,516 Active US12094212B2 (en) | 2012-03-15 | 2023-06-23 | Alert directives and focused alert directives in a behavioral recognition system |
Country Status (5)
Country | Link |
---|---|
US (8) | US9208675B2 (en) |
EP (2) | EP2826020A4 (en) |
CN (2) | CN104303218A (en) |
IN (2) | IN2014DN08349A (en) |
WO (2) | WO2013138719A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11217088B2 (en) | 2012-03-15 | 2022-01-04 | Intellective Ai, Inc. | Alert volume normalization in a video surveillance system |
Families Citing this family (94)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9959471B2 (en) | 2008-05-06 | 2018-05-01 | Careview Communications, Inc. | Patient video monitoring systems and methods for thermal detection of liquids |
US9141863B2 (en) * | 2008-07-21 | 2015-09-22 | Facefirst, Llc | Managed biometric-based notification system and method |
US20140118543A1 (en) * | 2012-10-31 | 2014-05-01 | Motorola Solutions, Inc. | Method and apparatus for video analysis algorithm selection based on historical incident data |
US9286693B2 (en) * | 2013-02-25 | 2016-03-15 | Hanwha Techwin Co., Ltd. | Method and apparatus for detecting abnormal movement |
US9639521B2 (en) * | 2013-08-09 | 2017-05-02 | Omni Ai, Inc. | Cognitive neuro-linguistic behavior recognition system for multi-sensor data fusion |
WO2015040929A1 (en) * | 2013-09-19 | 2015-03-26 | 日本電気株式会社 | Image processing system, image processing method, and program |
US20150199698A1 (en) * | 2014-01-14 | 2015-07-16 | Panasonic Intellectual Property Corporation Of America | Display method, stay information display system, and display control device |
US10389969B2 (en) * | 2014-02-14 | 2019-08-20 | Nec Corporation | Video processing system |
US11297284B2 (en) * | 2014-04-08 | 2022-04-05 | Udisense Inc. | Monitoring camera and mount |
US10708550B2 (en) * | 2014-04-08 | 2020-07-07 | Udisense Inc. | Monitoring camera and mount |
US9530080B2 (en) * | 2014-04-08 | 2016-12-27 | Joan And Irwin Jacobs Technion-Cornell Institute | Systems and methods for configuring baby monitor cameras to provide uniform data sets for analysis and to provide an advantageous view point of babies |
US20160132722A1 (en) * | 2014-05-08 | 2016-05-12 | Santa Clara University | Self-Configuring and Self-Adjusting Distributed Surveillance System |
US10140827B2 (en) | 2014-07-07 | 2018-11-27 | Google Llc | Method and system for processing motion event notifications |
US9501915B1 (en) | 2014-07-07 | 2016-11-22 | Google Inc. | Systems and methods for analyzing a video stream |
US10127783B2 (en) | 2014-07-07 | 2018-11-13 | Google Llc | Method and device for processing motion events |
US9754178B2 (en) * | 2014-08-27 | 2017-09-05 | International Business Machines Corporation | Long-term static object detection |
US9947215B2 (en) * | 2014-09-26 | 2018-04-17 | Harman International Industries, Incorporated | Pedestrian information system |
US9009805B1 (en) | 2014-09-30 | 2015-04-14 | Google Inc. | Method and system for provisioning an electronic device |
USD782495S1 (en) | 2014-10-07 | 2017-03-28 | Google Inc. | Display screen or portion thereof with graphical user interface |
CN105574889B (en) * | 2014-10-09 | 2019-06-07 | 中国科学院大学 | A kind of individual's anomaly detection method and system |
CN105631462A (en) * | 2014-10-28 | 2016-06-01 | 北京交通大学 | Behavior identification method through combination of confidence and contribution degree on the basis of space-time context |
US20160196728A1 (en) * | 2015-01-06 | 2016-07-07 | Wipro Limited | Method and system for detecting a security breach in an organization |
US9361011B1 (en) | 2015-06-14 | 2016-06-07 | Google Inc. | Methods and systems for presenting multiple live video feeds in a user interface |
WO2016201654A1 (en) * | 2015-06-17 | 2016-12-22 | 北京新目科技有限公司 | Information intelligent collection method and apparatus |
KR102015588B1 (en) * | 2015-07-16 | 2019-08-28 | 한화테크윈 주식회사 | Advanced wander alarm system and method thereof |
CN105184812B (en) * | 2015-07-21 | 2018-08-24 | 复旦大学 | A kind of pedestrian based on target following hovers detection method |
WO2017142736A1 (en) * | 2016-02-19 | 2017-08-24 | Carrier Corporation | Cloud based active commissioning system for video analytics |
US9939635B2 (en) | 2016-02-29 | 2018-04-10 | Brillio LLC | Method for providing notification in virtual reality device |
US9965382B2 (en) * | 2016-04-04 | 2018-05-08 | Omni Ai, Inc. | Data composite for efficient memory transfer in a behavioral recognition system |
US10628296B1 (en) | 2016-04-04 | 2020-04-21 | Omni Ai, Inc. | Data composite for efficient memory transfer in a behavorial recognition system |
USD854074S1 (en) | 2016-05-10 | 2019-07-16 | Udisense Inc. | Wall-assisted floor-mount for a monitoring camera |
US10506237B1 (en) | 2016-05-27 | 2019-12-10 | Google Llc | Methods and devices for dynamic adaptation of encoding bitrate for video streaming |
US10957171B2 (en) | 2016-07-11 | 2021-03-23 | Google Llc | Methods and systems for providing event alerts |
US10192415B2 (en) * | 2016-07-11 | 2019-01-29 | Google Llc | Methods and systems for providing intelligent alerts for events |
US10380429B2 (en) | 2016-07-11 | 2019-08-13 | Google Llc | Methods and systems for person detection in a video feed |
CN117902441A (en) * | 2016-07-29 | 2024-04-19 | 奥的斯电梯公司 | Monitoring system for passenger conveyor, passenger conveyor and monitoring method thereof |
CN106503618B (en) * | 2016-09-22 | 2019-09-17 | 天津大学 | Personnel based on video monitoring platform go around behavioral value method |
US10839203B1 (en) | 2016-12-27 | 2020-11-17 | Amazon Technologies, Inc. | Recognizing and tracking poses using digital imagery captured from multiple fields of view |
CA3041148C (en) * | 2017-01-06 | 2023-08-15 | Sportlogiq Inc. | Systems and methods for behaviour understanding from trajectories |
CA3056884A1 (en) * | 2017-03-17 | 2018-09-20 | Neurala, Inc. | Online, incremental real-time learning for tagging and labeling data streams for deep neural networks and neural network applications |
US10699421B1 (en) | 2017-03-29 | 2020-06-30 | Amazon Technologies, Inc. | Tracking objects in three-dimensional space using calibrated visual cameras and depth cameras |
TW201904265A (en) * | 2017-03-31 | 2019-01-16 | 加拿大商艾維吉隆股份有限公司 | Abnormal motion detection method and system |
US10410086B2 (en) | 2017-05-30 | 2019-09-10 | Google Llc | Systems and methods of person recognition in video streams |
US11783010B2 (en) | 2017-05-30 | 2023-10-10 | Google Llc | Systems and methods of person recognition in video streams |
US10296790B2 (en) | 2017-07-27 | 2019-05-21 | The Boeing Company | Coded visual markers for a surveillance system |
USD855684S1 (en) | 2017-08-06 | 2019-08-06 | Udisense Inc. | Wall mount for a monitoring camera |
US11134227B2 (en) | 2017-09-20 | 2021-09-28 | Google Llc | Systems and methods of presenting appropriate actions for responding to a visitor to a smart home environment |
US10664688B2 (en) | 2017-09-20 | 2020-05-26 | Google Llc | Systems and methods of detecting and responding to a visitor to a smart home environment |
US11232294B1 (en) | 2017-09-27 | 2022-01-25 | Amazon Technologies, Inc. | Generating tracklets from digital imagery |
WO2019097784A1 (en) * | 2017-11-16 | 2019-05-23 | ソニー株式会社 | Information processing device, information processing method, and program |
WO2019104108A1 (en) | 2017-11-22 | 2019-05-31 | Udisense Inc. | Respiration monitor |
US11284041B1 (en) | 2017-12-13 | 2022-03-22 | Amazon Technologies, Inc. | Associating items with actors based on digital imagery |
US11030442B1 (en) * | 2017-12-13 | 2021-06-08 | Amazon Technologies, Inc. | Associating events with actors based on digital imagery |
US11417150B2 (en) * | 2017-12-28 | 2022-08-16 | Nec Corporation | Information processing apparatus, method, and non-transitory computer-readable medium |
US10706701B2 (en) | 2018-01-12 | 2020-07-07 | Qognify Ltd. | System and method for dynamically ordering video channels according to rank of abnormal detection |
US10834365B2 (en) | 2018-02-08 | 2020-11-10 | Nortek Security & Control Llc | Audio-visual monitoring using a virtual assistant |
US11615623B2 (en) | 2018-02-19 | 2023-03-28 | Nortek Security & Control Llc | Object detection in edge devices for barrier operation and parcel delivery |
US11295139B2 (en) | 2018-02-19 | 2022-04-05 | Intellivision Technologies Corp. | Human presence detection in edge devices |
US10978050B2 (en) | 2018-02-20 | 2021-04-13 | Intellivision Technologies Corp. | Audio type detection |
US11417109B1 (en) * | 2018-03-20 | 2022-08-16 | Amazon Technologies, Inc. | Network-based vehicle event detection system |
CN110392228B (en) * | 2018-04-16 | 2021-06-04 | 宏碁股份有限公司 | Monitoring method and electronic device using the same |
EP3557549B1 (en) | 2018-04-19 | 2024-02-21 | PKE Holding AG | Method for evaluating a motion event |
US10795933B1 (en) * | 2018-05-01 | 2020-10-06 | Flock Group Inc. | System and method for object based query of video content captured by a dynamic surveillance network |
WO2020003951A1 (en) | 2018-06-26 | 2020-01-02 | コニカミノルタ株式会社 | Program executed by computer, information processing device, and method executed by computer |
US11468681B1 (en) | 2018-06-28 | 2022-10-11 | Amazon Technologies, Inc. | Associating events with actors using digital imagery and machine learning |
US11468698B1 (en) | 2018-06-28 | 2022-10-11 | Amazon Technologies, Inc. | Associating events with actors using digital imagery and machine learning |
US11482045B1 (en) | 2018-06-28 | 2022-10-25 | Amazon Technologies, Inc. | Associating events with actors using digital imagery and machine learning |
EP3629226B1 (en) | 2018-09-26 | 2020-11-25 | Axis AB | Method for converting alerts |
US11488374B1 (en) * | 2018-09-28 | 2022-11-01 | Apple Inc. | Motion trajectory tracking for action detection |
CN111115400B (en) | 2018-10-30 | 2022-04-26 | 奥的斯电梯公司 | System and method for detecting elevator maintenance behavior in an elevator hoistway |
USD900429S1 (en) | 2019-01-28 | 2020-11-03 | Udisense Inc. | Swaddle band with decorative pattern |
USD900430S1 (en) | 2019-01-28 | 2020-11-03 | Udisense Inc. | Swaddle blanket |
USD900431S1 (en) | 2019-01-28 | 2020-11-03 | Udisense Inc. | Swaddle blanket with decorative pattern |
USD900428S1 (en) | 2019-01-28 | 2020-11-03 | Udisense Inc. | Swaddle band |
SG10202005940YA (en) * | 2019-06-27 | 2021-01-28 | Bigobject Inc | Bionic computing system and cloud system thereof |
US11514767B2 (en) * | 2019-09-18 | 2022-11-29 | Sensormatic Electronics, LLC | Systems and methods for averting crime with look-ahead analytics |
US11900706B1 (en) | 2019-10-22 | 2024-02-13 | Objectvideo Labs, Llc | Object of interest detector distance-based multi-thresholding |
US11893795B2 (en) | 2019-12-09 | 2024-02-06 | Google Llc | Interacting with visitors of a connected home environment |
US11443516B1 (en) | 2020-04-06 | 2022-09-13 | Amazon Technologies, Inc. | Locally and globally locating actors by digital cameras and machine learning |
US11398094B1 (en) | 2020-04-06 | 2022-07-26 | Amazon Technologies, Inc. | Locally and globally locating actors by digital cameras and machine learning |
JP7363838B2 (en) * | 2021-03-02 | 2023-10-18 | トヨタ自動車株式会社 | Abnormal behavior notification device, abnormal behavior notification system, abnormal behavior notification method, and program |
US20220351319A1 (en) * | 2021-04-30 | 2022-11-03 | Here Global B.V. | Method, apparatus, and computer program product for quantifying public transport coverage |
CN113194297B (en) * | 2021-04-30 | 2023-05-23 | 重庆市科学技术研究院 | Intelligent monitoring system and method |
DE102021205480A1 (en) | 2021-05-28 | 2022-12-01 | Siemens Mobility GmbH | Procedure for training a surveillance system |
US11769394B2 (en) | 2021-09-01 | 2023-09-26 | Motorola Solutions, Inc. | Security ecosystem |
US11587416B1 (en) | 2021-09-01 | 2023-02-21 | Motorola Solutions, Inc. | Dynamic video analytics rules based on human conversation |
US20230186670A1 (en) * | 2021-09-28 | 2023-06-15 | Amizen Labs, LLC | Using Artificial Intelligence to Analyze Sensor Data to Detect Potential Change(s) for Risk and Threat Assessment and Identification |
JPWO2023058143A1 (en) * | 2021-10-06 | 2023-04-13 | ||
US12131613B2 (en) * | 2021-12-03 | 2024-10-29 | Honeywell International Inc. | Surveillance system for data centers and other secure areas |
US11606247B1 (en) * | 2022-01-21 | 2023-03-14 | Dell Products L.P. | Method and system for managing computer vision alerts using mobile agents in a computer vision environment |
US20230274551A1 (en) * | 2022-02-25 | 2023-08-31 | Johnson Controls Tyco IP Holdings LLP | Image-surveilled security escort |
EP4273812B1 (en) * | 2022-05-05 | 2024-04-10 | Axis AB | Device and method for loitering detection |
US12131539B1 (en) | 2022-06-29 | 2024-10-29 | Amazon Technologies, Inc. | Detecting interactions from features determined from sequences of images captured using one or more cameras |
WO2024103041A1 (en) * | 2022-11-10 | 2024-05-16 | Battelle Energy Alliance, Llc | System for activity detection and related methods |
Family Cites Families (96)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4679077A (en) | 1984-11-10 | 1987-07-07 | Matsushita Electric Works, Ltd. | Visual Image sensor system |
US5113507A (en) | 1988-10-20 | 1992-05-12 | Universities Space Research Association | Method and apparatus for a sparse distributed memory system |
JP3123587B2 (en) | 1994-03-09 | 2001-01-15 | 日本電信電話株式会社 | Moving object region extraction method using background subtraction |
WO1996029678A1 (en) | 1995-03-22 | 1996-09-26 | Idt International Digital Technologies Deutschland Gmbh | Method and apparatus for depth modelling and providing depth information of moving objects |
US6088468A (en) | 1995-05-17 | 2000-07-11 | Hitachi Denshi Kabushiki Kaisha | Method and apparatus for sensing object located within visual field of imaging device |
US7076102B2 (en) | 2001-09-27 | 2006-07-11 | Koninklijke Philips Electronics N.V. | Video monitoring system employing hierarchical hidden markov model (HMM) event learning and classification |
US5969755A (en) | 1996-02-05 | 1999-10-19 | Texas Instruments Incorporated | Motion based event detection system and method |
US7650015B2 (en) | 1997-07-22 | 2010-01-19 | Image Processing Technologies. LLC | Image processing method |
US5751378A (en) | 1996-09-27 | 1998-05-12 | General Instrument Corporation | Scene change detector for digital video |
US6263088B1 (en) | 1997-06-19 | 2001-07-17 | Ncr Corporation | System and method for tracking movement of objects in a scene |
US7036094B1 (en) * | 1998-08-10 | 2006-04-25 | Cybernet Systems Corporation | Behavior recognition system |
US6711278B1 (en) | 1998-09-10 | 2004-03-23 | Microsoft Corporation | Tracking semantic objects in vector image sequences |
US6570608B1 (en) | 1998-09-30 | 2003-05-27 | Texas Instruments Incorporated | System and method for detecting interactions of people and vehicles |
WO2000034919A1 (en) | 1998-12-04 | 2000-06-15 | Interval Research Corporation | Background estimation and segmentation based on range and color |
US7133537B1 (en) * | 1999-05-28 | 2006-11-07 | It Brokerage Services Pty Limited | Method and apparatus for tracking a moving object |
US6795567B1 (en) | 1999-09-16 | 2004-09-21 | Hewlett-Packard Development Company, L.P. | Method for efficiently tracking object models in video sequences via dynamic ordering of features |
US7136525B1 (en) | 1999-09-20 | 2006-11-14 | Microsoft Corporation | System and method for background maintenance of an image sequence |
US6674877B1 (en) | 2000-02-03 | 2004-01-06 | Microsoft Corporation | System and method for visually tracking occluded objects in real time |
US6940998B2 (en) | 2000-02-04 | 2005-09-06 | Cernium, Inc. | System for automated screening of security cameras |
US20050162515A1 (en) * | 2000-10-24 | 2005-07-28 | Objectvideo, Inc. | Video surveillance system |
US7868912B2 (en) | 2000-10-24 | 2011-01-11 | Objectvideo, Inc. | Video surveillance system employing video primitives |
US6678413B1 (en) | 2000-11-24 | 2004-01-13 | Yiqing Liang | System and method for object identification and behavior characterization using video analysis |
US20030107650A1 (en) * | 2001-12-11 | 2003-06-12 | Koninklijke Philips Electronics N.V. | Surveillance system with suspicious behavior detection |
US20060165386A1 (en) | 2002-01-08 | 2006-07-27 | Cernium, Inc. | Object selective video recording |
US7436887B2 (en) | 2002-02-06 | 2008-10-14 | Playtex Products, Inc. | Method and apparatus for video frame sequence-based object tracking |
US6856249B2 (en) | 2002-03-07 | 2005-02-15 | Koninklijke Philips Electronics N.V. | System and method of keeping track of normal behavior of the inhabitants of a house |
US7006128B2 (en) | 2002-05-30 | 2006-02-28 | Siemens Corporate Research, Inc. | Object detection for sudden illumination changes using order consistency |
US7227893B1 (en) | 2002-08-22 | 2007-06-05 | Xlabs Holdings, Llc | Application-specific object-based segmentation and recognition system |
US7200266B2 (en) | 2002-08-27 | 2007-04-03 | Princeton University | Method and apparatus for automated video activity analysis |
US6999600B2 (en) | 2003-01-30 | 2006-02-14 | Objectvideo, Inc. | Video scene background maintenance using change detection and classification |
KR100696728B1 (en) * | 2003-06-09 | 2007-03-20 | 가부시키가이샤 히다치 고쿠사이 덴키 | Apparatus and method for sending monitoring information |
US7026979B2 (en) | 2003-07-03 | 2006-04-11 | Hrl Labortories, Llc | Method and apparatus for joint kinematic and feature tracking using probabilistic argumentation |
US7127083B2 (en) | 2003-11-17 | 2006-10-24 | Vidient Systems, Inc. | Video surveillance system with object detection and probability scoring based on object class |
US7813525B2 (en) * | 2004-06-01 | 2010-10-12 | Sarnoff Corporation | Method and apparatus for detecting suspicious activities |
US20060018516A1 (en) | 2004-07-22 | 2006-01-26 | Masoud Osama T | Monitoring activity using video information |
US7158680B2 (en) | 2004-07-30 | 2007-01-02 | Euclid Discoveries, Llc | Apparatus and method for processing video data |
US8589315B2 (en) * | 2004-08-14 | 2013-11-19 | Hrl Laboratories, Llc | Behavior recognition using cognitive swarms and fuzzy graphs |
US7502498B2 (en) * | 2004-09-10 | 2009-03-10 | Available For Licensing | Patient monitoring apparatus |
JP2006080437A (en) | 2004-09-13 | 2006-03-23 | Intel Corp | Method and tool for mask blank inspection |
US7746378B2 (en) | 2004-10-12 | 2010-06-29 | International Business Machines Corporation | Video analysis, archiving and alerting methods and apparatus for a distributed, modular and extensible video surveillance system |
US7318052B2 (en) * | 2004-10-15 | 2008-01-08 | Sap Ag | Knowledge transfer evaluation |
US7620266B2 (en) | 2005-01-20 | 2009-11-17 | International Business Machines Corporation | Robust and efficient foreground analysis for real-time video surveillance |
US20060190419A1 (en) | 2005-02-22 | 2006-08-24 | Bunn Frank E | Video surveillance data analysis algorithms, with local and network-shared communications for facial, physical condition, and intoxication recognition, fuzzy logic intelligent camera system |
DE602006017977D1 (en) | 2005-03-17 | 2010-12-16 | British Telecomm | TRACKING OBJECTS IN A VIDEO SEQUENCE |
AU2006230361A1 (en) * | 2005-03-30 | 2006-10-05 | Cernium Corporation | Intelligent video behavior recognition with multiple masks and configurable logic inference module |
US7801328B2 (en) * | 2005-03-31 | 2010-09-21 | Honeywell International Inc. | Methods for defining, detecting, analyzing, indexing and retrieving events using video image processing |
US7825954B2 (en) | 2005-05-31 | 2010-11-02 | Objectvideo, Inc. | Multi-state target tracking |
US20090041297A1 (en) | 2005-05-31 | 2009-02-12 | Objectvideo, Inc. | Human detection and tracking for security applications |
US7884849B2 (en) | 2005-09-26 | 2011-02-08 | Objectvideo, Inc. | Video surveillance system with omni-directional camera |
CN101410855B (en) | 2006-03-28 | 2011-11-30 | 爱丁堡大学评议会 | Method for automatically attributing one or more object behaviors |
US20070250898A1 (en) | 2006-03-28 | 2007-10-25 | Object Video, Inc. | Automatic extraction of secondary video streams |
CA2649389A1 (en) | 2006-04-17 | 2007-11-08 | Objectvideo, Inc. | Video segmentation using statistical pixel modeling |
US8467570B2 (en) | 2006-06-14 | 2013-06-18 | Honeywell International Inc. | Tracking system with fused motion and object detection |
JP4413915B2 (en) * | 2006-12-13 | 2010-02-10 | 株式会社東芝 | Abnormal sign detection apparatus and method |
US8269834B2 (en) * | 2007-01-12 | 2012-09-18 | International Business Machines Corporation | Warning a user about adverse behaviors of others within an environment based on a 3D captured image stream |
US7916944B2 (en) | 2007-01-31 | 2011-03-29 | Fuji Xerox Co., Ltd. | System and method for feature level foreground segmentation |
JP5278770B2 (en) | 2007-02-08 | 2013-09-04 | ビヘイヴィアラル レコグニション システムズ, インコーポレイテッド | Behavior recognition system |
US8358342B2 (en) | 2007-02-23 | 2013-01-22 | Johnson Controls Technology Company | Video processing systems and methods |
US8086036B2 (en) | 2007-03-26 | 2011-12-27 | International Business Machines Corporation | Approach for resolving occlusions, splits and merges in video images |
US7813528B2 (en) | 2007-04-05 | 2010-10-12 | Mitsubishi Electric Research Laboratories, Inc. | Method for detecting objects left-behind in a scene |
US8078233B1 (en) * | 2007-04-11 | 2011-12-13 | At&T Mobility Ii Llc | Weight based determination and sequencing of emergency alert system messages for delivery |
US8411935B2 (en) | 2007-07-11 | 2013-04-02 | Behavioral Recognition Systems, Inc. | Semantic representation module of a machine-learning engine in a video analysis system |
US8064639B2 (en) | 2007-07-19 | 2011-11-22 | Honeywell International Inc. | Multi-pose face tracking using multiple appearance models |
US8175333B2 (en) | 2007-09-27 | 2012-05-08 | Behavioral Recognition Systems, Inc. | Estimator identifier component for behavioral recognition system |
US8300924B2 (en) | 2007-09-27 | 2012-10-30 | Behavioral Recognition Systems, Inc. | Tracker component for behavioral recognition system |
US8013738B2 (en) * | 2007-10-04 | 2011-09-06 | Kd Secure, Llc | Hierarchical storage manager (HSM) for intelligent storage of large volumes of data |
WO2009049314A2 (en) | 2007-10-11 | 2009-04-16 | Trustees Of Boston University | Video processing system employing behavior subtraction between reference and observed video image sequences |
US8195598B2 (en) * | 2007-11-16 | 2012-06-05 | Agilence, Inc. | Method of and system for hierarchical human/crowd behavior detection |
EP2093698A1 (en) | 2008-02-19 | 2009-08-26 | British Telecommunications Public Limited Company | Crowd congestion analysis |
US7962435B2 (en) * | 2008-02-20 | 2011-06-14 | Panasonic Corporation | System architecture and process for seamless adaptation to context aware behavior models |
US8427552B2 (en) * | 2008-03-03 | 2013-04-23 | Videoiq, Inc. | Extending the operational lifetime of a hard-disk drive used in video data storage applications |
US8169481B2 (en) * | 2008-05-05 | 2012-05-01 | Panasonic Corporation | System architecture and process for assessing multi-perspective multi-context abnormal behavior |
US8452108B2 (en) | 2008-06-25 | 2013-05-28 | Gannon Technologies Group Llc | Systems and methods for image recognition using graph-based pattern matching |
US8121968B2 (en) | 2008-09-11 | 2012-02-21 | Behavioral Recognition Systems, Inc. | Long-term memory in a video analysis system |
US8126833B2 (en) * | 2008-09-11 | 2012-02-28 | Behavioral Recognition Systems, Inc. | Detecting anomalous events using a long-term memory in a video analysis system |
US8180712B2 (en) * | 2008-09-30 | 2012-05-15 | The Nielsen Company (Us), Llc | Methods and apparatus for determining whether a media presentation device is in an on state or an off state |
WO2010055205A1 (en) * | 2008-11-11 | 2010-05-20 | Reijo Kortesalmi | Method, system and computer program for monitoring a person |
US9373055B2 (en) | 2008-12-16 | 2016-06-21 | Behavioral Recognition Systems, Inc. | Hierarchical sudden illumination change detection using radiance consistency within a spatial neighborhood |
US20100208063A1 (en) | 2009-02-19 | 2010-08-19 | Panasonic Corporation | System and methods for improving accuracy and robustness of abnormal behavior detection |
WO2010111748A1 (en) * | 2009-04-01 | 2010-10-07 | Curtin University Of Technology | Systems and methods for detecting anomalies from data |
CN101557506B (en) | 2009-05-19 | 2011-02-09 | 浙江工业大学 | Intelligent detecting device for violent behavior in elevator car based on computer vision |
CN101901334B (en) * | 2009-05-31 | 2013-09-11 | 汉王科技股份有限公司 | Static object detection method |
US8649594B1 (en) * | 2009-06-04 | 2014-02-11 | Agilence, Inc. | Active and adaptive intelligent video surveillance system |
US8493409B2 (en) | 2009-08-18 | 2013-07-23 | Behavioral Recognition Systems, Inc. | Visualizing and updating sequences and segments in a video surveillance system |
US8340352B2 (en) | 2009-08-18 | 2012-12-25 | Behavioral Recognition Systems, Inc. | Inter-trajectory anomaly detection using adaptive voting experts in a video surveillance system |
US8280153B2 (en) | 2009-08-18 | 2012-10-02 | Behavioral Recognition Systems | Visualizing and updating learned trajectories in video surveillance systems |
US8379085B2 (en) | 2009-08-18 | 2013-02-19 | Behavioral Recognition Systems, Inc. | Intra-trajectory anomaly detection using adaptive voting experts in a video surveillance system |
US8167430B2 (en) | 2009-08-31 | 2012-05-01 | Behavioral Recognition Systems, Inc. | Unsupervised learning of temporal anomalies for a video surveillance system |
US8270733B2 (en) * | 2009-08-31 | 2012-09-18 | Behavioral Recognition Systems, Inc. | Identifying anomalous object types during classification |
US8285060B2 (en) | 2009-08-31 | 2012-10-09 | Behavioral Recognition Systems, Inc. | Detecting anomalous trajectories in a video surveillance system |
US8218818B2 (en) | 2009-09-01 | 2012-07-10 | Behavioral Recognition Systems, Inc. | Foreground object tracking |
US8180105B2 (en) | 2009-09-17 | 2012-05-15 | Behavioral Recognition Systems, Inc. | Classifier anomalies for observed behaviors in a video surveillance system |
US20110134245A1 (en) * | 2009-12-07 | 2011-06-09 | Irvine Sensors Corporation | Compact intelligent surveillance system comprising intent recognition |
US8719198B2 (en) * | 2010-05-04 | 2014-05-06 | Microsoft Corporation | Collaborative location and activity recommendations |
US8600915B2 (en) * | 2011-12-19 | 2013-12-03 | Go Daddy Operating Company, LLC | Systems for monitoring computer resources |
WO2013138719A1 (en) | 2012-03-15 | 2013-09-19 | Behavioral Recognition Systems, Inc. | Alert directives and focused alert directives in a behavioral recognition system |

2013
- 2013-03-15 WO PCT/US2013/032075 patent/WO2013138719A1/en active Application Filing
- 2013-03-15 EP EP13760552.3A patent/EP2826020A4/en not_active Withdrawn
- 2013-03-15 US US13/836,372 patent/US9208675B2/en active Active - Reinstated
- 2013-03-15 US US13/836,730 patent/US9349275B2/en active Active
- 2013-03-15 IN IN8349DEN2014 patent/IN2014DN08349A/en unknown
- 2013-03-15 US US13/839,587 patent/US10096235B2/en active Active
- 2013-03-15 IN IN8342DEN2014 patent/IN2014DN08342A/en unknown
- 2013-03-15 CN CN201380019203.9A patent/CN104303218A/en active Pending
- 2013-03-15 EP EP13760772.7A patent/EP2826029A4/en not_active Withdrawn
- 2013-03-15 WO PCT/US2013/031977 patent/WO2013138700A1/en active Application Filing
- 2013-03-15 CN CN201380019214.7A patent/CN104254873A/en active Pending

2016
- 2016-05-24 US US15/163,461 patent/US20160267777A1/en not_active Abandoned

2018
- 2018-03-28 US US15/938,759 patent/US11217088B2/en active Active
- 2018-08-31 US US16/119,227 patent/US20190188998A1/en not_active Abandoned

2021
- 2021-07-16 US US17/378,530 patent/US11727689B2/en active Active

2023
- 2023-06-23 US US18/213,516 patent/US12094212B2/en active Active
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11217088B2 (en) | 2012-03-15 | 2022-01-04 | Intellective Ai, Inc. | Alert volume normalization in a video surveillance system |
US11727689B2 (en) | 2012-03-15 | 2023-08-15 | Intellective Ai, Inc. | Alert directives and focused alert directives in a behavioral recognition system |
US12094212B2 (en) | 2012-03-15 | 2024-09-17 | Intellective Ai, Inc. | Alert directives and focused alert directives in a behavioral recognition system |
Also Published As
Publication number | Publication date |
---|---|
US20160267777A1 (en) | 2016-09-15 |
EP2826029A1 (en) | 2015-01-21 |
US20230419669A1 (en) | 2023-12-28 |
US9208675B2 (en) | 2015-12-08 |
US10096235B2 (en) | 2018-10-09 |
WO2013138700A1 (en) | 2013-09-19 |
US20130243252A1 (en) | 2013-09-19 |
US12094212B2 (en) | 2024-09-17 |
US20130241730A1 (en) | 2013-09-19 |
WO2013138719A1 (en) | 2013-09-19 |
CN104303218A (en) | 2015-01-21 |
US9349275B2 (en) | 2016-05-24 |
US11217088B2 (en) | 2022-01-04 |
CN104254873A (en) | 2014-12-31 |
IN2014DN08349A (en) | 2015-05-08 |
EP2826020A4 (en) | 2016-06-15 |
US11727689B2 (en) | 2023-08-15 |
IN2014DN08342A (en) | 2015-05-08 |
US20210398418A1 (en) | 2021-12-23 |
US20130242093A1 (en) | 2013-09-19 |
EP2826020A1 (en) | 2015-01-21 |
EP2826029A4 (en) | 2016-10-26 |
US20190005806A1 (en) | 2019-01-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US12094212B2 (en) | | Alert directives and focused alert directives in a behavioral recognition system |
US10848715B2 (en) | | Anomalous stationary object detection and reporting |
US9959630B2 (en) | | Background model for complex and dynamic scenes |
US8167430B2 (en) | | Unsupervised learning of temporal anomalies for a video surveillance system |
US10049293B2 (en) | | Pixel-level based micro-feature extraction |
US9674442B2 (en) | | Image stabilization techniques for video surveillance systems |
US9111148B2 (en) | | Unsupervised learning of feature anomalies for a video surveillance system |
US11017236B1 (en) | | Anomalous object interaction detection and reporting |
US8416296B2 (en) | | Mapper component for multiple art networks in a video analysis system |
US8786702B2 (en) | | Visualizing and updating long-term memory percepts in a video surveillance system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | STPP | Information on status: patent application and granting procedure in general | Free format text: NON FINAL ACTION MAILED |
| | AS | Assignment | Owner name: BEHAVIORAL RECOGNITION SYSTEMS, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:COBB, WESLEY KENNETH;SEOW, MING-JUNG;XU, GANG;AND OTHERS;REEL/FRAME:049903/0665 Effective date: 20130315 Owner name: OMNI AI, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PEPPERWOOD FUND II, LP;REEL/FRAME:049903/0877 Effective date: 20170201 Owner name: PEPPERWOOD FUND II, LP, TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:GIANT GRAY, INC.;REEL/FRAME:049903/0803 Effective date: 20170131 Owner name: GIANT GRAY, INC., TEXAS Free format text: CHANGE OF NAME;ASSIGNOR:BEHAVIORAL RECOGNITION SYSTEMS, INC.;REEL/FRAME:049910/0538 Effective date: 20160321 |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: FINAL REJECTION MAILED |
| | AS | Assignment | Owner name: INTELLECTIVE AI, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:OMNI AI, INC.;REEL/FRAME:052216/0585 Effective date: 20200124 |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: FINAL REJECTION MAILED |
| | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |