WO2007002404A2 - Target detection and tracking from overhead video streams - Google Patents

Target detection and tracking from overhead video streams Download PDF

Info

Publication number
WO2007002404A2
WO2007002404A2 PCT/US2006/024485 US2006024485W WO2007002404A2 WO 2007002404 A2 WO2007002404 A2 WO 2007002404A2 US 2006024485 W US2006024485 W US 2006024485W WO 2007002404 A2 WO2007002404 A2 WO 2007002404A2
Authority
WO
WIPO (PCT)
Prior art keywords
targets
video
target
computer
block
Prior art date
Application number
PCT/US2006/024485
Other languages
French (fr)
Other versions
WO2007002404A8 (en
WO2007002404A3 (en
Inventor
Alan J. Lipton
Peter L. Venetianer
Zhong Zhang
Haiying Liu
Zeeshan Rasheed
Himaanshu Gupta
Li Yu
Original Assignee
Objectvideo, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Objectvideo, Inc. filed Critical Objectvideo, Inc.
Priority to JP2008518435A priority Critical patent/JP2008544705A/en
Priority to MX2007016406A priority patent/MX2007016406A/en
Priority to EP06785442.2A priority patent/EP1894142B1/en
Priority to CA002611522A priority patent/CA2611522A1/en
Publication of WO2007002404A2 publication Critical patent/WO2007002404A2/en
Publication of WO2007002404A3 publication Critical patent/WO2007002404A3/en
Publication of WO2007002404A8 publication Critical patent/WO2007002404A8/en
Priority to IL188196A priority patent/IL188196A0/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/215Motion-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19602Image analysis to detect motion of the intruder, e.g. by frame subtraction
    • G08B13/19608Tracking movement of a target, e.g. by detecting an object predefined as a target, using target direction and or velocity to predict its new position

Definitions

  • the invention relates to video surveillance systems and video verification systems. Specifically, the invention relates to a video surveillance system that may be configured to detect and track individual targets in video streams from an overhead camera view.
  • Video surveillance is of critical concern in many areas of life.
  • One problem with video as a surveillance tool is that it may be very manually intensive to monitor.
  • solutions have been proposed to the problems of automated video monitoring in the form of intelligent video surveillance systems. See, for example, U.S. Patent No. 6,696,945, "Video Tripwire,” Attorney Docket No. 37112-175339; and U.S. Patent Application No. 09/987,707, "Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340, both of which are incorporated herein by reference.
  • One application of video surveillance is the detection of human beings and their behaviors.
  • the science of computer vision which is behind automated video monitoring, has limitations with respect to recognizing individual targets in overhead camera views, such as those used in residential, commercial, and home monitoring applications.
  • One embodiment of the invention includes a computer-readable medium comprising software for video processing, which when executed by a computer system, cause the computer system to perform operations comprising a method of: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
  • One embodiment of the invention includes a computer-based system to perform a method for video processing, the method comprising: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
  • One embodiment of the invention includes a method for video processing comprising: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
  • Figure 1 illustrates a video surveillance system according to an exemplary embodiment of the invention.
  • Figure 2 illustrates an exemplary frame from a video stream from the video surveillance system according to an exemplary embodiment of the invention.
  • Figure 3 illustrates a flow diagram for target detection and counting according to an exemplary embodiment of the invention.
  • Figure 4 illustrates a flow diagram for detecting moving pixels according to an exemplary embodiment of the invention.
  • Figure 5 illustrates a flow diagram for detecting line segments according to an exemplary embodiment of the invention.
  • Figure 6 illustrates a flow diagram for finding a next line segment according to an exemplary embodiment of the invention.
  • Figure 7 illustrates predicting new search directions according to an exemplary embodiment of the invention.
  • Figure 8 illustrates a flow diagram for tracking targets according to an exemplary embodiment of the invention.
  • Figure 9 illustrates a flow diagram for updating targets according to an exemplary embodiment of the invention.
  • Figure 10 illustrates a flow diagram for detecting new targets according to an exemplary embodiment of the invention.
  • Figure 11 illustrates a flow diagram for refining targets according to an exemplary embodiment of the invention.
  • Figure 12 illustrates a flow diagram for merging targets according to an exemplary embodiment of the invention.
  • Figure 13 illustrates a flow diagram for splitting targets according to an exemplary embodiment of the invention.
  • Figure 14 illustrates a flow diagram for merging and splitting targets according to an exemplary embodiment of the invention.
  • Figure 15 illustrates a flow diagram for analyzing blobs according to an exemplary embodiment of the invention.
  • Figure 16 illustrates a flow diagram for cleaning targets according to an exemplary embodiment of the invention.
  • a "computer” may refer to one or more apparatus and/or one or more systems that are capable of accepting a structured input, processing the structured input according to prescribed rules, and producing results of the processing as output.
  • Examples of a computer may include: a computer; a stationary and/or portable computer; a computer having a single processor or multiple processors, which may operate in parallel and/or not in parallel; a general purpose computer; a supercomputer; a mainframe; a super mini-computer; a mini-computer; a workstation; a micro-computer; a server; a client; an interactive television; a web appliance; a telecommunications device with internet access; a hybrid combination of a computer and an interactive television; a portable computer; a personal digital assistant (PDA); a portable telephone; application-specific hardware to emulate a computer and/or software, such as, for example, a digital signal processor (DSP) or a field-programmable gate array (FPGA); a distributed computer system for processing information via computer systems linked by a network;
  • Software may refer to prescribed rules to operate a computer. Examples of software may include software; code segments; instructions; computer programs; and programmed logic.
  • a "computer system” may refer to a system having a computer, where the computer may include a computer-readable medium embodying software to operate the computer.
  • a "network” may refer to a number of computers and associated devices that may be connected by communication facilities.
  • a network may involve permanent connections such as cables or temporary connections such as those made through telephone or other communication links.
  • Examples of a network may include: an internet, such as the Internet; an intranet; a local area network (LAN); a wide area network (WAN); and a combination of networks, such as an internet and an intranet.
  • Video may refer to motion pictures represented in analog and/or digital form. Examples of video may include television, movies, image sequences from a camera or other observer, and computer-generated image sequences. Video may be obtained from, for example, a live feed, a storage device, an IEEE 1394-based interface, a video digitizer, a computer graphics engine, or a network connection.
  • a "video camera” may refer to an apparatus for visual recording.
  • Examples of a video camera may include one or more of the following: a video camera; a digital video camera; a color camera; a monochrome camera; a camera; a camcorder; a PC camera; a webcam; an infrared (IR) video camera; a low-light video camera; a thermal video camera; a closed-circuit television (CCTV) camera; a pan, tilt, zoom (PTZ) camera; and a video sensing device.
  • a video camera may be positioned to perform surveillance of an area of interest.
  • Video processing may refer to any manipulation and/or analysis of video, including, for example, compression, editing, surveillance, and/or verification.
  • a "frame” may refer to a particular image or other discrete unit within a video.
  • the invention relates to a video surveillance system that may be configured to detect and track individual targets in video streams from an overhead camera view and to a video verification system that may be configured to verify the occurrences being monintored.
  • the system may be adapted to disambiguate multiple objects even when they interact in tight groups and to detect moving objects in the presence of other inanimate objects, such as moving shopping carts, strollers, moving furniture, and other items.
  • the invention may be used in a variety of applications.
  • the invention may be used to detect humans and reduce false alarms in a residential or commercial monitoring system.
  • the invention may be used to determine building occupancy by counting individuals entering and leaving an area and/or to detect if "piggybacking" occurred (i.e., to detect an access control violation when two people enter or exit through a portal when only one may be authorized to do so).
  • the invention may be used to detect people moving the "wrong way" in a one way corridor, such as, for example, an airport exit or public transport escalator.
  • the invention may be used to detect people interacting in a dangerous way, such as, for example, a mugging or a drug deal.
  • a retail setting the invention may be used to detect store occupancy, detect queue length at a checkout lane, or verify a point of sale (POS) transaction.
  • POS point of sale
  • the invention may be used to count people entering a public transportation facility or vehicle and to perform video surveillance of a ticket reader to ensure that there is a ticket scanned when a person enters an area (e.g., to prevent a person from jumping over a turnstile, or overcoming another such obstacle).
  • the invention may be used to verify the legitimacy of several classes of retail point of sale (POS) transactions.
  • POS point of sale
  • a "merchandise return" transaction may require that a customer be physically present.
  • a "manager override” transaction may require that a manager assist the cashier.
  • the video surveillance system of the invention may monitor the locations and number of individuals around the POS console (e.g., the cash register) and determine if an appropriate configuration of people is present at the time of a particular transaction.
  • FIG. 1 illustrates the video surveillance system according to this exemplary embodiment of the invention.
  • the video surveillance system 101 of the invention may interact with a POS system 102.
  • the video surveillance system 101 may include a video camera 103, a target (e.g., human) detection and counting module 104, a classification of transaction (valid/invalid) module 105, and a pre-defined rules database 106.
  • the video camera 103 may overlook the console of the POS system from an overhead position.
  • the field of view of the video camera 103 may be looking down on the scene.
  • the target detection and counting module 104 may receive input from the POS system 102 as a transaction report that a particular transaction is requested, underway, or has been completed.
  • the target detection and counting module 104 may determine the number of humans, if any, in the video scene. An exemplary embodiment of the target detection and counting module 104 is discussed below with respect to Figures 4-16.
  • the classification of transaction module 105 may determine the constellation of participants based on the rules received from the pre-defined rules database 106.
  • the system 101 may then provide a transaction verification message back to the POS system 102 (or some other data monitoring or archiving system) to indicate whether the transaction was legitimate or not.
  • Blocks 105 and 106 may be implemented using the techniques discussed in, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, “Video Surveillance System,”Attorney Docket No. 37112-213547; or U.S. Patent Application No. 11/098,385, "Video surveillance system employing video primitives," Attorney Docket No. 37112-215811, which are incorporated herein by reference. In these documents, the creation of rules and the performance of activity inference (e.g., people counting) are discussed.
  • activity inference e.g., people counting
  • human target primitives as discussed in, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340, may be used.
  • POS transaction primitive For the example of a POS system, a primitive called a "POS transaction primitive" may be used. This primitive may contain three data items: (1) the time of a POS transaction; (2) the location (which POS terminal) of the transaction; and (3) the type of transaction (sale, return, manager override, etc). Two rules for the rules database 106 may be used with the POS transaction primitive.
  • a "manager override” transaction rule that says the following: if a POS manager override transaction (primitive) is registered; and there have not been two employees present (>1 human in an "employee” area of interest) for a [parameter] period of time; then the transaction is invalid and an alarm condition is generated.
  • the video camera 103 may be connected to a computer-based system 107 that may perform analysis of the video from the video camera 103 to determine the locations and number of people in the scene.
  • Examples of the computer-based system 107 may include the following: a computer, as defined above; a personal computer (PC), a laptop, a personal digital assistant (PDA), a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable array (FPGA), a microcontroller; or any other form-factor processor either as a standalone device or embedded in a video camera, a digital video recorder (DVR), a network video recorder (NVR), a network switcher, a network router, a POS terminal, or any other hardware device.
  • DVR digital video recorder
  • NVR network video recorder
  • POS terminal a POS terminal
  • the computer- based system 107 may include the human detection and counting module 104, the classification of transaction module 105, and the pre-defined rules database 106.
  • the computer-based system 107 may be implemented with one or more computers employing software and connected to a network. Alternatively, the computer-based system 107 may be incorporated in whole or in part into the video camera 103.
  • the human detection and counting module 104 and the classification of transaction module 105 may be implemented as a computer- readable medium comprising software to perform the operations of the modules 104 and 105, such that when the software is executed by a computer system, the computer system may be caused to perform the operations of the modules 104 and 105.
  • the human detection and counting module 104 and the classification of transaction module 105, and the pre-defined rules database 106 may be implemented with application-specific hardware to emulate a computer and/or software.
  • Figure 2 illustrates an exemplary frame from a video stream from the video surveillance system according to an exemplary embodiment of the invention.
  • the exemplary camera view may be from a video camera positioned overhead.
  • the customer is on the right, and two employees, namely a cashier and a manager, are on the left.
  • FIG. 3 illustrates a flow diagram for target detection and counting according to an exemplary embodiment of the invention.
  • targets may be described using co-moving sets of line segments extracted from the video scene.
  • blocks 301 and 302 may be employed.
  • moving pixels may be detected in the video stream using, for example, three-frame differencing, or some other technique (see, for example, U.S. Patent No. 6,625,310, "Video Segmentation Using Statistical Pixel Modeling," Attorney Docket No. 37112-164995; or U.S. Patent Application No. 10/354,096, "Video Scene Background Maintenance Using Change Detection and Classification," Attorney Docket No.
  • block 301 may be discussed below with respect to Figure 4.
  • line segments may be detected using, for example, edge detection and line growing technique (see, for example, U.S. Patent Application No. 11/113,275, "Line Textured Target Detection and Tracking with Applications to 'Basket-run' Detection," Attorney Docket No. 37112-217049, which is incorporated herein by reference).
  • An exemplary embodiment of block 302 is discussed below with respect to Figures 5-7.
  • targets may be identified as sets of line segments that fit the requirements a normal target (e.g., approximate target shape and size), given the field of view of the video camera.
  • targets may be tracked using a tracking filter, such as a Kalman filter, applied to the centroids of the targets, or some other technique (see, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; or U.S. Patent Application No. 11/139,600, “Multi-State Target Tracking,” filed May 31, 2005, Attorney Docket No. 37112-218196, both of which are incorporated herein by reference).
  • a tracking filter such as a Kalman filter
  • FIG. 4 illustrates a flow diagram for detecting moving pixels in block 301 of Figure 3 according to an exemplary embodiment of the invention.
  • the foreground moving area may be separated from the background scene. This separation may be performed using change detection. Change detection has been studied extensively in recent years, and many techniques are available.
  • the output of the change detection may be a foreground mask for each frame.
  • the edges of each foreground mask may be detected. While other edge detection algorithms may be used, an exemplary embodiment of the invention may use the Canny edge detection, which produces single-pixel-width edges. The edge detection may be performed only on the foreground area, which may require some modifications to the Canny edge detector to incorporate the foreground mask information.
  • Figure 5 illustrates a flow diagram for detecting line segments in block 302 of Figure 3 according to an exemplary embodiment of the invention.
  • a deterministic method may be used to detect line segments by extracting all of the line segments from an edge pixel map. The method may iteratively search an edge pixel map to find a new line segment until there are not enough unused edge pixels remaining. Each edge pixel may only be in one line segment, and after being used, the edge pixel may be removed from the edge pixel map.
  • the input to block 501 may be an edge pixel map of the frame obtained by, for example, block 402 in Figure 4.
  • edge pixels may be counted.
  • a determination may be made whether a sufficient number of edge pixels exist (or remain) to identify a line segment.
  • the threshold to check this condition may be determined by user input parameters on the rough image size of an exemplary object, such as, for example, a shopping cart. For example, if the rough image width of a shopping cart is sixty pixels, the threshold on the sufficient remaining edge pixels may be, for example, one third of it, that is, twenty pixels. This threshold may be called the minimum line segment length threshold.
  • a new line segment may be identified.
  • An exemplary embodiment of block 503 is discussed below with respect to Figure 6.
  • the edge pixel map may be updated to eliminate the pixels used in block 503, as noted above.
  • a determination may be made whether the new line segment is valid based on, for example, its length and linearity. For example, if the new line segment from block 503 has length much shorter than the image dimension of an expected shopping cart or if its overall linearity is too low, the new line segment may be considered as an invalid line segment.
  • the invalid line segment may be discarded, and flow may proceed to block 501; otherwise, flow proceeds to block 506.
  • the valid line segment may be added to a list of line segments in the frame.
  • the list of valid line segments may be outputted.
  • Figure 6 illustrates a flow diagram for finding a next line segment in block 503 of Figure 5 according to an exemplary embodiment of the invention.
  • a starting point of the new line segment is identified from a given edge pixel map.
  • this start point may be obtained by scanning through the whole edge pixel map from the top left corner until the first unused edge point is located.
  • the search may be speeded up by using the start point of the preceding line segment as the scanning start position.
  • the next search directions may be predicted for the end point based on an estimated line direction.
  • An exemplary embodiment of block 602 is discussed below with respect to Figure 7.
  • next line pixel may be identified by looping through each predicted search position to determine if the pixel is an edge pixel.
  • the pixel may be added to the line segment as the new end point, and flow may proceed to block 602. Otherwise, the next line pixel may be searched for in both directions, and flow may proceed to block 605.
  • the next line pixel may be searched for in both directions, and flow may proceed to block 605.
  • the reverse direction may have already been searched. If the reverse direction has not been searched, flow may proceed to block 606; otherwise, flow may proceed to block 607.
  • the search process may reverse the line direction. The end point may become the start point, the start point may become the current end point, and flow proceeds back to block 602.
  • the end of the search process on the current line segment may be reached, and the line segment may be outputted.
  • Figure 7 illustrates predicting new search directions in block 602 of Figure 6 according to an exemplary embodiment of the invention.
  • Area 702 may depict a region of an image, where each block indicates one pixel location.
  • Area 704 may indicate the current end point pixel of the current line segment.
  • Three different states may be considered when predicting the next search positions. For the first state (the initial pixel), the current end point pixel may also be the start point. In this case, all of the eight neighboring directions A-H of the end point pixel are searched as shown by reference numeral 706.
  • the direction of the line segment may be estimated using information provided by the pixels of the line segment.
  • One way to determine the line direction may be to perform clustering of the line segment pixels into two groups, namely the starting pixels and the ending pixels, which may correspond to the first half and second half of the line segment, respectively. The line direction may then be determined by using the average locations of the two groups of pixels.
  • the top three directions may be selected, for example, C, D, and E, indicated by reference numeral 710, that have minimum angle distances from the line direction.
  • Two further scenarios may be considered in this case.
  • the line may not yet be long enough to become a consistent line segment, where it is unclear whether the list of pixels is a part of a line segment or just a cluster of neighboring edge pixels.
  • One way to determine if the current line segment is sufficiently consistent may be to use the minimum length threshold discussed above. In particular, if the line segment is less than this threshold, the line segment may be considered not to be sufficiently consistent.
  • the three direct neighboring locations 710 may be included as the next search locations.
  • the line segment may be long enough and may be consistently extracted. In this case, a portion of the line may be missing due to an occasional small gap in the edge map caused by noise.
  • further neighborhood search locations may be included as indicated by reference numeral 712.
  • Figure 8 illustrates a flow diagram for tracking targets in block 304 of Figure 3 according to an exemplary embodiment of the invention.
  • existing targets may be updated as new information is received from frame to frame.
  • An exemplary embodiment of block 801 is discussed below with respect to Figure 9.
  • new targets may be recognized from any unassigned line segments that have not been deemed part of an existing target.
  • An exemplary embodiment of block 802 is discussed below with respect to Figure 10.
  • the targets may be refined to ensure that the available features may be accommodated.
  • An exemplary embodiment of block 803 is discussed below with respect to Figure 11.
  • the targets may be analyzed to determine if they should be merged (i.e., two targets become one target), and in block 805, the targets may be analyzed to determine if they should be split (i.e., one target becomes two targets).
  • An exemplary embodiment of blocks 804 and 805 is discussed below with respect to Figures 12-15.
  • the targets are cleaned, which may be used to determine when a target has left the field of view of the video camera.
  • An exemplary embodiment of block 806 is discussed below with respect to Figure 16.
  • Figure 9 illustrates a flow diagram for updating targets in block 801 of Figure 8 according to an exemplary embodiment of the invention.
  • the parameters e.g., position and size, or position, size, and velocity
  • the parameters may be predicted using an appropriate tracking filter, such as, for example, a Kalman filter or the another tracking filtering (see, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; or U.S. Patent Application No. 11/139,600, “Multi-State Target Tracking," filed May 31, 2005, Attorney Docket No. 37112-218196).
  • a Kalman filter or the another tracking filtering
  • the line segments that have been detected may be assigned to each of the targets based on their locations with respect to the centroid and size of the existing target.
  • the targets may be updated. For example, the target's new position, size and velocity may be updated according to the tracking filter update rules.
  • Figure 10 illustrates a flow diagram for detecting new targets in block 802 of
  • any unassigned line segments may be clustered using, for example, a neighborhood grouping method. For example, any line segments within a certain threshold of distance from each other may be clustered into a single group.
  • the cluster of the unassigned line segments may be verified to make ensure they correspond to the pre-defined requirements of a target. For example, if a human target in the field of view of Figure 2 is used to define the requirements of a target, the cluster of the unassigned line segments may need to have the correct approximate size to indicate the presence of a human target. If the cluster of the unassigned line segments is too large or too small, the cluster of the unassigned line segments may be rejected.
  • the cluster of unassigned line segments may be designated as a new target, and a tracking filter may be instantiated for the new target with the position and size of the cluster of unassigned line segments as the initial parameters for the new target.
  • FIG 11 illustrates a flow diagram for refining targets in block 803 of Figure 8 according to an exemplary embodiment of the invention.
  • any remaining line segments that have not been assigned to existing or new targets may be agglomerated into their nearest neighbor target.
  • the targets may be re-estimated based on the new features. For example, the position and velocity of the targets may be re-calculated, and the associated tracking filter may be updated with these new parameters.
  • a determination may be made as to whether or not each target is becoming stationary (i.e., stops moving). If the number and size of line segments associated with that target decreases, the target may be ceasing motion.
  • the target's parameters e.g., size, position, and velocity
  • the target's parameters may be updated using all (or some) of the moving pixels in the target's vicinity rather than just the moving line segments.
  • FIG. 12 illustrates a flow diagram for merging targets in block 804 of Figure 8 according to an exemplary embodiment of the invention.
  • two targets may be obtained.
  • the parameters of the obtained targets may be compared. For example, the size and history (or age) of the targets may be compared. If the two targets occupy similar space, one is smaller than the other, and one is younger than the other, the two targets may be deemed similar enough to be merged into a single target. If the parameters of the targets are similar, flow may proceed to block 1203; otherwise, flow may proceed to block 1201.
  • the two target may be merged into a single target. For example, the smaller and/or younger target may be merged into the larger one. After block 1203, flow may proceed to block 1201. For flow returning to block 1201, two targets may be obtained that have not been compared previously. Flow may exit block 804 once all (or a sufficient number) of targets have been compared for merger.
  • Figure 13 illustrates a flow diagram for splitting targets in block 805 of Figure 8 according to an exemplary embodiment of the invention.
  • a target may be obtained.
  • a determination may be made whether the target is similar to a normal target.
  • the normal target may be modeled after a person in Figure 2. If the target and normal target are compared based on, for example, their sizes, and if the target is larger than the normal target, the target may be determined not to be similar to the normal target. If the target is not similar to the normal target, flow may proceed to block 1303; otherwise, flow may proceed to block 1301.
  • clusters may be obtained from the line segments of the target.
  • two line segments that are furthest away from each other within the target may be identified, and clustering may be re-initialized (as in block 1001 of Figure 10) with both of these line segments as the starting points.
  • the result may be two new clusters of line segments.
  • a determination may be made whether the two new clusters of line segments are similar to the normal target. For example, if the resulting two clusters are of appropriate size and shape when compared to the normal target, the two clusters may be considered individual targets. If the two new clusters of line segments are similar to the normal target, flow may proceed to block 1305; otherwise, flow may proceed to block 1301.
  • target identities may be assigned to the two new clusters of line segments. For example, the smaller cluster may be assigned a new identity, and the larger cluster may maintain the original identity of the target. From block 1305, flow may proceed to block 1301. Flow may exit block 805 once all (or a sufficient number) of targets have been analyzed for splitting.
  • the merging and splitting of targets may be considered simultaneously and may be based on, for example, the analysis of the shape of the moving target blob.
  • the analysis may result in labeling the number of human targets in a blob as "no targets,”one human target,” or ">1 human targets.”
  • Other embodiments might seek to count specific targets in a group.
  • Figure 14 illustrates a flow diagram for merging and splitting targets in blocks 804 and 805 of Figure 8 according to an exemplary embodiment of the invention.
  • a foreground mask may be generated for each video frame.
  • This foreground mask may be generated using the detection of moving pixels discussed for block 301 of Figure 3 or another foreground object detection technique (see, for example, U.S. Patent No. 6,625,310, "Video Segmentation Using Statistical Pixel Modeling," Attorney Docket No. 37112-164995; U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System,” Attorney Docket No. 37112- 213547; or U.S. Patent Application No. 11/098,385, “Video Surveillance System Employing Video Primitives,” Attorney Docket No. 37112-215811, all of which are incorporated herein by reference).
  • foreground objects i.e., blobs
  • the foreground objects may be detected using a clustering algorithm (see, e.g., U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System,” Attorney Docket No. 37112-213547; or U.S. Patent Application No. 11/098,385, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-215811).
  • a clustering algorithm see, e.g., U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System,” Attorney Docket No. 37112-213547; or U.
  • the blobs may be tracked via an object tracking algorithm and tracking information may be generated (see, e.g., U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System,” Attorney Docket No. 37112-213547; U.S. Patent Application No. 11/098,385, “Video Surveillance System Employing Video Primitives,” Attorney Docket No. 37112-215811; or U.S. Patent Application No. 11/139,600, “Multi-State Target Tracking,” filed May 31, 2005, Attorney Docket No. 37112-218196.
  • Block 1403 may be optional.
  • block 1404 From blocks 1402 and 1403, flow may proceed to block 1404.
  • the blobs from block 1402 and the tracking information from block 1403 may be used to analyze the blobs, and the number of targets may be identified. For example, the blobs may be analyzed based on their size and shape.
  • An exemplary embodiment of block 1403 is discussed below with respect to Figure 14.
  • the result of block 1404 may be targets that are the same as previous targets, less than the previous targets (i.e., a merger of previous targets), or more than the previous targets (i.e., a split of previous targets).
  • Figure 15 illustrates a flow diagram for analyzing blobs in block 1404 of Figure 14 according to an exemplary embodiment of the invention.
  • the flow may be performed for each blob identified in block 1302.
  • Flow may exit block 1404 once all (or a sufficient number) of blobs have been analyzed.
  • the size of the blob may be compared to a multiple target size threshold.
  • the multiple target size threshold may represent a size representing two or more normal targets (e.g., two or more humans). If the size of the blob is greater than the multiple target size threshold, flow may proceed to block 1503; otherwise, flow may proceed to block 1504.
  • the size of the blob may be greater than or equal to the multiple target size threshold, and the blob may be labeled as more than one target (e.g., labeled as ">1 human").
  • the size of the blob may be compared to a minimum single target size threshold.
  • the minimum single target size threshold may represent a minimum size of a normal target. If the size of the blob is less than the minimum target size threshold, flow may proceed to block 1505; otherwise, flow may proceed to block 1507.
  • the blob may be designated as representing no targets.
  • the size of the blob may be compared to a maximum single target size threshold.
  • the maximum single target size threshold may represent an expected maximum size of a normal target. If the size of the blob is less than the maximum single target size threshold, flow may proceed to block 1508; otherwise, flow may proceed to block 1509.
  • the size of the blob may be less than or equal to the multiple target size threshold but greater than the maximum single target size threshold, and additional analysis may be needed to determine the number of targets represented by the blob (i.e., no targets or one target).
  • eigen analysis may be performed to determine the major and minor axes of the blob.
  • the blob may then be split along its minor axis into two sub-blobs.
  • the convex area e.g., the area of the convex hull
  • the sub-blobs may be analyzed to determine if the each of the two sub-blobs conforms to the normal target.
  • the two sub-blobs may be analyzed to determine if their shape is similar to the shape of the normal target. The following analysis may be performed: if the ratio of the of each sub- blob's area to its convex hull area is greater than a minimum target solidity threshold, and if the convex area of each sub-blob is greater than the minimum single target size threshold, then the original blob may be considered to comprise two targets, and flow may proceed to block 1512; otherwise, flow may proceed to block 1513.
  • the blob may be considered to comprise two targets, and the blob may be labeled as more than one target (e.g., labeled as ">1 human”).
  • flow may be received from blocks 1503, 1508, 1512, and 1513, and the blob may be analyzed to determine if it is stationary.
  • a technique such as those described in, for example, U.S. Patent Application No. 10/354,096, "Video Scene Background Maintenance Using Change Detection and Classification," Attorney Docket No. 37112- 182386; or U.S. Patent Application No. 11/139,600, "Multi-State Target Tracking," filed May 31, 2005, Attorney Docket No. 37112-218196, may be used for this purpose.
  • flow may proceed to block 1515; otherwise, flow may proceed to block 1506.
  • the blob may be designated as represented no targets.
  • FIG. 16 illustrates a flow diagram for cleaning targets in block 806 of Figure 8 according to an exemplary embodiment of the invention.
  • each target may be analyzed individually.
  • a target may be obtained.
  • the target may be analyzed to determine if the target was detected in the frame. If the target was detected in the frame, flow may proceed to block 1603; otherwise, flow may proceed to block 1604.
  • the target may be detected in the frame and may be maintained.
  • the target may be analyzed to determine if the target was moving out of the field of view of the video camera in a prior frame. If the target was not moving out of the field of view, flow may proceed to block 1603, and the target is maintained; otherwise, flow may proceed to block 1605.
  • the target may not be detected in the frame, may have been moving out of the field of view, and may be removed from the list of current targets. Flow may exit block 806 once all (or a sufficient number) of targets have been analyzed for cleaning.

Abstract

A technique for video processing includes: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.

Description

Target Detection and Tracking from Overhead Video Streams
[0001 ] Field of the Invention
[0002] The invention relates to video surveillance systems and video verification systems. Specifically, the invention relates to a video surveillance system that may be configured to detect and track individual targets in video streams from an overhead camera view.
[0003] Background of the Invention
[0004] Video surveillance is of critical concern in many areas of life. One problem with video as a surveillance tool is that it may be very manually intensive to monitor. Recently, solutions have been proposed to the problems of automated video monitoring in the form of intelligent video surveillance systems. See, for example, U.S. Patent No. 6,696,945, "Video Tripwire," Attorney Docket No. 37112-175339; and U.S. Patent Application No. 09/987,707, "Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340, both of which are incorporated herein by reference. One application of video surveillance is the detection of human beings and their behaviors. Unfortunately, the science of computer vision, which is behind automated video monitoring, has limitations with respect to recognizing individual targets in overhead camera views, such as those used in residential, commercial, and home monitoring applications.
[0005] Current video surveillance systems (see, for example, C. Stauffer,
W.E.L. Grimson, "Learning Patterns of Activity Using Real-Time Tracking," IEEE Trans. PAMI, 22(8):747-757, August 2000; and R. Collins, A. Lipton, H. Fujiyoshi, and T. Kanade, "Algorithms for Cooperative Multisensor Surveillance," Proceedings of the IEEE, Vol. 89, No. 10, October, 2001, pp. 1456 - 1477, both of which are incorporated herein by reference) have two basic limitations. First, groups of targets may often be crowded together and detected as a single "blob." The blob may be correctly labeled as "human group," but the number of individuals comprising the group may not be ascertained. Second, other inanimate objects, such as, for example, furniture, strollers, and shopping carts, may generally not be disambiguated from legitimate targets (particularly in, for example, overhead camera shots). In addition, other "human detection" algorithms (see, for example, the techniques discussed at http://vismod.media.mit.edu/vismod/demos/pfinder/ and U.S. Patent Application No. 11/139,986, "Human Detection and Tracking for Security Applications," filed May 31, 2005, Attorney Docket No. 37112-218471, both of which are incorporated herein by reference) rely on more oblique camera views and specific human models to recognize humans, but generally do not perform well for overhead camera views.
[0006] Summary of the Invention
[0007] One embodiment of the invention includes a computer-readable medium comprising software for video processing, which when executed by a computer system, cause the computer system to perform operations comprising a method of: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
[0008] One embodiment of the invention includes a computer-based system to perform a method for video processing, the method comprising: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
[0009] One embodiment of the invention includes a method for video processing comprising: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
[0010] Brief Description of the Drawings [0011] The foregoing and other features and advantages of the invention will be apparent from the following, more particular description of the embodiments of the invention, as illustrated in the accompanying drawings. [0012] Figure 1 illustrates a video surveillance system according to an exemplary embodiment of the invention. [0013] Figure 2 illustrates an exemplary frame from a video stream from the video surveillance system according to an exemplary embodiment of the invention. [0014] Figure 3 illustrates a flow diagram for target detection and counting according to an exemplary embodiment of the invention. [0015] Figure 4 illustrates a flow diagram for detecting moving pixels according to an exemplary embodiment of the invention. [0016] Figure 5 illustrates a flow diagram for detecting line segments according to an exemplary embodiment of the invention. [0017] Figure 6 illustrates a flow diagram for finding a next line segment according to an exemplary embodiment of the invention. [0018] Figure 7 illustrates predicting new search directions according to an exemplary embodiment of the invention. [0019] Figure 8 illustrates a flow diagram for tracking targets according to an exemplary embodiment of the invention. [0020] Figure 9 illustrates a flow diagram for updating targets according to an exemplary embodiment of the invention. [0021] Figure 10 illustrates a flow diagram for detecting new targets according to an exemplary embodiment of the invention. [0022] Figure 11 illustrates a flow diagram for refining targets according to an exemplary embodiment of the invention. [0023] Figure 12 illustrates a flow diagram for merging targets according to an exemplary embodiment of the invention. [0024] Figure 13 illustrates a flow diagram for splitting targets according to an exemplary embodiment of the invention. [0025] Figure 14 illustrates a flow diagram for merging and splitting targets according to an exemplary embodiment of the invention. [0026] Figure 15 illustrates a flow diagram for analyzing blobs according to an exemplary embodiment of the invention. [0027] Figure 16 illustrates a flow diagram for cleaning targets according to an exemplary embodiment of the invention.
[0028] Definitions
[0029] In describing the invention, the following definitions are applicable throughout (including above).
[0030] A "computer" may refer to one or more apparatus and/or one or more systems that are capable of accepting a structured input, processing the structured input according to prescribed rules, and producing results of the processing as output. Examples of a computer may include: a computer; a stationary and/or portable computer; a computer having a single processor or multiple processors, which may operate in parallel and/or not in parallel; a general purpose computer; a supercomputer; a mainframe; a super mini-computer; a mini-computer; a workstation; a micro-computer; a server; a client; an interactive television; a web appliance; a telecommunications device with internet access; a hybrid combination of a computer and an interactive television; a portable computer; a personal digital assistant (PDA); a portable telephone; application-specific hardware to emulate a computer and/or software, such as, for example, a digital signal processor (DSP) or a field-programmable gate array (FPGA); a distributed computer system for processing information via computer systems linked by a network; two or more computer systems connected together via a network for transmitting or receiving information between the computer systems; and one or more apparatus and/or one or more systems that may accept data, may process data in accordance with one or more stored software programs, may generate results, and typically may include input, output, storage, arithmetic, logic, and control units.
[0031] "Software" may refer to prescribed rules to operate a computer. Examples of software may include software; code segments; instructions; computer programs; and programmed logic.
[0032] A "computer system" may refer to a system having a computer, where the computer may include a computer-readable medium embodying software to operate the computer.
[0033] A "network" may refer to a number of computers and associated devices that may be connected by communication facilities. A network may involve permanent connections such as cables or temporary connections such as those made through telephone or other communication links. Examples of a network may include: an internet, such as the Internet; an intranet; a local area network (LAN); a wide area network (WAN); and a combination of networks, such as an internet and an intranet.
[0034] "Video" may refer to motion pictures represented in analog and/or digital form. Examples of video may include television, movies, image sequences from a camera or other observer, and computer-generated image sequences. Video may be obtained from, for example, a live feed, a storage device, an IEEE 1394-based interface, a video digitizer, a computer graphics engine, or a network connection.
[0035] A "video camera" may refer to an apparatus for visual recording. Examples of a video camera may include one or more of the following: a video camera; a digital video camera; a color camera; a monochrome camera; a camera; a camcorder; a PC camera; a webcam; an infrared (IR) video camera; a low-light video camera; a thermal video camera; a closed-circuit television (CCTV) camera; a pan, tilt, zoom (PTZ) camera; and a video sensing device. A video camera may be positioned to perform surveillance of an area of interest.
[0036] "Video processing" may refer to any manipulation and/or analysis of video, including, for example, compression, editing, surveillance, and/or verification.
[0037] A "frame" may refer to a particular image or other discrete unit within a video.
[0038] Detailed Description of the Embodiments
[0039] In describing the exemplary embodiments of the present invention illustrated in the drawings, specific terminology is employed for the sake of clarity. However, the invention is not intended to be limited to the specific terminology so selected. It is to be understood that each specific element includes all technical equivalents that operate in a similar manner to accomplish a similar purpose. Each reference cited herein is incorporated by reference.
[0040] The invention relates to a video surveillance system that may be configured to detect and track individual targets in video streams from an overhead camera view and to a video verification system that may be configured to verify the occurrences being monintored. The system may be adapted to disambiguate multiple objects even when they interact in tight groups and to detect moving objects in the presence of other inanimate objects, such as moving shopping carts, strollers, moving furniture, and other items.
[0041] The invention may be used in a variety of applications. In a residential or commercial setting, the invention may be used to detect humans and reduce false alarms in a residential or commercial monitoring system. In a commercial setting, the invention may be used to determine building occupancy by counting individuals entering and leaving an area and/or to detect if "piggybacking" occurred (i.e., to detect an access control violation when two people enter or exit through a portal when only one may be authorized to do so). For physical security, the invention may be used to detect people moving the "wrong way" in a one way corridor, such as, for example, an airport exit or public transport escalator. For public safety, the invention may be used to detect people interacting in a dangerous way, such as, for example, a mugging or a drug deal. In a retail setting, the invention may be used to detect store occupancy, detect queue length at a checkout lane, or verify a point of sale (POS) transaction. In a public transportation setting, the invention may be used to count people entering a public transportation facility or vehicle and to perform video surveillance of a ticket reader to ensure that there is a ticket scanned when a person enters an area (e.g., to prevent a person from jumping over a turnstile, or overcoming another such obstacle).
[0042] As an exemplary embodiment, the invention may be used to verify the legitimacy of several classes of retail point of sale (POS) transactions. For example, a "merchandise return" transaction may require that a customer be physically present. As another example, a "manager override" transaction may require that a manager assist the cashier. The video surveillance system of the invention may monitor the locations and number of individuals around the POS console (e.g., the cash register) and determine if an appropriate configuration of people is present at the time of a particular transaction.
[0043] In Figures 1 and 2, the invention is illustrated for use in retail with a POS transaction verification application. Figure 1 illustrates the video surveillance system according to this exemplary embodiment of the invention. For an exemplary POS setting, the video surveillance system 101 of the invention may interact with a POS system 102. The video surveillance system 101 may include a video camera 103, a target (e.g., human) detection and counting module 104, a classification of transaction (valid/invalid) module 105, and a pre-defined rules database 106.
[0044] The video camera 103 may overlook the console of the POS system from an overhead position. The field of view of the video camera 103 may be looking down on the scene. The target detection and counting module 104 may receive input from the POS system 102 as a transaction report that a particular transaction is requested, underway, or has been completed. The target detection and counting module 104 may determine the number of humans, if any, in the video scene. An exemplary embodiment of the target detection and counting module 104 is discussed below with respect to Figures 4-16. The classification of transaction module 105 may determine the constellation of participants based on the rules received from the pre-defined rules database 106. The system 101 may then provide a transaction verification message back to the POS system 102 (or some other data monitoring or archiving system) to indicate whether the transaction was legitimate or not.
[0045] Blocks 105 and 106 may be implemented using the techniques discussed in, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System,"Attorney Docket No. 37112-213547; or U.S. Patent Application No. 11/098,385, "Video surveillance system employing video primitives," Attorney Docket No. 37112-215811, which are incorporated herein by reference. In these documents, the creation of rules and the performance of activity inference (e.g., people counting) are discussed. For this invention, for example, human target primitives, as discussed in, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340, may be used.
[0046] For the example of a POS system, a primitive called a "POS transaction primitive" may be used. This primitive may contain three data items: (1) the time of a POS transaction; (2) the location (which POS terminal) of the transaction; and (3) the type of transaction (sale, return, manager override, etc). Two rules for the rules database 106 may be used with the POS transaction primitive. Firstly a "return transaction verification" rule may be used as follows: if a POS return transaction (primitive) is registered; and there has been no customer present (>= human in a "customer" area of interest) for a [parameter] period of time; or there has been no cashier present (>= 1 human present in an "employee" area of interest) for a [parameter] period of time, then the transaction is invalid and an alarm condition is generated. Secondly, a "manager override" transaction rule that says the following: if a POS manager override transaction (primitive) is registered; and there have not been two employees present (>1 human in an "employee" area of interest) for a [parameter] period of time; then the transaction is invalid and an alarm condition is generated.
[0047] The video camera 103 may be connected to a computer-based system 107 that may perform analysis of the video from the video camera 103 to determine the locations and number of people in the scene. Examples of the computer-based system 107 may include the following: a computer, as defined above; a personal computer (PC), a laptop, a personal digital assistant (PDA), a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable array (FPGA), a microcontroller; or any other form-factor processor either as a standalone device or embedded in a video camera, a digital video recorder (DVR), a network video recorder (NVR), a network switcher, a network router, a POS terminal, or any other hardware device. The computer- based system 107 may include the human detection and counting module 104, the classification of transaction module 105, and the pre-defined rules database 106. The computer-based system 107 may be implemented with one or more computers employing software and connected to a network. Alternatively, the computer-based system 107 may be incorporated in whole or in part into the video camera 103. The human detection and counting module 104 and the classification of transaction module 105 may be implemented as a computer- readable medium comprising software to perform the operations of the modules 104 and 105, such that when the software is executed by a computer system, the computer system may be caused to perform the operations of the modules 104 and 105. Alternatively, the human detection and counting module 104 and the classification of transaction module 105, and the pre-defined rules database 106 may be implemented with application-specific hardware to emulate a computer and/or software.
[0048] Figure 2 illustrates an exemplary frame from a video stream from the video surveillance system according to an exemplary embodiment of the invention. The exemplary camera view may be from a video camera positioned overhead. In the exemplary frame, the customer is on the right, and two employees, namely a cashier and a manager, are on the left.
[0049] In the example of Figures 1 and 2, the invention is illustrated for use in retail with a POS transaction verification application. However, it is understood that the invention may be applied to any appropriate application as those skilled in the art will recognize.
[0050] Figure 3 illustrates a flow diagram for target detection and counting according to an exemplary embodiment of the invention. With the invention, targets may be described using co-moving sets of line segments extracted from the video scene. To extract these sets of line segments, blocks 301 and 302 may be employed. In block 301, moving pixels may be detected in the video stream using, for example, three-frame differencing, or some other technique (see, for example, U.S. Patent No. 6,625,310, "Video Segmentation Using Statistical Pixel Modeling," Attorney Docket No. 37112-164995; or U.S. Patent Application No. 10/354,096, "Video Scene Background Maintenance Using Change Detection and Classification," Attorney Docket No. 37112-182386, both of which are incorporated herein by reference), and a motion mask may be extracted. An exemplary embodiment of block 301 is discussed below with respect to Figure 4. In block 302, line segments may be detected using, for example, edge detection and line growing technique (see, for example, U.S. Patent Application No. 11/113,275, "Line Textured Target Detection and Tracking with Applications to 'Basket-run' Detection," Attorney Docket No. 37112-217049, which is incorporated herein by reference). An exemplary embodiment of block 302 is discussed below with respect to Figures 5-7. In block 303, targets may be identified as sets of line segments that fit the requirements a normal target (e.g., approximate target shape and size), given the field of view of the video camera. In block 304, targets may be tracked using a tracking filter, such as a Kalman filter, applied to the centroids of the targets, or some other technique (see, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; or U.S. Patent Application No. 11/139,600, "Multi-State Target Tracking," filed May 31, 2005, Attorney Docket No. 37112-218196, both of which are incorporated herein by reference). An exemplary embodiment of block 304 is discussed below with respect to Figures 8-16. [0051] Figure 4 illustrates a flow diagram for detecting moving pixels in block 301 of Figure 3 according to an exemplary embodiment of the invention. In block 401, the foreground moving area may be separated from the background scene. This separation may be performed using change detection. Change detection has been studied extensively in recent years, and many techniques are available. The output of the change detection may be a foreground mask for each frame. In block 402, the edges of each foreground mask may be detected. While other edge detection algorithms may be used, an exemplary embodiment of the invention may use the Canny edge detection, which produces single-pixel-width edges. The edge detection may be performed only on the foreground area, which may require some modifications to the Canny edge detector to incorporate the foreground mask information.
[0052] Figure 5 illustrates a flow diagram for detecting line segments in block 302 of Figure 3 according to an exemplary embodiment of the invention. According to an exemplary embodiment, a deterministic method may be used to detect line segments by extracting all of the line segments from an edge pixel map. The method may iteratively search an edge pixel map to find a new line segment until there are not enough unused edge pixels remaining. Each edge pixel may only be in one line segment, and after being used, the edge pixel may be removed from the edge pixel map.
[0053] The input to block 501 may be an edge pixel map of the frame obtained by, for example, block 402 in Figure 4. In block 501, edge pixels may be counted. In block 502, a determination may be made whether a sufficient number of edge pixels exist (or remain) to identify a line segment. The threshold to check this condition may be determined by user input parameters on the rough image size of an exemplary object, such as, for example, a shopping cart. For example, if the rough image width of a shopping cart is sixty pixels, the threshold on the sufficient remaining edge pixels may be, for example, one third of it, that is, twenty pixels. This threshold may be called the minimum line segment length threshold. If a sufficient number of edge pixels do not exist (or remain), flow may proceed to block 507; otherwise, flow may proceed to block 503. In block 503, a new line segment may be identified. An exemplary embodiment of block 503 is discussed below with respect to Figure 6. In block 504, the edge pixel map may be updated to eliminate the pixels used in block 503, as noted above. In block 505, a determination may be made whether the new line segment is valid based on, for example, its length and linearity. For example, if the new line segment from block 503 has length much shorter than the image dimension of an expected shopping cart or if its overall linearity is too low, the new line segment may be considered as an invalid line segment. If the new line segment is not valid, the invalid line segment may be discarded, and flow may proceed to block 501; otherwise, flow proceeds to block 506. In block 506, the valid line segment may be added to a list of line segments in the frame. In block 514, the list of valid line segments may be outputted.
[0054] Figure 6 illustrates a flow diagram for finding a next line segment in block 503 of Figure 5 according to an exemplary embodiment of the invention. In block 601, a starting point of the new line segment is identified from a given edge pixel map. For the first line segment, this start point may be obtained by scanning through the whole edge pixel map from the top left corner until the first unused edge point is located. For all subsequent line segments, the search may be speeded up by using the start point of the preceding line segment as the scanning start position. In block 602, the next search directions may be predicted for the end point based on an estimated line direction. An exemplary embodiment of block 602 is discussed below with respect to Figure 7. In block 603, the next line pixel may be identified by looping through each predicted search position to determine if the pixel is an edge pixel. In block 604, if the next line pixel is an edge pixel, the pixel may be added to the line segment as the new end point, and flow may proceed to block 602. Otherwise, the next line pixel may be searched for in both directions, and flow may proceed to block 605. In block 605, if the next line pixel can not be found in one direction, the reverse direction may have already been searched. If the reverse direction has not been searched, flow may proceed to block 606; otherwise, flow may proceed to block 607. In block 606, the search process may reverse the line direction. The end point may become the start point, the start point may become the current end point, and flow proceeds back to block 602. In block 607, the end of the search process on the current line segment may be reached, and the line segment may be outputted.
[0055] Figure 7 illustrates predicting new search directions in block 602 of Figure 6 according to an exemplary embodiment of the invention. Area 702 may depict a region of an image, where each block indicates one pixel location. Area 704 may indicate the current end point pixel of the current line segment. Three different states may be considered when predicting the next search positions. For the first state (the initial pixel), the current end point pixel may also be the start point. In this case, all of the eight neighboring directions A-H of the end point pixel are searched as shown by reference numeral 706.
[0056] For the second state, once multiple pixels in a line segment exist, the direction of the line segment may be estimated using information provided by the pixels of the line segment. One way to determine the line direction may be to perform clustering of the line segment pixels into two groups, namely the starting pixels and the ending pixels, which may correspond to the first half and second half of the line segment, respectively. The line direction may then be determined by using the average locations of the two groups of pixels.
[0057] For the third state, when a current line direction is available, for example, as may be indicated by arrow 708, the top three directions may be selected, for example, C, D, and E, indicated by reference numeral 710, that have minimum angle distances from the line direction. Two further scenarios may be considered in this case. First, the line may not yet be long enough to become a consistent line segment, where it is unclear whether the list of pixels is a part of a line segment or just a cluster of neighboring edge pixels. One way to determine if the current line segment is sufficiently consistent may be to use the minimum length threshold discussed above. In particular, if the line segment is less than this threshold, the line segment may be considered not to be sufficiently consistent. To avoid extracting a false line segment, the three direct neighboring locations 710 may be included as the next search locations. Second, the line segment may be long enough and may be consistently extracted. In this case, a portion of the line may be missing due to an occasional small gap in the edge map caused by noise. Thus, further neighborhood search locations may be included as indicated by reference numeral 712.
[0058] Figure 8 illustrates a flow diagram for tracking targets in block 304 of Figure 3 according to an exemplary embodiment of the invention. In block 801, existing targets may be updated as new information is received from frame to frame. An exemplary embodiment of block 801 is discussed below with respect to Figure 9. In block 802, new targets may be recognized from any unassigned line segments that have not been deemed part of an existing target. An exemplary embodiment of block 802 is discussed below with respect to Figure 10. In block 803, the targets may be refined to ensure that the available features may be accommodated. An exemplary embodiment of block 803 is discussed below with respect to Figure 11. In block 804, the targets may be analyzed to determine if they should be merged (i.e., two targets become one target), and in block 805, the targets may be analyzed to determine if they should be split (i.e., one target becomes two targets). An exemplary embodiment of blocks 804 and 805 is discussed below with respect to Figures 12-15. In block 806, the targets are cleaned, which may be used to determine when a target has left the field of view of the video camera. An exemplary embodiment of block 806 is discussed below with respect to Figure 16.
[0059] Figure 9 illustrates a flow diagram for updating targets in block 801 of Figure 8 according to an exemplary embodiment of the invention. In block 901, the parameters (e.g., position and size, or position, size, and velocity) of existing targets may be predicted using an appropriate tracking filter, such as, for example, a Kalman filter or the another tracking filtering (see, for example, U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; or U.S. Patent Application No. 11/139,600, "Multi-State Target Tracking," filed May 31, 2005, Attorney Docket No. 37112-218196). In block 902, the line segments that have been detected may be assigned to each of the targets based on their locations with respect to the centroid and size of the existing target. In block 903, the targets may be updated. For example, the target's new position, size and velocity may be updated according to the tracking filter update rules.
[0060] Figure 10 illustrates a flow diagram for detecting new targets in block 802 of
Figure 8 according to an exemplary embodiment of the invention. In block 1001, any unassigned line segments may be clustered using, for example, a neighborhood grouping method. For example, any line segments within a certain threshold of distance from each other may be clustered into a single group. In block 1002, the cluster of the unassigned line segments may be verified to make ensure they correspond to the pre-defined requirements of a target. For example, if a human target in the field of view of Figure 2 is used to define the requirements of a target, the cluster of the unassigned line segments may need to have the correct approximate size to indicate the presence of a human target. If the cluster of the unassigned line segments is too large or too small, the cluster of the unassigned line segments may be rejected. In block 1003, assuming the cluster of the unassigned line segments fits the requirements of a target definition from block 1002, the cluster of unassigned line segments may be designated as a new target, and a tracking filter may be instantiated for the new target with the position and size of the cluster of unassigned line segments as the initial parameters for the new target.
[0061] Figure 11 illustrates a flow diagram for refining targets in block 803 of Figure 8 according to an exemplary embodiment of the invention. In block 1101, any remaining line segments that have not been assigned to existing or new targets may be agglomerated into their nearest neighbor target. In block 1102, the targets may be re-estimated based on the new features. For example, the position and velocity of the targets may be re-calculated, and the associated tracking filter may be updated with these new parameters. In block 1103, a determination may be made as to whether or not each target is becoming stationary (i.e., stops moving). If the number and size of line segments associated with that target decreases, the target may be ceasing motion. If the target is determined to becoming stationary, flow proceeds to block 1104; otherwise, flow may exit from block 803. In block 1104, the target's parameters (e.g., size, position, and velocity) may be updated using all (or some) of the moving pixels in the target's vicinity rather than just the moving line segments.
[0062] Figure 12 illustrates a flow diagram for merging targets in block 804 of Figure 8 according to an exemplary embodiment of the invention. In block 1201, two targets may be obtained. In block 1202, the parameters of the obtained targets may be compared. For example, the size and history (or age) of the targets may be compared. If the two targets occupy similar space, one is smaller than the other, and one is younger than the other, the two targets may be deemed similar enough to be merged into a single target. If the parameters of the targets are similar, flow may proceed to block 1203; otherwise, flow may proceed to block 1201. In block 1203, the two target may be merged into a single target. For example, the smaller and/or younger target may be merged into the larger one. After block 1203, flow may proceed to block 1201. For flow returning to block 1201, two targets may be obtained that have not been compared previously. Flow may exit block 804 once all (or a sufficient number) of targets have been compared for merger.
[0063] Figure 13 illustrates a flow diagram for splitting targets in block 805 of Figure 8 according to an exemplary embodiment of the invention. In block 1301, a target may be obtained. In block 1302, a determination may be made whether the target is similar to a normal target. For example, the normal target may be modeled after a person in Figure 2. If the target and normal target are compared based on, for example, their sizes, and if the target is larger than the normal target, the target may be determined not to be similar to the normal target. If the target is not similar to the normal target, flow may proceed to block 1303; otherwise, flow may proceed to block 1301. In block 1303, clusters may be obtained from the line segments of the target. For example, two line segments that are furthest away from each other within the target may be identified, and clustering may be re-initialized (as in block 1001 of Figure 10) with both of these line segments as the starting points. The result may be two new clusters of line segments. In block 1304, a determination may be made whether the two new clusters of line segments are similar to the normal target. For example, if the resulting two clusters are of appropriate size and shape when compared to the normal target, the two clusters may be considered individual targets. If the two new clusters of line segments are similar to the normal target, flow may proceed to block 1305; otherwise, flow may proceed to block 1301. In block 1305, target identities may be assigned to the two new clusters of line segments. For example, the smaller cluster may be assigned a new identity, and the larger cluster may maintain the original identity of the target. From block 1305, flow may proceed to block 1301. Flow may exit block 805 once all (or a sufficient number) of targets have been analyzed for splitting.
[0064] As an alternative to the techniques discussed with respect to Figures 12 and 13, the merging and splitting of targets may be considered simultaneously and may be based on, for example, the analysis of the shape of the moving target blob. For example, with reference to Figure 2, the analysis may result in labeling the number of human targets in a blob as "no targets,"one human target," or ">1 human targets." Other embodiments might seek to count specific targets in a group. Figure 14 illustrates a flow diagram for merging and splitting targets in blocks 804 and 805 of Figure 8 according to an exemplary embodiment of the invention. In block 1401, a foreground mask may be generated for each video frame. This foreground mask may be generated using the detection of moving pixels discussed for block 301 of Figure 3 or another foreground object detection technique (see, for example, U.S. Patent No. 6,625,310, "Video Segmentation Using Statistical Pixel Modeling," Attorney Docket No. 37112-164995; U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System," Attorney Docket No. 37112- 213547; or U.S. Patent Application No. 11/098,385, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-215811, all of which are incorporated herein by reference).
[0065] In block 1402, foreground objects (i.e., blobs) may be detected within the motion mask generated in block 1401. The foreground objects may be detected using a clustering algorithm (see, e.g., U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System," Attorney Docket No. 37112-213547; or U.S. Patent Application No. 11/098,385, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-215811).
[0066] Optionally, in block 1403, the blobs may be tracked via an object tracking algorithm and tracking information may be generated (see, e.g., U.S. Patent Application No. 09/987,707, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-175340; U.S. Patent Application No. 11/057,154, "Video Surveillance System," Attorney Docket No. 37112-213547; U.S. Patent Application No. 11/098,385, "Video Surveillance System Employing Video Primitives," Attorney Docket No. 37112-215811; or U.S. Patent Application No. 11/139,600, "Multi-State Target Tracking," filed May 31, 2005, Attorney Docket No. 37112-218196. Block 1403 may be optional.
[0067] From blocks 1402 and 1403, flow may proceed to block 1404. In block 1404, the blobs from block 1402 and the tracking information from block 1403 may be used to analyze the blobs, and the number of targets may be identified. For example, the blobs may be analyzed based on their size and shape. An exemplary embodiment of block 1403 is discussed below with respect to Figure 14. The result of block 1404 may be targets that are the same as previous targets, less than the previous targets (i.e., a merger of previous targets), or more than the previous targets (i.e., a split of previous targets).
[0068] Figure 15 illustrates a flow diagram for analyzing blobs in block 1404 of Figure 14 according to an exemplary embodiment of the invention. In block 1501, the flow may be performed for each blob identified in block 1302. Flow may exit block 1404 once all (or a sufficient number) of blobs have been analyzed. In block 1502, the size of the blob may be compared to a multiple target size threshold. For example, the multiple target size threshold may represent a size representing two or more normal targets (e.g., two or more humans). If the size of the blob is greater than the multiple target size threshold, flow may proceed to block 1503; otherwise, flow may proceed to block 1504. In block 1503, the size of the blob may be greater than or equal to the multiple target size threshold, and the blob may be labeled as more than one target (e.g., labeled as ">1 human").
[0069] In block 1504, the size of the blob may be compared to a minimum single target size threshold. The minimum single target size threshold may represent a minimum size of a normal target. If the size of the blob is less than the minimum target size threshold, flow may proceed to block 1505; otherwise, flow may proceed to block 1507. In block 1505, the size of the blob may be less than the minimum single target size threshold, and the blob may be labeled as no target (e.g., labeled as "= 0 human"). In block 1506, the blob may be designated as representing no targets.
[0070] In block 1507, the size of the blob may be compared to a maximum single target size threshold. The maximum single target size threshold may represent an expected maximum size of a normal target. If the size of the blob is less than the maximum single target size threshold, flow may proceed to block 1508; otherwise, flow may proceed to block 1509. In block 1508, the size of the blob may be less than the maximum single target size threshold, and the blob may be labeled as one target (e.g., labeled as "= 1 human").
[0071] If flow proceeds to block 1509, the size of the blob may be less than or equal to the multiple target size threshold but greater than the maximum single target size threshold, and additional analysis may be needed to determine the number of targets represented by the blob (i.e., no targets or one target). In block 1509, eigen analysis may be performed to determine the major and minor axes of the blob. The blob may then be split along its minor axis into two sub-blobs. In block 1510, the convex area (e.g., the area of the convex hull) of each sub-blob may be determined.
[0072] In block 1511, the sub-blobs may be analyzed to determine if the each of the two sub-blobs conforms to the normal target. For example, the two sub-blobs may be analyzed to determine if their shape is similar to the shape of the normal target. The following analysis may be performed: if the ratio of the of each sub- blob's area to its convex hull area is greater than a minimum target solidity threshold, and if the convex area of each sub-blob is greater than the minimum single target size threshold, then the original blob may be considered to comprise two targets, and flow may proceed to block 1512; otherwise, flow may proceed to block 1513. In block 1512, the blob may be considered to comprise two targets, and the blob may be labeled as more than one target (e.g., labeled as ">1 human"). In block 1513, the blob may be considered to comprise one target, and the blob may be labeled as one target (e.g., labeled as "=1 human").
[0073] In block 1514, flow may be received from blocks 1503, 1508, 1512, and 1513, and the blob may be analyzed to determine if it is stationary. To determine if the blob is stationary, a technique such as those described in, for example, U.S. Patent Application No. 10/354,096, "Video Scene Background Maintenance Using Change Detection and Classification," Attorney Docket No. 37112- 182386; or U.S. Patent Application No. 11/139,600, "Multi-State Target Tracking," filed May 31, 2005, Attorney Docket No. 37112-218196, may be used for this purpose. If the blob is stationary, flow may proceed to block 1515; otherwise, flow may proceed to block 1506. In block 1515, the blob may be designated as represented no targets.
[0074] Figure 16 illustrates a flow diagram for cleaning targets in block 806 of Figure 8 according to an exemplary embodiment of the invention. In Figure 16, each target may be analyzed individually. In block 1601, a target may be obtained. In block 1602, the target may be analyzed to determine if the target was detected in the frame. If the target was detected in the frame, flow may proceed to block 1603; otherwise, flow may proceed to block 1604. In block 1603, the target may be detected in the frame and may be maintained. In block 1604, the target may be analyzed to determine if the target was moving out of the field of view of the video camera in a prior frame. If the target was not moving out of the field of view, flow may proceed to block 1603, and the target is maintained; otherwise, flow may proceed to block 1605. In block 1605, the target may not be detected in the frame, may have been moving out of the field of view, and may be removed from the list of current targets. Flow may exit block 806 once all (or a sufficient number) of targets have been analyzed for cleaning.
[0075] The examples and embodiments described herein are non-limiting examples.
[0076] The invention is described in detail with respect to exemplary embodiments, and it will now be apparent from the foregoing to those skilled in the art that changes and modifications may be made without departing from the invention in its broader aspects, and the invention, therefore, as defined in the claims is intended to cover all such changes and modifications as fall within the true spirit of the invention.

Claims

ClaimsWhat is claimed is:
1. A computer-readable medium comprising software for video processing, which when executed by a computer system, cause the computer system to perform operations comprising a method of: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
2. A computer-readable medium as in claim 1, wherein detecting moving pixels comprises: separating foreground in the video from background in the video; and detecting edges in the video.
3. A computer-readable medium as in claim 1, wherein detecting line segments comprises: counting edge pixels; and identifying a line segment based on the edge pixels.
4. A computer-readable medium as in claim 3, wherein identifying the line segment comprises: identifying a start point; predicting next search directions; identifying a next line pixel; and providing a line segment.
5. A computer-readable medium as in claim 1, wherein identifying targets comprises: updating existing targets; detecting new targets; refining the new targets; merging the existing targets and the new targets; splitting the existing targets and the new targets; and cleaning the existing targets and the new targets.
6. A computer-readable medium as in claim 5, wherein updating targets comprises: predicting a target; assigning a line segment to the predicted target; and updating the target.
7. A computer-readable medium as in claim 5, wherein detecting new targets comprises: performing line segment clustering; performing cluster verification based on the line segment clustering; and generating a new target based on the cluster verification.
8. A computer-readable medium as in claim 5, wherein refining the new targets comprises: agglomerating remaining line segments to nearest targets; re-estimating the targets; and updating the targets.
9. A computer-readable medium as in claim 5, wherein merging the existing targets and the new targets comprises: obtaining a target pair; and merging the target pair if parameters of the target pair are similar.
10. A computer-readable medium as in claim 5, wherein splitting the existing targets and the new targets comprises: obtaining a target; performing line clustering on the obtained target if the obtained target is not similar to a normal target to obtain clusters; and assigning a target identities to the clusters if the clusters are similar to the normal target.
11. A computer-readable medium as in claim 5, wherein merging and splitting the existing targets and the new targets comprises: generating a foreground mask; detecting foreground objects based on the foreground mask; and analyzing the foreground objects to obtain a number of targets.
12. A computer-readable medium as in claim 11, wherein analyzing the foreground objects is based on comparing the foreground objects to a multiple target size threshold, a minimum single target size threshold, and a maximum single target size threshold.
13. A computer-readable medium as in claim 5, wherein cleaning the existing targets and the new targets comprises: obtaining a target; maintaining the obtained target if the obtained target is detected in a current frame or was not moving out of a field of view of the video camera; and removing the obtained target if the obtained target is not detected in a current frame and was moving out of a field of view of the video camera.
14. An computer-based system to perform a method for video processing, said method comprising: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
15. A method to for video processing, comprising: receiving video from an overhead view of a scene; detecting moving pixels in the video; detecting line segments in the video based on detected moving pixels; identifying targets in the video based on the detected line segments; tracking targets in the video based on the identified targets; and managing tracked targets in the video.
PCT/US2006/024485 2004-06-24 2006-06-23 Target detection and tracking from overhead video streams WO2007002404A2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP2008518435A JP2008544705A (en) 2005-06-24 2006-06-23 Detect and track surveillance objects from overhead video streams
MX2007016406A MX2007016406A (en) 2005-06-24 2006-06-23 Target detection and tracking from overhead video streams.
EP06785442.2A EP1894142B1 (en) 2005-06-24 2006-06-23 Target detection and tracking from overhead video streams
CA002611522A CA2611522A1 (en) 2005-06-24 2006-06-23 Target detection and tracking from overhead video streams
IL188196A IL188196A0 (en) 2004-06-24 2007-12-17 Target detection and tracking from overhead video streams

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/165,435 2005-06-24
US11/165,435 US7796780B2 (en) 2005-06-24 2005-06-24 Target detection and tracking from overhead video streams

Publications (3)

Publication Number Publication Date
WO2007002404A2 true WO2007002404A2 (en) 2007-01-04
WO2007002404A3 WO2007002404A3 (en) 2007-05-31
WO2007002404A8 WO2007002404A8 (en) 2007-07-12

Family

ID=37567393

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/024485 WO2007002404A2 (en) 2004-06-24 2006-06-23 Target detection and tracking from overhead video streams

Country Status (8)

Country Link
US (1) US7796780B2 (en)
EP (1) EP1894142B1 (en)
JP (1) JP2008544705A (en)
KR (1) KR20080021804A (en)
CN (1) CN101208710A (en)
CA (1) CA2611522A1 (en)
MX (1) MX2007016406A (en)
WO (1) WO2007002404A2 (en)

Families Citing this family (93)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9892606B2 (en) 2001-11-15 2018-02-13 Avigilon Fortress Corporation Video surveillance system employing video primitives
US8711217B2 (en) 2000-10-24 2014-04-29 Objectvideo, Inc. Video surveillance system employing video primitives
US8564661B2 (en) 2000-10-24 2013-10-22 Objectvideo, Inc. Video analytic rule detection system and method
US7424175B2 (en) 2001-03-23 2008-09-09 Objectvideo, Inc. Video segmentation using statistical pixel modeling
CN101053254B (en) * 2004-11-03 2010-06-23 皇家飞利浦电子股份有限公司 High intensity display screen based electronic window
WO2006127608A2 (en) * 2005-05-23 2006-11-30 Nextcode Corporation Efficient finder patterns and methods for application to 2d machine vision problems
US7796780B2 (en) 2005-06-24 2010-09-14 Objectvideo, Inc. Target detection and tracking from overhead video streams
US7801330B2 (en) * 2005-06-24 2010-09-21 Objectvideo, Inc. Target detection and tracking from video streams
US7773116B1 (en) * 2006-02-08 2010-08-10 Lockheed Martin Corporation Digital imaging stabilization
CN101443789B (en) 2006-04-17 2011-12-28 实物视频影像公司 video segmentation using statistical pixel modeling
TW200745996A (en) 2006-05-24 2007-12-16 Objectvideo Inc Intelligent imagery-based sensor
US7925536B2 (en) * 2006-05-25 2011-04-12 Objectvideo, Inc. Intelligent video verification of point of sale (POS) transactions
TW200822751A (en) 2006-07-14 2008-05-16 Objectvideo Inc Video analytics for retail business process monitoring
US20080074496A1 (en) * 2006-09-22 2008-03-27 Object Video, Inc. Video analytics for banking business process monitoring
US7831098B2 (en) * 2006-11-07 2010-11-09 Recognition Robotics System and method for visual searching of objects using lines
WO2008100537A2 (en) * 2007-02-12 2008-08-21 Sorensen Associates Inc. Still image shopping event monitoring and analysis system and method
US20080273754A1 (en) * 2007-05-04 2008-11-06 Leviton Manufacturing Co., Inc. Apparatus and method for defining an area of interest for image sensing
US8131010B2 (en) * 2007-07-30 2012-03-06 International Business Machines Corporation High density queue estimation and line management
TW200926011A (en) * 2007-09-04 2009-06-16 Objectvideo Inc Background modeling with feature blocks
US9646312B2 (en) * 2007-11-07 2017-05-09 Game Design Automation Pty Ltd Anonymous player tracking
US9019381B2 (en) 2008-05-09 2015-04-28 Intuvision Inc. Video tracking systems and methods employing cognitive vision
WO2009141955A1 (en) * 2008-05-21 2009-11-26 パナソニック株式会社 Image pickup apparatus, image pick-up method and integrated circuit
KR101192429B1 (en) * 2008-12-19 2012-10-17 주식회사 케이티 Method for restoring transport error included in image and apparatus thereof
US8805004B2 (en) * 2009-01-09 2014-08-12 Thomson Licensing Method and apparatus for detecting and separating objects of interest in soccer video by color segmentation and shape analysis
KR101632963B1 (en) * 2009-02-02 2016-06-23 아이사이트 모빌 테크놀로지 엘티디 System and method for object recognition and tracking in a video stream
GB2475104A (en) * 2009-11-09 2011-05-11 Alpha Vision Design Res Ltd Detecting movement of 3D objects using a TOF camera
US8873798B2 (en) 2010-02-05 2014-10-28 Rochester Institue Of Technology Methods for tracking objects using random projections, distance learning and a hybrid template library and apparatuses thereof
US20110249123A1 (en) * 2010-04-09 2011-10-13 Honeywell International Inc. Systems and methods to group and browse cameras in a large scale surveillance system
US8730396B2 (en) * 2010-06-23 2014-05-20 MindTree Limited Capturing events of interest by spatio-temporal video analysis
EP2708032A4 (en) * 2011-05-12 2014-10-29 Solink Corp Video analytics system
US8831287B2 (en) * 2011-06-09 2014-09-09 Utah State University Systems and methods for sensing occupancy
US8334898B1 (en) 2011-07-26 2012-12-18 ByteLight, Inc. Method and system for configuring an imaging device for the reception of digital pulse recognition information
US8520065B2 (en) 2011-07-26 2013-08-27 ByteLight, Inc. Method and system for video processing to determine digital pulse recognition tones
US8436896B2 (en) 2011-07-26 2013-05-07 ByteLight, Inc. Method and system for demodulating a digital pulse recognition signal in a light based positioning system using a Fourier transform
US8432438B2 (en) 2011-07-26 2013-04-30 ByteLight, Inc. Device for dimming a beacon light source used in a light based positioning system
US9444547B2 (en) 2011-07-26 2016-09-13 Abl Ip Holding Llc Self-identifying one-way authentication method using optical signals
US8866391B2 (en) 2011-07-26 2014-10-21 ByteLight, Inc. Self identifying modulated light source
US9287976B2 (en) 2011-07-26 2016-03-15 Abl Ip Holding Llc Independent beacon based light position system
US8457502B2 (en) 2011-07-26 2013-06-04 ByteLight, Inc. Method and system for modulating a beacon light source in a light based positioning system
US9418115B2 (en) 2011-07-26 2016-08-16 Abl Ip Holding Llc Location-based mobile services and applications
US8334901B1 (en) 2011-07-26 2012-12-18 ByteLight, Inc. Method and system for modulating a light source in a light based positioning system using a DC bias
US9723676B2 (en) 2011-07-26 2017-08-01 Abl Ip Holding Llc Method and system for modifying a beacon light source for use in a light based positioning system
US9787397B2 (en) 2011-07-26 2017-10-10 Abl Ip Holding Llc Self identifying modulated light source
US8416290B2 (en) 2011-07-26 2013-04-09 ByteLight, Inc. Method and system for digital pulse recognition demodulation
US8994799B2 (en) 2011-07-26 2015-03-31 ByteLight, Inc. Method and system for determining the position of a device in a light based positioning system using locally stored maps
US9288450B2 (en) 2011-08-18 2016-03-15 Infosys Limited Methods for detecting and recognizing a moving object in video and devices thereof
US8809788B2 (en) * 2011-10-26 2014-08-19 Redwood Systems, Inc. Rotating sensor for occupancy detection
CN102842036B (en) * 2011-11-30 2015-07-15 三峡大学 Intelligent multi-target detection method facing ship lock video monitoring
CN102521844A (en) * 2011-11-30 2012-06-27 湖南大学 Particle filter target tracking improvement method based on vision attention mechanism
US8825368B2 (en) * 2012-05-21 2014-09-02 International Business Machines Corporation Physical object search
US9147114B2 (en) * 2012-06-19 2015-09-29 Honeywell International Inc. Vision based target tracking for constrained environments
CN102831617A (en) * 2012-07-17 2012-12-19 聊城大学 Method and system for detecting and tracking moving object
US9213781B1 (en) 2012-09-19 2015-12-15 Placemeter LLC System and method for processing image data
US9197861B2 (en) * 2012-11-15 2015-11-24 Avo Usa Holding 2 Corporation Multi-dimensional virtual beam detection for video analytics
US10091556B1 (en) * 2012-12-12 2018-10-02 Imdb.Com, Inc. Relating items to objects detected in media
CN103079061B (en) * 2013-01-30 2016-07-13 浙江宇视科技有限公司 A kind of video tracking processes device and video link processes device
US20140236653A1 (en) * 2013-02-15 2014-08-21 Tyco Fire & Security Gmbh Systems and methods for retail line management
DE102014209039A1 (en) * 2013-05-22 2014-11-27 Osram Gmbh Method and system for location detection
US9705600B1 (en) 2013-06-05 2017-07-11 Abl Ip Holding Llc Method and system for optical communication
KR20150018037A (en) * 2013-08-08 2015-02-23 주식회사 케이티 System for monitoring and method for monitoring using the same
KR20150018696A (en) 2013-08-08 2015-02-24 주식회사 케이티 Method, relay apparatus and user terminal for renting surveillance camera
CN103581630A (en) * 2013-11-15 2014-02-12 东华大学 Remote video monitoring device and method based on DSP and FPGA
WO2015077767A1 (en) 2013-11-25 2015-05-28 Daniel Ryan System and method for communication with a mobile device via a positioning system including rf communication devices and modulated beacon light sources
KR20150075224A (en) 2013-12-24 2015-07-03 주식회사 케이티 Apparatus and method for providing of control service
CN104751157B (en) * 2013-12-31 2018-07-27 中核控制系统工程有限公司 Detecting and tracking method based on FPGA
WO2015166612A1 (en) 2014-04-28 2015-11-05 日本電気株式会社 Image analysis system, image analysis method, and image analysis program
JP2017525064A (en) 2014-05-30 2017-08-31 プレイスメーター インコーポレイテッドPlacemeter Inc. System and method for activity monitoring using video data
CN105989367B (en) 2015-02-04 2019-06-28 阿里巴巴集团控股有限公司 Target Acquisition method and apparatus
US9760792B2 (en) 2015-03-20 2017-09-12 Netra, Inc. Object detection and classification
US9922271B2 (en) 2015-03-20 2018-03-20 Netra, Inc. Object detection and classification
US11334751B2 (en) 2015-04-21 2022-05-17 Placemeter Inc. Systems and methods for processing video data for activity monitoring
US10043078B2 (en) 2015-04-21 2018-08-07 Placemeter LLC Virtual turnstile system and method
CN104932365A (en) * 2015-06-03 2015-09-23 东莞市精致自动化科技有限公司 Intelligent visual system
CN105096321B (en) * 2015-07-24 2018-05-18 上海小蚁科技有限公司 A kind of low complex degree Motion detection method based on image border
JP6022123B1 (en) * 2015-11-09 2016-11-09 三菱電機株式会社 Image generation system and image generation method
US11100335B2 (en) 2016-03-23 2021-08-24 Placemeter, Inc. Method for queue time estimation
CN105931406A (en) * 2016-06-24 2016-09-07 付韶明 Video monitoring and alarming system based on face identification
US10223590B2 (en) * 2016-08-01 2019-03-05 Qualcomm Incorporated Methods and systems of performing adaptive morphology operations in video analytics
WO2018035667A1 (en) * 2016-08-22 2018-03-01 深圳前海达闼云端智能科技有限公司 Display method and apparatus, electronic device, computer program product, and non-transient computer readable storage medium
US10186124B1 (en) 2017-10-26 2019-01-22 Scott Charles Mullins Behavioral intrusion detection system
CN108205660B (en) * 2017-12-12 2020-04-14 浙江浙大中控信息技术有限公司 Infrared image pedestrian flow detection device and detection method based on top view angle
CN108024113B (en) * 2017-12-15 2021-05-11 东华大学 Target ratio self-adaptive compressed domain small target tracking method
CN108319918B (en) * 2018-02-05 2022-07-08 中国科学院长春光学精密机械与物理研究所 Embedded tracker and target tracking method applied to same
US11615623B2 (en) 2018-02-19 2023-03-28 Nortek Security & Control Llc Object detection in edge devices for barrier operation and parcel delivery
US11295139B2 (en) 2018-02-19 2022-04-05 Intellivision Technologies Corp. Human presence detection in edge devices
CN108337486A (en) * 2018-04-19 2018-07-27 北京软通智城科技有限公司 A kind of device and method of the video analysis of the algorithm configuration based on scene
CA3095327C (en) 2018-05-18 2023-03-14 Essity Hygiene And Health Aktiebolag Presence and absence detection
CN109005327A (en) * 2018-08-31 2018-12-14 北京诚志重科海图科技有限公司 A kind of video structural picture pick-up device and system
CN109558817B (en) * 2018-11-16 2021-01-01 西安电子科技大学 Airport runway detection method based on FPGA acceleration
US11126861B1 (en) 2018-12-14 2021-09-21 Digimarc Corporation Ambient inventorying arrangements
US11653052B2 (en) * 2020-10-26 2023-05-16 Genetec Inc. Systems and methods for producing a privacy-protected video clip
US11729445B2 (en) * 2021-12-28 2023-08-15 The Adt Security Corporation Video rights management for an in-cabin monitoring system
CN114612712A (en) * 2022-03-03 2022-06-10 北京百度网讯科技有限公司 Object classification method, device, equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6696945B1 (en) 2001-10-09 2004-02-24 Diamondback Vision, Inc. Video tripwire
US20050104958A1 (en) 2003-11-13 2005-05-19 Geoffrey Egnal Active camera video-based surveillance systems and methods
US20050146605A1 (en) 2000-10-24 2005-07-07 Lipton Alan J. Video surveillance system employing video primitives

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6263088B1 (en) * 1997-06-19 2001-07-17 Ncr Corporation System and method for tracking movement of objects in a scene
US6639593B1 (en) * 1998-07-31 2003-10-28 Adobe Systems, Incorporated Converting bitmap objects to polygons
IL141589A (en) * 1998-09-10 2005-11-20 Ecchandes Inc Optical counting device using edge analysis
US6545706B1 (en) * 1999-07-30 2003-04-08 Electric Planet, Inc. System, method and article of manufacture for tracking a head of a camera-generated image of a person
EP1109141A1 (en) * 1999-12-17 2001-06-20 Siemens Building Technologies AG Presence detector and use thereof
US20050162515A1 (en) * 2000-10-24 2005-07-28 Objectvideo, Inc. Video surveillance system
US7868912B2 (en) * 2000-10-24 2011-01-11 Objectvideo, Inc. Video surveillance system employing video primitives
US9892606B2 (en) * 2001-11-15 2018-02-13 Avigilon Fortress Corporation Video surveillance system employing video primitives
US6625310B2 (en) * 2001-03-23 2003-09-23 Diamondback Vision, Inc. Video segmentation using statistical pixel modeling
TW582168B (en) * 2002-03-01 2004-04-01 Huper Lab Co Ltd Method for abstracting multiple moving objects
US7409092B2 (en) * 2002-06-20 2008-08-05 Hrl Laboratories, Llc Method and apparatus for the surveillance of objects in images
US6999600B2 (en) * 2003-01-30 2006-02-14 Objectvideo, Inc. Video scene background maintenance using change detection and classification
EP2408193A3 (en) * 2004-04-16 2014-01-15 James A. Aman Visible and non-visible light sensing camera for videoing and object tracking
US20060170769A1 (en) * 2005-01-31 2006-08-03 Jianpeng Zhou Human and object recognition in digital video
US7825954B2 (en) * 2005-05-31 2010-11-02 Objectvideo, Inc. Multi-state target tracking
US7796780B2 (en) 2005-06-24 2010-09-14 Objectvideo, Inc. Target detection and tracking from overhead video streams
US7596241B2 (en) * 2005-06-30 2009-09-29 General Electric Company System and method for automatic person counting and detection of specific events
US8254625B2 (en) * 2006-11-02 2012-08-28 Hyperactive Technologies, Inc. Automated service measurement, monitoring and management

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050146605A1 (en) 2000-10-24 2005-07-07 Lipton Alan J. Video surveillance system employing video primitives
US6696945B1 (en) 2001-10-09 2004-02-24 Diamondback Vision, Inc. Video tripwire
US20050104958A1 (en) 2003-11-13 2005-05-19 Geoffrey Egnal Active camera video-based surveillance systems and methods

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
C. STAUFFER; W. E. L. GRIMSON: "Learning Patterns of Activity Using Real-Time Tracking", IEEE TRANS. PAMI, vol. 22, no. 8, August 2000 (2000-08-01), pages 747 - 757, XP000976482, DOI: doi:10.1109/34.868677
GATES J. ET AL: "A real-time line extraction algorithm", PROCEEDINGS OF THE 1999 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS / ISCAS '99, vol. 4, - 30 May 1999 (1999-05-30), ORLANDO, FLORIDA, pages 68 - 71, XP010341146, DOI: doi:10.1109/ISCAS.1999.779944
R. COLLINS ET AL: "Algorithms for Cooperative Multisensor Surveillance", PROCEEDINGS OF THE IEEE, vol. 89, no. 10, October 2001 (2001-10-01), pages 1456 - 1477, XP002342459, DOI: doi:10.1109/5.959341
See also references of EP1894142A4

Also Published As

Publication number Publication date
EP1894142A4 (en) 2017-01-18
KR20080021804A (en) 2008-03-07
US20060291695A1 (en) 2006-12-28
JP2008544705A (en) 2008-12-04
CN101208710A (en) 2008-06-25
CA2611522A1 (en) 2007-01-04
US7796780B2 (en) 2010-09-14
WO2007002404A8 (en) 2007-07-12
MX2007016406A (en) 2008-03-05
EP1894142B1 (en) 2021-06-09
WO2007002404A3 (en) 2007-05-31
EP1894142A2 (en) 2008-03-05

Similar Documents

Publication Publication Date Title
US7796780B2 (en) Target detection and tracking from overhead video streams
US7801330B2 (en) Target detection and tracking from video streams
US10346688B2 (en) Congestion-state-monitoring system
US9158975B2 (en) Video analytics for retail business process monitoring
US10176384B2 (en) Method and system for automated sequencing of vehicles in side-by-side drive-thru configurations via appearance-based classification
CA2505831C (en) Method and system for tracking and behavioral monitoring of multiple objects moving through multiple fields-of-view
Snidaro et al. Video security for ambient intelligence
US20080074496A1 (en) Video analytics for banking business process monitoring
JP4369233B2 (en) Surveillance television equipment using video primitives
US9940633B2 (en) System and method for video-based detection of drive-arounds in a retail setting
US20060239506A1 (en) Line textured target detection and tracking with applications to "Basket-run" detection
WO2012038241A1 (en) Activity determination as function of transaction log
TW200818915A (en) Automatic extraction of secondary video streams
WO2006036578A2 (en) Method for finding paths in video
Lu et al. Detecting unattended packages through human activity recognition and object association
Leo et al. Real‐time smart surveillance using motion analysis
Valle et al. People counting in low density video sequences
Appiah et al. Autonomous real-time surveillance system with distributed ip cameras
Sujatha et al. An efficient motion based video object detection and tracking system
Beigh et al. Motion Aware Video Surveillance System (MAVSS)
AU2003285191B2 (en) Method and system for tracking and behavioral monitoring of multiple objects moving through multiple fields-of-view
Sim Pedestrian Counting System

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200680020567.9

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application
ENP Entry into the national phase

Ref document number: 2611522

Country of ref document: CA

ENP Entry into the national phase

Ref document number: 2008518435

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 188196

Country of ref document: IL

WWE Wipo information: entry into national phase

Ref document number: MX/a/2007/016406

Country of ref document: MX

WWE Wipo information: entry into national phase

Ref document number: 2006785442

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1020087001779

Country of ref document: KR