WO2011067790A2 - Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera - Google Patents

Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera

Info

Publication number
WO2011067790A2
Authority
WO
WIPO (PCT)
Prior art keywords
objects
image
computing
pedestrian
tracking
Prior art date
Application number
PCT/IN2010/000783
Other languages
French (fr)
Other versions
WO2011067790A3 (en)
Inventor
K. S. Chidanand
Brojeshwar Bhowmick
Original Assignee
Tata Consultancy Services Limited
Priority date
Filing date
Publication date
Application filed by Tata Consultancy Services Limited
Priority to JP2012530411A (patent JP5453538B2)
Priority to EP10822866.9A (patent EP2481012A2)
Priority to CN201080030194.XA (patent CN102511046B)
Priority to US13/380,559 (patent US8964033B2)
Publication of WO2011067790A2
Publication of WO2011067790A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103 Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/478 Contour-based spectral representations or scale-space representations, e.g. by Fourier analysis, wavelet analysis or curvature scale-space [CSS]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/56 Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/56 Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58 Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads

Definitions

  • the present invention generally relates to a system and method for detecting, classifying and tracking the pedestrian. More particularly, this invention relates to a cost effective system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle by using real time images captured by near infrared (IR) camera disposed on the vehicle.
  • IR near infrared
  • US7139411 to Fujimura et al. teaches a system and method for detecting and tracking pedestrians in low visibility conditions or otherwise.
  • a night vision camera periodically captures an infrared image of a road from a single perspective.
  • a pedestrian detection module determines a position of a pedestrian in the frame by processing the captured image.
  • the pedestrian detection module includes a support vector machine to compare information derived from the night vision camera to a training database.
  • a pedestrian tracking module estimates the movement of the detected pedestrian in subsequent frames by applying filters.
  • the tracking module uses Kalman filtering to estimate pedestrian movement at periodic times and mean-shifting to adjust the estimation.
  • US7526102 to Ibrahim Burak Ozer teaches methods and systems for providing real-time video surveillance of crowded environments.
  • the method consists of several object detection and tracking processes that may be selected automatically to track individual objects or groups of objects based on the resolution and occlusion levels in the input videos.
  • Possible objects of interest (OOI) may be humans, animals, cars, etc.
  • the invention may be used for tracking people in crowded environments or cars in heavy traffic conditions.
  • the primary objective of the present invention is to provide a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle for avoiding collision, which is simple, easy to install, and provides higher accuracy at a lower cost.
  • Another objective of the invention is to provide a systematic way of detecting the road region by estimating the ground plane using a near IR camera.
  • Still another objective of the invention is to provide a systematic way of eliminating non-ground objects based on their distance to ground using a near IR camera.
  • Still another objective of the invention is to provide a systematic way of filtering the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects using a near IR camera.
  • SNR signal to noise ratio
  • Still another objective of the invention is to provide a systematic way of eliminating the non-vertical objects from the IR images by computing the inertial moment relative to the x and y axes with respect to the centre of mass of such non-vertical objects.
  • Yet another objective of the invention is to provide a systematic way of classifying the pedestrians in the analyzed frame of the image based on their shape using a near IR camera.
  • A further objective of the invention is to provide a systematic way of tracking the movement of the classified pedestrian using the mean shift algorithm.
  • Yet another objective of the invention is to provide a system and method for detecting and tracking the pedestrians which is simple and cost effective.
  • the present invention provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision, which is simple, easy to install, and provides higher accuracy at a lower cost.
  • the present invention embodies a cost effective method for detecting, classifying and tracking the pedestrian present in front of the vehicle using images captured by a near infrared (IR) camera disposed on the vehicle, wherein the said method comprises the processor implemented steps of: detecting the road to focus attention and to filter the region of interest (ROI) objects in the said image by estimating the ground plane; eliminating the non-ground objects based on their distance to ground; filtering the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects; eliminating the non-vertical objects by computing the inertial moment relative to the x and y axes with respect to the centre of mass of such non-vertical objects; classifying the pedestrians in the analyzed frame of the image based on their shape; and tracking the movement of the classified pedestrian using the mean shift algorithm.
  • tracked data with respect to the pedestrian is further communicated by the processor to an output means.
  • an alert means warns the driver of the presence of one or more pedestrians, wherein the alert means can be an audio or audio-visual device, such as an alarm, a voice based caution, an indicator or a display.
  • near IR camera can be disposed either on the dashboard or in front or inside or top of the vehicle. In one exemplary embodiment of the invention, the near IR camera can be disposed in front of the vehicle.
  • Figure 1 is a flowchart which illustrates a method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision according to various embodiments of the invention.
  • the present invention provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision, which is simple, easy to install, and provides higher accuracy at a lower cost.
  • a cost effective system comprises a near IR camera disposed on the vehicle for capturing an image; and a processor for analyzing the captured image in real-time for detecting, classifying and tracking the pedestrian present in front of the vehicle.
  • Figure 1 is a flowchart which illustrates a method 100 for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision according to various embodiments of the invention.
  • IR camera can be disposed either on the dashboard or in front or inside or top of the vehicle.
  • the near IR camera is disposed in front of the vehicle.
  • the resolution of the near IR camera can be selected from 640*480, 720*480, etc.
  • the resolution of the near IR camera is 720*480.
  • the IR range of the near IR camera can be selected from a range of (0.7-1) to 5 microns for detecting and tracking the pedestrians.
  • the IR range of the near IR camera can be selected from a range of 0.7 to 5 microns.
  • the temperature range of the near IR camera can be selected from 740 to 3,000-5,200 Kelvin for detecting and tracking the pedestrians.
  • the processor can be disposed either in the body of the near IR camera or on the dashboard of the vehicle. In one exemplary embodiment of the invention, the processor is disposed in the body of the near IR camera. In accordance with another aspect of the invention the processor can be selected from the group of Davinci DM6446 Processor, ADSP-BF533, 750 MHz Blackfin Processor.
  • the above said cost effective method comprises various processor implemented steps.
  • first ground region needs to be estimated.
  • the processor detects the road to focus attention for filtering the region of interest (ROI) objects in the said image by estimating the ground plane.
  • ROI region of interest
  • the processor executes the following steps:
  • Image differentiation 102 is done using the Sobel operator, then the threshold of the differentiated image 104 is determined using the Otsu algorithm.
  • threshold parameters for differentiated images can be varied based on the applicability.
  • in the thresholded binary version of the differentiated image, starting at each bottom-most pixel along the width of the image, traverse upwards until there is a 0-1 transition.
  • the region 106 from the bottom-most pixel to the 0-1 transition indicates a smooth region, which indicates road. Mark such region values as 'X'. This procedure is repeated by the processor for each bottom-most pixel of the image.
  • the processor eliminates the non-ground objects based on their distance to ground.
  • the processor executes the following steps:
  • non-ROIs which are bright stripe objects 114 such as metallic sign boards, light sign boards, traffic signs, headlights, electric poles, and the poles of a guardrail. These objects are regular in shape.
  • the processor filters the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects.
  • the processor executes the following steps: For each component, it finds the exterior boundary. A boundary pixel is a pixel whose value is '1' and any one of whose 8 neighbors has pixel value '0'. Then the processor computes the centroid (x_c, y_c) for a blob. Then it computes the distance r(n) from the centroid of the blob to the shape contour.
  • Then it computes the N point DFT R(f) of r(n). Then it searches for the local maxima of the amplitudes of the frequency contents. If local maxima occur periodically, the period is computed. It is observed that objects which are regular in shape have periodic local maxima and objects which are irregular in shape have no periodic local maxima. Then the processor computes a replicated version of R
  • SNR will be very high for a regularly shaped object, whereas for irregularly shaped objects the SNR will be lower. In this way street lights, car headlights, pillars and lamp posts are easily eliminated.
  • the processor eliminates the non-vertical objects 116 by computing the inertial moment relative to the x and y axes with respect to the centre of mass of such non-vertical objects.
  • In the gray level thresholded binary image, there would still be components which are aligned more horizontally than vertically.
  • Pedestrian objects are usually vertical in nature.
  • the normalized central moments are used by the processor.
  • M_x is the inertial moment, relative to the X axis, with respect to the center of mass.
  • M_y is the inertial moment, relative to the Y axis, with respect to the center of mass.
  • M_xy is the inertial moment, relative to both the X and Y axes, with respect to the center of mass.
  • the object is vertically elongated; otherwise it is horizontally elongated and will be deleted by the processor.
  • the estimated threshold was around 3.2.
  • the processor classifies the pedestrians in the analyzed frame of the image based on their shape. Once the basic structure of the objects and a qualitative hint about pedestrians have been obtained, a more accurate check is necessary. For classification, pedestrian shape is used as a cue.
  • the processor executes the following steps: For each ROI, the processor finds the exterior boundary.
  • a boundary pixel is a pixel whose value is '1' and any one of whose 8 neighbors has pixel value '0'. Initially, the processor computes the centroid (x_c, y_c) for a blob. The boundary is then represented as a complex co-ordinate function
  • Rotation invariance of the FDs is achieved by ignoring the phase information and by taking only the magnitude values of the FDs.
  • N ⁇ is the truncated number of harmonics needed to index the shape.
  • the processor tracks 120 the movement of the classified pedestrian using mean shift algorithm.
  • the processor executes the following steps:
  • mean shift tracking is employed to track pedestrians.
  • the mean shift tracking algorithm is an appearance based tracking method and it employs the mean shift iterations to find the target candidate that is the most similar to a given model in terms of intensity distribution, with the similarity of the two distributions being expressed by a metric based on the Bhattacharyya coefficient.
  • the derivation of the Bhattacharyya coefficient from sample data involves the estimation of the target density q and the candidate density p, for which the histogram formulation is employed.
  • the target model histogram is calculated by considering the feature space.
  • the processor executes the following steps:
  • the above said method further comprises the step of warning the driver for avoiding collision characterized by use of the tracking data of the pedestrian, wherein the alert means is used for warning the driver, wherein the said alert means can be audio and audio visual means including but not limited to an alarm, a voice based caution, or an Indicator.
  • the present invention provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collisions which is easy to install and execute.
  • the system of the present invention also provides a method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision having a reasonably high accuracy as compared to the existing conventional systems.
  • the present invention also provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision which is cost effective as compared to the conventional systems.
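The Bhattacharyya-based similarity mentioned in the mean shift tracking items of this list can be illustrated with a short sketch. It is not the patent's implementation: the function names, the 16-bin intensity histogram and the unweighted (kernel-free) histogram are assumptions made only for illustration, whereas mean shift trackers typically weight pixels with a spatial kernel.

```python
import math


def intensity_histogram(pixels, bins=16):
    """Normalized intensity histogram of 8-bit pixel values (unweighted; an assumption)."""
    width = 256 // bins
    hist = [0.0] * bins
    for v in pixels:
        hist[v // width] += 1.0
    n = len(pixels)
    return [count / n for count in hist]


def bhattacharyya(p, q):
    """Bhattacharyya coefficient: sum over bins of sqrt(p_u * q_u); 1.0 means identical."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))


# Target model q and a candidate p drawn from the same intensities match perfectly.
q = intensity_histogram([10, 10, 200, 200])
p_same = intensity_histogram([200, 10, 10, 200])
p_other = intensity_histogram([100, 100, 100, 100])
similarity_same = bhattacharyya(p_same, q)
similarity_other = bhattacharyya(p_other, q)
```

In a tracker, the candidate window with the largest coefficient relative to the target model would be taken as the new pedestrian position.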

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Human Computer Interaction (AREA)
  • Traffic Control Systems (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

A cost effective method for detecting, classifying and tracking the pedestrian present in front of the vehicle by using images captured by near infrared (IR) camera disposed on the vehicle, the said method comprises the processor implemented steps of: detecting the road to focus of attention for filtering the region of interest (ROI) objects in the said image by estimating the ground plane; eliminating the non-ground objects based on their distance to ground; filtering the non-ROI objects based on the shape of such objects by computing the SNR for each of such non-ROI objects; eliminating the non-vertical objects by computing inertial moment relative to x and y axis with respect to the centre of mass of such non-vertical objects; classifying the pedestrians in the analyzed frame of the image based their shape; and tracking the movement of the classified pedestrian using mean shift algorithm.

Description

COST-EFFECTIVE SYSTEM AND METHOD FOR DETECTING, CLASSIFYING AND TRACKING THE PEDESTRIAN USING NEAR INFRARED CAMERA
FIELD OF THE INVENTION
The present invention generally relates to a system and method for detecting, classifying and tracking the pedestrian. More particularly, this invention relates to a cost effective system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle by using real time images captured by near infrared (IR) camera disposed on the vehicle.
BACKGROUND OF THE INVENTION
Road accidents involving pedestrians are far more frequent at night than during the day. Worldwide, the number of people killed in road traffic crashes each year is estimated at almost 1.2 million, while the number injured could be as high as 50 million, which is the combined population of five of the world's largest cities. Many of the people killed in such accidents are pedestrians. The most important factor is the driver's dramatically reduced range of vision. Fewer pedestrians would be killed or seriously injured if vehicles were equipped with improved pedestrian detection systems combined with driver warning strategies.
Some of the inventions known to us which deal with pedestrian detection and tracking are as follows:
US7139411 to Fujimura et al. teaches a system and method for detecting and tracking pedestrians in low visibility conditions or otherwise. A night vision camera periodically captures an infrared image of a road from a single perspective. A pedestrian detection module determines a position of a pedestrian in the frame by processing the captured image. The pedestrian detection module includes a support vector machine to compare information derived from the night vision camera to a training database. A pedestrian tracking module estimates the movement of the detected pedestrian in subsequent frames by applying filters. The tracking module uses Kalman filtering to estimate pedestrian movement at periodic times and mean-shifting to adjust the estimation.
US7526102 to Ibrahim Burak Ozer teaches methods and systems for providing real-time video surveillance of crowded environments. The method consists of several object detection and tracking processes that may be selected automatically to track individual objects or groups of objects based on the resolution and occlusion levels in the input videos. Possible objects of interest (OOI) may be humans, animals, cars, etc. The invention may be used for tracking people in crowded environments or cars in heavy traffic conditions.
US7421091 to Hiroshi Satoh teaches that outputs of pixels present around a given pixel at an image-capturing unit having a plurality of pixels disposed two-dimensionally are added to the output of the given pixel.
US5694487 to Min-Sub Lee teaches a method for determining feature point for each of the blocks based on the gradient magnitude and variance corresponding to each of the pixels therein.
Most of these known devices, systems and methods use complex methods to detect and track pedestrians and are costly. The accuracy of these methods is not adequate to detect and track the pedestrian.
Thus, in the light of the above mentioned background of the art, it is evident that, there is a need for a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle for avoiding collision, which is simple, easy to install and provides higher accuracy at a lower cost.
OBJECTIVES OF THE INVENTION
The primary objective of the present invention is to provide a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle for avoiding collision which is simple, easy to install, provides higher accuracy at a lower cost.
Another objective of the invention is to provide a systematic way of detecting the road region by estimating ground plane using near IR camera.
A further objective of the invention is to provide a systematic way of eliminating non-ground objects based on their distance to ground using a near IR camera.
Still another objective of the invention is to provide a systematic way of filtering the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects using a near IR camera.
Still another objective of the invention is to provide a systematic way of eliminating the non-vertical objects from the IR images by computing inertial moment relative to x and y axis with respect to the centre of mass of such non-vertical objects.
Yet another objective of the invention is to provide a systematic way of classifying the pedestrians in the analyzed frame of the image based on their shape using a near IR camera.
Further objective of the invention is to provide a systematic way of tracking the movement of the classified pedestrian using mean shift algorithm.
Yet another objective of the invention is to provide a system and method for detecting and tracking the pedestrians which is simple and cost effective.
SUMMARY OF THE INVENTION
Before the method, system, and hardware enablement of the present invention are described, it is to be understood that this invention is not limited to the particular systems and methodologies described, as there can be multiple possible embodiments of the present invention which are not expressly illustrated in the present disclosure. It is also to be understood that the terminology used in the description is for the purpose of describing the particular versions or embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.
The present invention provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision, which is simple, easy to install, and provides higher accuracy at a lower cost.
The present invention embodies a cost effective method for detecting, classifying and tracking the pedestrian present in front of the vehicle using images captured by a near infrared (IR) camera disposed on the vehicle, wherein the said method comprises the processor implemented steps of: detecting the road to focus attention and to filter the region of interest (ROI) objects in the said image by estimating the ground plane; eliminating the non-ground objects based on their distance to ground; filtering the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects; eliminating the non-vertical objects by computing the inertial moment relative to the x and y axes with respect to the centre of mass of such non-vertical objects; classifying the pedestrians in the analyzed frame of the image based on their shape; and tracking the movement of the classified pedestrian using the mean shift algorithm.
In one aspect of the invention tracked data with respect to the pedestrian is further communicated by the processor to an output means.
In yet another aspect of the invention an alert means warns the driver of the presence of one or more pedestrians, wherein the alert means can be an audio or audio-visual device, such as an alarm, a voice based caution, an indicator or a display.
In accordance with another aspect of the invention, near IR camera can be disposed either on the dashboard or in front or inside or top of the vehicle. In one exemplary embodiment of the invention, the near IR camera can be disposed in front of the vehicle.
BRIEF DESCRIPTION OF DRAWINGS
The foregoing summary, as well as the following detailed description of preferred embodiments, is better understood when read in conjunction with the appended drawings. For the purpose of illustrating the invention, there is shown in the drawings example constructions of the invention; however, the invention is not limited to the specific methods and apparatus disclosed in the drawings:
Figure 1 is a flowchart which illustrates a method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision according to various embodiments of the invention.
DETAILED DESCRIPTION OF THE INVENTION
Some embodiments of this invention, illustrating its features, will now be discussed in detail. The words "comprising," "having," "containing," and "including," and other forms thereof, are intended to be equivalent in meaning and be open ended in that an item or items following any one of these words is not meant to be an exhaustive listing of such item or items, or meant to be limited to only the listed item or items. It must also be noted that as used herein and in the appended claims, the singular forms "a," "an," and "the" include plural references unless the context clearly dictates otherwise. Although any systems and methods similar or equivalent to those described herein can be used in the practice or testing of embodiments of the present invention, the preferred systems and methods are now described. The disclosed embodiments are merely exemplary of the invention, which may be embodied in various forms by a person skilled in the art.
A cost effective method for detecting, classifying and tracking the pedestrian present in front of the vehicle by using images captured by near infrared (IR) camera disposed on the vehicle, the said method comprising the processor implemented steps of:
a) detecting the road to focus of attention for filtering the region of interest (ROI) objects in the said image by estimating the ground plane;
b) eliminating the non-ground objects based on their distance to ground;
c) filtering the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects;
d) eliminating the non-vertical objects by computing inertial moment relative to x and y axis with respect to the centre of mass of such non-vertical objects;
e) classifying the pedestrians in the analyzed frame of the image based on their shape; and
f) tracking the movement of the classified pedestrian using mean shift algorithm.
The present invention provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision which is simple, easy to install, provides higher accuracy at a lower cost.
According to one exemplary embodiment of the invention, a cost effective system comprises a near IR camera disposed on the vehicle for capturing an image; and a processor for analyzing the captured image in real-time for detecting, classifying and tracking the pedestrian present in front of the vehicle. Figure 1 is a flowchart which illustrates a method 100 for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision according to various embodiments of the invention.
In one embodiment of the invention, a cost effective method 100 is provided for detecting, classifying and tracking the pedestrian present in front of the vehicle by using images captured by a near infrared (IR) camera disposed on the vehicle. In accordance with another aspect of the invention, the near IR camera can be disposed either on the dashboard or in front or inside or top of the vehicle. In one example embodiment of the invention, the near IR camera is disposed in front of the vehicle.
In accordance with one aspect of the invention the resolution of the near IR camera can be selected from 640*480, 720*480, etc. In an exemplary embodiment of the invention, the resolution of the near IR camera is 720*480. In accordance with one aspect of the invention the IR range of the near IR camera can be selected from a range of (0.7-1) to 5 microns for detecting and tracking the pedestrians. In an exemplary embodiment of the invention, the IR range of the near IR camera can be selected from a range of 0.7 to 5 microns. In accordance with one aspect of the invention the temperature range of the near IR camera can be selected from 740 to 3,000-5,200 Kelvin for detecting and tracking the pedestrians.
In accordance with one aspect of the invention the processor can be disposed either in the body of the near IR camera or on the dashboard of the vehicle. In one exemplary embodiment of the invention, the processor is disposed in the body of the near IR camera. In accordance with another aspect of the invention the processor can be selected from the group of Davinci DM6446 Processor, ADSP-BF533, 750 MHz Blackfin Processor.
The above said cost effective method comprises various processor implemented steps. In the case of object detection with an on-board near IR camera, the ground region first needs to be estimated. In the first step of the proposed method, the processor detects the road to focus attention for filtering the region of interest (ROI) objects in the said image by estimating the ground plane. In order to detect the ground region, the processor executes the following steps:
Initially, image differentiation 102 is done using the Sobel operator, then the threshold of the differentiated image 104 is determined using the Otsu algorithm. According to one aspect of the invention, threshold parameters for differentiated images can be varied based on the applicability. In the thresholded binary version of the differentiated image, starting at each bottom-most pixel along the width of the image, traverse upwards until there is a 0-1 transition. The region 106 from the bottom-most pixel to the 0-1 transition indicates a smooth region, which indicates road. Mark such region values as 'X'. This procedure is repeated by the processor for each bottom-most pixel of the image.
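The road-seeding step above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the image is a list of rows of 8-bit integers, the gradient magnitude is approximated as |Gx| + |Gy| and clipped to 255, border pixels are left at 0, and the function names are hypothetical.

```python
def sobel_magnitude(img):
    """Gradient magnitude |Gx| + |Gy| from 3x3 Sobel kernels; border pixels stay 0."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            gx = (img[r - 1][c + 1] + 2 * img[r][c + 1] + img[r + 1][c + 1]
                  - img[r - 1][c - 1] - 2 * img[r][c - 1] - img[r + 1][c - 1])
            gy = (img[r + 1][c - 1] + 2 * img[r + 1][c] + img[r + 1][c + 1]
                  - img[r - 1][c - 1] - 2 * img[r - 1][c] - img[r - 1][c + 1])
            out[r][c] = min(255, abs(gx) + abs(gy))  # clip to the 8-bit range
    return out


def otsu_threshold(img):
    """Level t maximizing the between-class variance of classes <= t and > t."""
    hist = [0] * 256
    for row in img:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    sum_all = sum(i * n for i, n in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue
        if w1 == 0:
            break
        var = w0 * w1 * (sum0 / w0 - (sum_all - sum0) / w1) ** 2
        if var > best_var:
            best_t, best_var = t, var
    return best_t


def mark_road_columns(img):
    """Mark ('X' = True) pixels from the bottom of each column up to the first edge."""
    edges = sobel_magnitude(img)
    t = otsu_threshold(edges)
    h, w = len(img), len(img[0])
    road = [[False] * w for _ in range(h)]
    for c in range(w):
        r = h - 1
        while r >= 0 and edges[r][c] <= t:  # still in the smooth (binary 0) region
            road[r][c] = True
            r -= 1
    return road


# Toy frame: bright upper half, uniform darker "road" in the lower half.
frame = [[200] * 8 for _ in range(4)] + [[50] * 8 for _ in range(4)]
road_mask = mark_road_columns(frame)
```

Because border pixels carry no gradient in this sketch, the first and last columns are marked over their full height; a real implementation would handle image borders explicitly.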
In order to detect the complete ground region 108, the processor divides the image intensity into 16 bins. Traverse from the bottom-most pixel up to half of the height of the image. Those pixels which are marked as 'X' are considered as seed pixels. Choose a seed pixel, check the neighboring pixels and add them to the ground region if they are similar to the seed, where similarity is determined by the processor by computing the Euclidean distance (D). Repeat this process for each of the newly added pixels; stop if no more pixels can be added. Newly added pixels are also now marked as 'X' by the processor. This method is based on the assumption that roads have a relatively constant temperature and thus produce no edges in an edge detected image.
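The region-growing step above can be sketched as a breadth-first flood fill. Several details are assumptions made for illustration only: the feature is the 16-bin quantized intensity (so the Euclidean distance reduces to an absolute difference), each neighbor is compared with the pixel it is grown from, seeds are taken from the lower half of the image, 4-connectivity is used, and `max_dist` is a hypothetical parameter.

```python
from collections import deque


def grow_ground(img, seeds, bins=16, max_dist=0):
    """Grow the ground mask from seed pixels ('X' = True) over similar neighbors."""
    h, w = len(img), len(img[0])
    width = 256 // bins
    binned = [[v // width for v in row] for row in img]  # 16 intensity bins
    ground = [row[:] for row in seeds]
    # Seeds are collected from the bottom row up to half of the image height.
    queue = deque((r, c) for r in range(h - 1, h // 2 - 1, -1)
                  for c in range(w) if seeds[r][c])
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w and not ground[nr][nc]:
                # Scalar feature: Euclidean distance is the absolute bin difference.
                if abs(binned[nr][nc] - binned[r][c]) <= max_dist:
                    ground[nr][nc] = True  # newly added pixels are also marked
                    queue.append((nr, nc))
    return ground


# Toy frame: warm upper half (200), cooler road in the lower half (40).
frame = [[200] * 4 for _ in range(3)] + [[40] * 4 for _ in range(3)]
seed_mask = [[False] * 4 for _ in range(5)] + [[True] * 4]
ground_mask = grow_ground(frame, seed_mask)
```

With `max_dist=0`, growth stops wherever the quantized intensity changes, which matches the stated assumption that the road has a relatively constant temperature.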
In the second step of the proposed method, the processor eliminates the non-ground objects based on their distance to ground. In order to eliminate the non-ground objects, the processor executes the following steps:
Initially, the processor thresholds the original gray level image 110 using the Otsu algorithm. In the thresholded binary image, it eliminates the pixels which are marked as ' X '. It then executes connected component analysis on the binary image. Let 'Lr', 'Hr', 'Lc' and 'Hc' be the lower most row, higher most row, left most column and right most column which constitute the bounding box of a component. The processor then computes the mean (μ) and standard deviation (σ) of 'Hr' over all the components. Any component whose 'Hr' is less than (μ + σ) 112 is deleted by the processor.
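A minimal sketch of the bounding-box computation and the (μ + σ) test is given below for illustration. It assumes that row indices increase downwards, so that 'Hr' is the bottom row of a component, that components are 8-connected, and that σ is the population standard deviation; the function names are hypothetical. Ground pixels marked ' X ' would be removed beforehand, for example with `binary & ~ground`.

```python
from collections import deque
import numpy as np

def bounding_boxes(binary):
    """Bounding box (Lr, Hr, Lc, Hc) of every 8-connected component."""
    h, w = binary.shape
    seen = np.zeros((h, w), dtype=bool)
    boxes = []
    for r0, c0 in zip(*np.nonzero(binary)):
        if seen[r0, c0]:
            continue
        seen[r0, c0] = True
        queue = deque([(r0, c0)])
        box = {"Lr": r0, "Hr": r0, "Lc": c0, "Hc": c0}
        while queue:
            r, c = queue.popleft()
            box["Lr"], box["Hr"] = min(box["Lr"], r), max(box["Hr"], r)
            box["Lc"], box["Hc"] = min(box["Lc"], c), max(box["Hc"], c)
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    nr, nc = r + dr, c + dc
                    if (0 <= nr < h and 0 <= nc < w
                            and binary[nr, nc] and not seen[nr, nc]):
                        seen[nr, nc] = True
                        queue.append((nr, nc))
        boxes.append(box)
    return boxes

def drop_far_from_ground(boxes):
    """Delete components whose Hr is less than mean + standard deviation."""
    hr = np.array([b["Hr"] for b in boxes], dtype=float)
    limit = hr.mean() + hr.std()
    return [b for b in boxes if b["Hr"] >= limit]

# Usage: three blobs high in the frame and one blob reaching down to row 90.
binary = np.zeros((100, 20), dtype=bool)
binary[8:11, 1:3] = True
binary[9:13, 5:7] = True
binary[10:12, 9:11] = True
binary[80:91, 14:17] = True
boxes = bounding_boxes(binary)
kept = drop_far_from_ground(boxes)
```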
After the above mentioned steps have been performed, there would still be many non-ROIs which are bright stripe objects 114, such as metallic sign boards, light sign boards, traffic signs, headlights, electric poles, and the poles of a guardrail. These objects are regular in shape.
In the third step of the proposed method, the processor filters the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects. In order to detect these regular shapes, the processor executes the following steps: For each component, it finds the exterior boundary. A boundary pixel is a pixel whose value is '1' and which has at least one of its 8 neighbors with pixel value '0'. The processor then computes the centroid $(x_c, y_c)$ of the blob, and then the distance $r(n)$ from the centroid of the blob to the shape contour, where $(x(n), y(n))$ is the n-th boundary point:
$$r(n) = \sqrt{(x(n) - x_c)^2 + (y(n) - y_c)^2}$$
The processor then computes the N point DFT $R(f)$ of $r(n)$ and searches for the local maxima of the amplitudes of the frequency content. If the local maxima occur periodically, their period is computed. It is observed that objects which are regular in shape have periodic local maxima, whereas objects which are irregular in shape have no periodic local maxima. The processor then computes a replicated version $\hat{R}(f)$ of $R(f)$:
Figure imgf000010_0002
Where M is the periodicity.
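In the next step, the processor computes the N point IDFT of the replicated spectrum $\hat{R}(f)$ to obtain $\hat{r}(n)$, and then computes the error signal $e(n) = r(n) - \hat{r}(n)$. The processor then computes the signal to noise ratio (SNR),
$$\mathrm{SNR} = 10 \log(S_r / S_e)$$
where $S_r = \sum_n (r[n])^2$ and $S_e = \sum_n (e[n])^2$.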
The SNR will be very high for a regular shaped object, whereas for irregular shaped objects the SNR will be lower. In this way street lights, car headlights, pillars and lamp posts are easily eliminated.
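The SNR test can be illustrated with the following sketch. The replication formula is not legible in this text, so the sketch substitutes a simple stand-in: the strongest non-DC harmonic is taken as the periodicity M, and only the DC term and the harmonics at multiples of M are kept before the inverse transform. The centroid is approximated by the mean of the boundary points, a base-10 logarithm is used, and the small constant `eps` only avoids division by zero. All names are hypothetical.

```python
import numpy as np

def centroid_distance(boundary):
    """r(n): distance from the centroid to each boundary point (N x 2 array of x, y)."""
    pts = np.asarray(boundary, dtype=float)
    return np.linalg.norm(pts - pts.mean(axis=0), axis=1)

def shape_snr(r, eps=1e-12):
    """SNR in dB between r(n) and its reconstruction from periodic harmonics."""
    r = np.asarray(r, dtype=float)
    n = len(r)
    spectrum = np.fft.fft(r)                       # N point DFT R(f)
    mags = np.abs(spectrum[1 : n // 2 + 1])
    period = int(np.argmax(mags)) + 1              # assumed estimate of M
    k = np.arange(n)
    keep = np.minimum(k, n - k) % period == 0      # DC, multiples of M and their mirrors
    r_hat = np.fft.ifft(np.where(keep, spectrum, 0)).real
    e = r - r_hat                                  # error signal e(n)
    return 10 * np.log10(np.sum(r ** 2) / (np.sum(e ** 2) + eps))

# Usage: a signature with one repeating lobe pattern versus a mixed one.
n = np.arange(64)
regular = 10 + 2 * np.cos(2 * np.pi * 4 * n / 64)
irregular = 10 + np.cos(2 * np.pi * 3 * n / 64) + 1.5 * np.cos(2 * np.pi * 5 * n / 64)
snr_regular, snr_irregular = shape_snr(regular), shape_snr(irregular)
```

A component would then be discarded as a regular, non-pedestrian shape when its SNR exceeds a calibrated limit; that limit is not given in the text and would have to be chosen empirically.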
In the fourth step of the proposed method, the processor eliminates the non-vertical objects 116 by computing the inertial moment relative to the x and y axes with respect to the centre of mass of such non-vertical objects. In the gray level thresholded binary image, there would still be components which are aligned more horizontally than vertically. Pedestrian objects are usually vertical in nature. In order to detect vertically elongated objects 118, the normalized central moments are used by the processor.
[Equations for the normalized central moments $M_x$, $M_y$ and $M_{xy}$: double sums over the blob pixels of terms in $(x - x_c)$ and $(y - y_c)$, weighted by the pixel value $I(x, y)$ and normalized by the blob area; shown in the original as Figure imgf000010_0003.]
Where $M_x$ is the inertial moment, relative to the X axis, with respect to the center of mass.
Where $M_y$ is the inertial moment, relative to the Y axis, with respect to the center of mass.
Where $M_{xy}$ is the inertial moment, relative to both the X and Y axes, with respect to the center of mass.
Since pedestrians are more vertical in nature, $M_y > M_x$.
If $M_y / M_x > th$, then the object is vertically elongated; otherwise it is horizontally elongated and is deleted by the processor. In one exemplary embodiment of the invention, the estimated threshold was around 3.2.
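The vertical elongation test can be sketched as follows. Because the moment equations are not legible in this text, the sketch assumes that M_x and M_y are the area-normalized second central moments of the blob along the x and y directions respectively, which makes the ratio M_y / M_x large for upright blobs. The function name and the `eps` guard are hypothetical; the default threshold of 3.2 is the value quoted for the exemplary embodiment.

```python
import numpy as np

def is_vertically_elongated(blob, th=3.2, eps=1e-9):
    """True when the blob spreads much more along y (rows) than along x (columns)."""
    ys, xs = np.nonzero(blob)
    xc, yc = xs.mean(), ys.mean()            # centre of mass of the binary blob
    m_x = np.mean((xs - xc) ** 2)            # assumed M_x: spread along the x direction
    m_y = np.mean((ys - yc) ** 2)            # assumed M_y: spread along the y direction
    return bool(m_y / (m_x + eps) > th)

# Usage: an upright 20 x 4 blob and the same blob lying on its side.
upright = np.zeros((30, 30), dtype=bool)
upright[5:25, 10:14] = True
lying = upright.T
```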
In the fifth step of the proposed method, the processor classifies the pedestrians in the analyzed frame of the image based on their shape. Once the basic structure of the objects and a qualitative hint about pedestrians have been obtained, a more accurate check is necessary. For classification, pedestrian shape is used as a cue.
In order to classify the pedestrians in the analyzed frame of the image, the processor executes the following steps: For each ROI, the processor finds the exterior boundary. A boundary pixel is a pixel whose value is '1' and which has at least one of its 8 neighbors with pixel value '0'. Initially, the processor computes the centroid $(x_c, y_c)$ of the blob and represents the boundary as a complex co-ordinate function
$$Z(n) = [x(n) - x_c] + i\,[y(n) - y_c]$$
This shift makes the shape representation invariant to translation. Object shapes and model shapes can have different sizes. Consequently, the number of data points of the object and model representations will also be different. To avoid this problem, the shape boundary of objects and models must be sampled to have the same number of data points.
a) Assume K is the total number of candidate points to be sampled along the shape boundary. The equal angle sampling selects candidate points spaced at equal angle
$$\theta = 2\pi / K$$
b) The Fourier descriptor (FD) is obtained by computing a 32 point FFT on the complex co-ordinate function:
$$a(u) = \sum_{k=0}^{K-1} Z(k) \exp(-j 2\pi u k / K)$$
c) Rotation invariance of the FDs is achieved by ignoring the phase information and by taking only the magnitude values of the FDs.
d) Scale invariance is then obtained by dividing the magnitude values of the first half of the FDs by the DC component:
$$f = \left[ \frac{|FD_1|}{|FD_0|}, \frac{|FD_2|}{|FD_0|}, \ldots, \frac{|FD_{K/2}|}{|FD_0|} \right]$$
e) Now, for a model shape indexed by FD feature $f_m = [f_m^1, f_m^2, \ldots, f_m^{N_c}]$ and a data shape indexed by FD feature $f_d = [f_d^1, f_d^2, \ldots, f_d^{N_c}]$, since both features are normalized as to translation, rotation and scale, the Euclidean distance between the two feature vectors can be used as the similarity measurement:
$$d = \left( \sum_{i=1}^{N_c} \left| f_m^i - f_d^i \right|^2 \right)^{1/2}$$
where $N_c$ is the truncated number of harmonics needed to index the shape.
$$\text{Pedestrian} = \begin{cases} 1, & d < \text{threshold} \\ 0, & d > \text{threshold} \end{cases}$$
Finally, pedestrians are classified based on the above comparison in the image.
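The classification steps above can be illustrated by the following sketch, which is not the claimed implementation. Two choices are assumptions of the sketch: the centroid is approximated by the mean of the boundary points, and the magnitudes are normalized by the first harmonic |FD_1| rather than by the DC component, because subtracting the centroid drives the DC term of the complex co-ordinate function towards zero and dividing by it would be numerically unstable. Function names and the threshold value used in the example are hypothetical.

```python
import numpy as np

def fourier_descriptor(boundary, k=32):
    """Translation, rotation and scale normalised FD feature of a closed boundary."""
    pts = np.asarray(boundary, dtype=float)
    # Complex co-ordinate function Z(n), shifted to the centroid.
    z = (pts[:, 0] - pts[:, 0].mean()) + 1j * (pts[:, 1] - pts[:, 1].mean())
    # Equal angle sampling: for each of K target angles pick the nearest boundary point.
    targets = 2 * np.pi * np.arange(k) / k
    diff = np.abs(np.angle(np.exp(1j * (np.angle(z)[None, :] - targets[:, None]))))
    samples = z[diff.argmin(axis=1)]
    mags = np.abs(np.fft.fft(samples))      # magnitudes only: phase is discarded
    return mags[2 : k // 2 + 1] / mags[1]   # substitution: divide by |FD_1|, not |FD_0|

def shape_distance(f_model, f_data):
    """Euclidean distance between two FD feature vectors."""
    return float(np.linalg.norm(np.asarray(f_model) - np.asarray(f_data)))

def is_pedestrian(f_model, f_data, threshold):
    return shape_distance(f_model, f_data) < threshold

# Usage: an ellipse, the same ellipse scaled and translated, and a circle.
t = 2 * np.pi * np.arange(3200) / 3200
ellipse = np.column_stack([np.cos(t), 2 * np.sin(t)])
moved = 2.5 * ellipse + np.array([40.0, -7.0])
circle = np.column_stack([np.cos(t), np.sin(t)])
f_e, f_m, f_c = (fourier_descriptor(s) for s in (ellipse, moved, circle))
```

In this example the scaled and shifted ellipse stays close to the original in feature space, while the circle lies much further away, which is the behaviour the threshold comparison relies on.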
In the final step of the proposed method, the processor tracks 120 the movement of the classified pedestrian using mean shift algorithm. In order to track the movement of the classified pedestrian, the processor executes the following steps:
After locating pedestrians in the current frame, mean shift tracking is employed from the next frame onwards to track the pedestrians. The mean shift tracking algorithm is an appearance based tracking method; it employs mean shift iterations to find the target candidate that is most similar to a given model in terms of intensity distribution, with the similarity of the two distributions being expressed by a metric based on the Bhattacharyya coefficient. The derivation of the Bhattacharyya coefficient from sample data involves the estimation of the target density q and the candidate density p, for which the histogram formulation is employed.
First, taking the centroid of the pedestrian blob as the centre $y_0$, the target model histogram is calculated by considering the feature space.
Compute the 32 bin histogram q on the edge based thresholded image. Target model: $q = \{q_u\},\ u = 1, 2, 3, \ldots, 32$. From the next frame onwards, the centre of the target is initialized at its previous location ($y_0$) and the target candidate histogram is calculated by considering the same feature space.
Update the 32 bin histogram p on the edge based thresholded image.
Now the distance between the target model and the target candidate histogram is calculated, $d(y) = \sqrt{1 - \rho[p(y), q]}$, where $\rho[\,]$ is the Bhattacharyya coefficient between p and q. The displacement of the target centre is calculated by the weighted mean
$$y_1 = \frac{\sum_i x_i\, w_i}{\sum_i w_i}, \quad \text{where } w_i = \sqrt{\frac{q_u}{p_u(y_0)}}$$
with $x_i$ the pixel locations in the candidate window and u the histogram bin of the pixel at $x_i$.
Once the new location of the target is found, the processor executes the following steps:
a. Computes the target candidate histogram at the new location with the same feature space involving histogram equalized image range and bottom hat transformed image.
b. Computes $\rho[p(y_1), q]$.
c. While $\rho[p(y_1), q] < \rho[p(y_0), q]$, do $y_1 \leftarrow (y_0 + y_1)/2$ and evaluate $\rho[p(y_1), q]$.
d. If $\|y_1 - y_0\| < \varepsilon$, stop.
e. Otherwise set $y_0 \leftarrow y_1$, derive the weights, then the new location, and return to the first step.
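For illustration only, the tracking loop described above could be sketched as below. The sketch makes several simplifying assumptions that are not in the specification: histograms are built on raw 8-bit intensities rather than on the edge based thresholded, histogram equalized or bottom hat transformed images; a uniform kernel over a square window of half-width `half` is used; and the stopping tolerance `eps` and iteration cap `max_iter` are hypothetical parameters.

```python
import numpy as np

N_BINS = 32

def window(image, center, half):
    """Return (top row, left column, pixels) of the square window around center."""
    r, c = (int(np.floor(v + 0.5)) for v in center)
    r0, c0 = max(r - half, 0), max(c - half, 0)
    return r0, c0, image[r0 : r + half + 1, c0 : c + half + 1]

def histogram(pixels):
    """Normalised 32 bin histogram of 8-bit pixel values."""
    hist, _ = np.histogram(pixels, bins=N_BINS, range=(0, 256))
    return hist / max(hist.sum(), 1)

def rho(p, q):
    """Bhattacharyya coefficient between two normalised histograms."""
    return float(np.sum(np.sqrt(p * q)))

def similarity(image, q, y, half):
    return rho(histogram(window(image, y, half)[2]), q)

def shift(image, q, y0, half):
    """Weighted mean of pixel positions with weights sqrt(q_u / p_u(y0))."""
    r0, c0, win = window(image, y0, half)
    p = histogram(win)
    bins = (win.astype(int) * N_BINS) // 256
    w = np.sqrt(q[bins] / np.maximum(p[bins], 1e-12))
    if w.sum() == 0:                       # no overlap with the target model
        return np.asarray(y0, dtype=float)
    rows, cols = np.mgrid[r0 : r0 + win.shape[0], c0 : c0 + win.shape[1]]
    return np.array([(rows * w).sum(), (cols * w).sum()]) / w.sum()

def track(image, q, y0, half, eps=0.5, max_iter=20):
    """Iterate the mean shift update until the centre moves by less than eps."""
    y0 = np.asarray(y0, dtype=float)
    for _ in range(max_iter):
        y1 = shift(image, q, y0, half)
        # Halve the step while the similarity at y1 is worse than at y0.
        while (similarity(image, q, y1, half) < similarity(image, q, y0, half)
               and np.linalg.norm(y1 - y0) >= eps):
            y1 = (y0 + y1) / 2
        if np.linalg.norm(y1 - y0) < eps:
            return y1
        y0 = y1
    return y0

# Usage: a bright 7 x 7 target centred at (21, 23); tracking starts two pixels off.
image = np.zeros((40, 40), dtype=np.uint8)
image[18:25, 20:27] = 200
q = histogram(window(image, (21, 23), 3)[2])   # target model
found = track(image, q, (19, 21), half=3)
```

A kernel-weighted histogram, as used in the published mean shift tracker, would make the similarity surface smoother than the uniform window used here.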
The above said method further comprises the step of warning the driver to avoid collision, characterized by use of the tracking data of the pedestrian, wherein an alert means is used for warning the driver, and wherein the said alert means can be audio and audio visual means including, but not limited to, an alarm, a voice based caution, or an indicator.
The preceding description has been presented with reference to various embodiments of the invention. Persons skilled in the art and technology to which this invention pertains will appreciate that alterations and changes in the described process and methods of operation can be practiced without meaningfully departing from the principle, spirit and scope of this invention.
ADVANTAGES OF THE INVENTION
A system and method as proposed in the present invention have the following advantages:
1. The present invention provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collisions which is easy to install and execute.
2. The system of the present invention also provides a method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision having a reasonably high accuracy as compared to the existing conventional systems.
3. The present invention also provides a system and method for detecting, classifying and tracking the pedestrian present in front of the vehicle while driving for avoiding collision which is cost effective as compared to the conventional systems.

Claims

WE CLAIM
1 ) A cost effective method for detecting, classifying and tracking the pedestrian present in front of the vehicle by using images captured by near infrared (IR) camera disposed on the vehicle, the said method comprising the processor implemented steps of:
a) detecting the road to focus attention for filtering the region of interest (ROI) objects in the said image by estimating the ground plane;
b) eliminating the non-ground objects based on their distance to ground;
c) filtering the non-ROI objects based on the shape of such objects by computing the signal to noise ratio (SNR) for each of such non-ROI objects;
d) eliminating the non-vertical objects by computing inertial moment relative to x and y axis with respect to the centre of mass of such non-vertical objects;
e) classifying the pedestrians in the analyzed frame of the image based on their shape; and
f) tracking the movement of the classified pedestrian using mean shift algorithm.
2) The method of claim 1 , further comprising the step for warning the driver for avoiding collision characterized by use of the tracking data of the pedestrian.
3) The method of claim 1 , wherein the near IR camera can be disposed either on the dashboard or in front or inside or top of the vehicle.
4) The method of claim 1 , wherein the ground plane is estimated by the processor implemented steps of:
a) differentiating the captured IR image using sobel operator;
b) applying otsu algorithm on the differentiated image to threshold the differentiated image;
c) detecting the road by identifying a smooth region, wherein the region from bottom most pixel to 0-1 transition indicates smooth region, subsequently marking the smooth region value as ' X ', and repeating the above said marking for each bottom most pixel image; and
d) dividing the image intensity into 16 bins, then traversing from bottom most pixel up to half of the height of the image, subsequently considering the marked pixels as seed pixel then choosing the seed pixel, subsequently checking the neighboring pixels and adding them to the ground region if they are similar to the seed pixel by computing Euclidean Distance (D) then repeating the above said process for each of the newly added pixels, stopping the process if no more pixels can be added and finally marking the newly added pixels as ' X '.
5) The method of claim 1 , wherein the non-ground objects are eliminated by the processor implemented steps of:
a) applying otsu algorithm on the original gray level image to threshold the original gray level image;
b) neglecting the marked pixels in the thresholded binary image;
c) running the connected component analysis on the unmarked pixels in the thresholded binary image, wherein the lower most row, higher most row, left most column, right most column which constitutes the boundary box of a component of the image; and
d) computing mean and standard deviation on the higher most row of all the components and subsequently deleting the components, if the value of the higher most row of the component is less than value of sum of mean and standard deviation.
6) The method of claim 1 , wherein the signal to noise ratio (SNR) is computed by the processor implemented steps of:
a) determining the exterior boundary for each component;
b) computing centroid for a blob;
c) computing distance from centroid of the blob to shape contour;
d) computing N point DFT of the computed distance on the shape contour;
e) searching the local maxima of amplitudes of frequency contents and subsequently computing the period if one or more local maximas occurs periodically;
f) computing replicated version of N point DFT of the computed distance;
g) computing N point IDFT on computed distance on the shape contour;
h) computing error signal; and
i) finally computing signal to noise ratio(SNR) of the each component from the error-signal.
7) The method of claim 1 , wherein the classifying of pedestrians is done by the processor implemented steps of:
a) Finding out the exterior boundary for each ROI;
b) Computing the centroid for a blob;
c) Representing the boundary as a complex co-ordinate function, wherein this shift makes the shape representation invariant to translation;
d) Sampling the shape boundary of objects and models to have the same number of data points;
e) comparing the data values of the shape boundary of objects and models; and
f) classifying pedestrians based on the above comparison.
8) A cost effective system for detecting, classifying and tracking the pedestrian present in front of the vehicle by using a near infrared (IR) camera, the said system comprises of:
a) a near IR camera disposed on the vehicle for capturing an image; and
b) a processor for analyzing the captured image in real-time for detecting, classifying and tracking the pedestrian present in front of the vehicle.
9) The system of claim 8, further comprises an alert means for warning the driver for avoiding collision characterized by use of the tracking data of the pedestrian.
PCT/IN2010/000783 2009-12-02 2010-12-02 Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera WO2011067790A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2012530411A JP5453538B2 (en) 2009-12-02 2010-12-02 Cost-effective system and method for detecting, classifying, and tracking pedestrians using near infrared cameras
EP10822866.9A EP2481012A2 (en) 2009-12-02 2010-12-02 Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera
CN201080030194.XA CN102511046B (en) 2009-12-02 2010-12-02 Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera
US13/380,559 US8964033B2 (en) 2009-12-02 2010-12-02 Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN2785/MUM/2009 2009-12-02
IN2785MU2009 2009-12-02

Publications (2)

Publication Number Publication Date
WO2011067790A2 true WO2011067790A2 (en) 2011-06-09
WO2011067790A3 WO2011067790A3 (en) 2011-10-06

Family

ID=44115376

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2010/000783 WO2011067790A2 (en) 2009-12-02 2010-12-02 Cost-effective system and method for detecting, classifying and tracking the pedestrian using near infrared camera

Country Status (5)

Country Link
US (1) US8964033B2 (en)
EP (1) EP2481012A2 (en)
JP (1) JP5453538B2 (en)
CN (1) CN102511046B (en)
WO (1) WO2011067790A2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105261017A (en) * 2015-10-14 2016-01-20 长春工业大学 Method for extracting regions of interest of pedestrian by using image segmentation method on the basis of road restriction
US10789727B2 (en) 2017-05-18 2020-09-29 Panasonic Intellectual Property Corporation Of America Information processing apparatus and non-transitory recording medium storing thereon a computer program
CN112288765A (en) * 2020-10-30 2021-01-29 西安科技大学 Image processing method for vehicle-mounted infrared pedestrian detection and tracking

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8121361B2 (en) 2006-05-19 2012-02-21 The Queen's Medical Center Motion tracking system for real time adaptive imaging and spectroscopy
US8890951B2 (en) * 2008-04-24 2014-11-18 GM Global Technology Operations LLC Clear path detection with patch smoothing approach
WO2013032933A2 (en) 2011-08-26 2013-03-07 Kinecticor, Inc. Methods, systems, and devices for intra-scan motion correction
US8948449B2 (en) * 2012-02-06 2015-02-03 GM Global Technology Operations LLC Selecting visible regions in nighttime images for performing clear path detection
US9412025B2 (en) * 2012-11-28 2016-08-09 Siemens Schweiz Ag Systems and methods to classify moving airplanes in airports
US9717461B2 (en) 2013-01-24 2017-08-01 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
US9305365B2 (en) 2013-01-24 2016-04-05 Kineticor, Inc. Systems, devices, and methods for tracking moving targets
US10327708B2 (en) 2013-01-24 2019-06-25 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
US9782141B2 (en) 2013-02-01 2017-10-10 Kineticor, Inc. Motion tracking system for real time adaptive motion compensation in biomedical imaging
US10004462B2 (en) 2014-03-24 2018-06-26 Kineticor, Inc. Systems, methods, and devices for removing prospective motion correction from medical imaging scans
CN103902976B (en) * 2014-03-31 2017-12-29 浙江大学 A kind of pedestrian detection method based on infrared image
CN106714681A (en) 2014-07-23 2017-05-24 凯内蒂科尔股份有限公司 Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
CN105740862A (en) * 2014-10-27 2016-07-06 江苏慧眼数据科技股份有限公司 Pedestrian contour detection method based on macro feature point description
CN104778725A (en) * 2015-04-14 2015-07-15 长兴泛亚照明电器有限公司 Underground garage for adaptive background update and tunnel moving target detection method
US9943247B2 (en) 2015-07-28 2018-04-17 The University Of Hawai'i Systems, devices, and methods for detecting false movements for motion correction during a medical imaging scan
US20170032676A1 (en) * 2015-07-30 2017-02-02 Illinois Institute Of Technology System for detecting pedestrians by fusing color and depth information
WO2017091479A1 (en) 2015-11-23 2017-06-01 Kineticor, Inc. Systems, devices, and methods for tracking and compensating for patient motion during a medical imaging scan
CN106845344B (en) * 2016-12-15 2019-10-25 重庆凯泽科技股份有限公司 Demographics' method and device
CN106952293B (en) * 2016-12-26 2020-02-28 北京影谱科技股份有限公司 Target tracking method based on nonparametric online clustering
KR20190062635A (en) * 2017-11-15 2019-06-07 전자부품연구원 Object tracking apparatus and object tracking method
CN109961487A (en) * 2017-12-14 2019-07-02 通用电气公司 Radiotherapy localization image-recognizing method, computer program and computer storage medium
US11062608B2 (en) 2018-05-11 2021-07-13 Arnold Chase Passive infra-red pedestrian and animal detection and avoidance system
US10467903B1 (en) 2018-05-11 2019-11-05 Arnold Chase Passive infra-red pedestrian detection and avoidance system
US11294380B2 (en) 2018-05-11 2022-04-05 Arnold Chase Passive infra-red guidance system
US10750953B1 (en) 2018-05-11 2020-08-25 Arnold Chase Automatic fever detection system and method
JP2020086756A (en) * 2018-11-21 2020-06-04 富士ゼロックス株式会社 Autonomous mobile device and program
CN109800683A (en) * 2018-12-30 2019-05-24 昆明物理研究所 A kind of infrared pedestrian detection method and device based on FPGA
CN111126178B (en) * 2019-12-05 2023-07-04 大连民族大学 Continuous distance estimation method for infrared-visible light binocular pedestrian body multi-component fusion

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5694487A (en) 1995-03-20 1997-12-02 Daewoo Electronics Co., Ltd. Method and apparatus for determining feature points
US7139411B2 (en) 2002-06-14 2006-11-21 Honda Giken Kogyo Kabushiki Kaisha Pedestrian detection and tracking with night vision
US7421091B2 (en) 2003-05-20 2008-09-02 Nissan Motor Co., Ltd. Image-capturing apparatus
US7526102B2 (en) 2005-09-13 2009-04-28 Verificon Corporation System and method for object tracking and activity analysis

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3263311B2 (en) * 1996-04-24 2002-03-04 ニッタン株式会社 Object detection device, object detection method, and object monitoring system
JP3922245B2 (en) * 2003-11-20 2007-05-30 日産自動車株式会社 Vehicle periphery monitoring apparatus and method
JP3903048B2 (en) * 2004-04-26 2007-04-11 キヤノン株式会社 Image extraction method
ES2259543B1 (en) * 2005-02-04 2007-11-16 Fico Mirrors, S.A. SYSTEM FOR THE DETECTION OF OBJECTS IN A FRONT EXTERNAL AREA OF A VEHICLE, APPLICABLE TO INDUSTRIAL VEHICLES.
JP4777196B2 (en) * 2006-09-11 2011-09-21 川崎重工業株式会社 Driving assistance device
JP4813304B2 (en) * 2006-09-19 2011-11-09 本田技研工業株式会社 Vehicle periphery monitoring device

Also Published As

Publication number Publication date
CN102511046A (en) 2012-06-20
JP5453538B2 (en) 2014-03-26
CN102511046B (en) 2015-03-25
US20120229643A1 (en) 2012-09-13
WO2011067790A3 (en) 2011-10-06
JP2013505509A (en) 2013-02-14
US8964033B2 (en) 2015-02-24
EP2481012A2 (en) 2012-08-01

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201080030194.X

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10822866

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2010822866

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2012530411

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 13380559

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE