WO2003001810A1 - Invariant filters - Google Patents

Invariant filters

Info

Publication number
WO2003001810A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
area
operator
filter
sensor
Prior art date
Application number
PCT/SE2002/001187
Other languages
French (fr)
Inventor
Anders Heyden
Original Assignee
Wespot Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wespot Ab
Publication of WO2003001810A1

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19602Image analysis to detect motion of the intruder, e.g. by frame subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/254Analysis of motion involving subtraction of images
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19602Image analysis to detect motion of the intruder, e.g. by frame subtraction
    • G08B13/19604Image analysis to detect motion of the intruder, e.g. by frame subtraction involving reference image or background adaptation with time to compensate for changing conditions, e.g. reference image update on detection of light level change
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19678User interface
    • G08B13/1968Interfaces for setting up or customising the system
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19678User interface
    • G08B13/19691Signalling events for better perception by user, e.g. indicating alarms by making display brighter, adding text, creating a sound
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • H04N7/188Capturing isolated or intermittent images triggered by the occurrence of a predetermined event, e.g. an object reaching a predetermined position
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/14Picture signal circuitry for video frequency region
    • H04N5/147Scene change detection

Definitions

  • the present invention relates to a method for detection of a change in the scene in an area. It also relates to a computer program, a monitoring system and use of an operator.
  • Image processing is used today to detect various types of object within a large number of applications, such as in monitoring to detect whether there is an object, such as a person, inside a monitored area.
  • a sensor records images of the monitored area.
  • the image processing can be used in a monitoring system to prevent break-ins. If an intruder is falsely detected, resulting in a false alarm, it can be very costly if, for example, the police or other security personnel are informed and come to the site as a result of the false alarm.
  • In order to detect whether there is a foreign object within a monitored area, a sensor records the incident intensity as greyscale values in a digital image of the monitored area. The recorded image is then compared with a reference image.
  • the reference image can, for example, be the immediately preceding image or an image taken at a time when there was no foreign object within the area.
  • the monitored area can be said to be a scene that consists of a number of surfaces with reflectance properties.
  • a change in the scene results in a change of the set of surfaces of the recorded image, for example by an object coming into the monitored area or moving in the monitored area, between when the reference image was recorded and when the current image was recorded.
  • a change in the lighting means that the incident light in the scene is changed while the set of surfaces is unchanged, for example by the sun going behind a cloud or by a lamp being switched on.
  • US 5,956,424 and US 5,937,092 describe a method in video monitoring in which an attempt is made to separate the changes in the lighting from the changes in the scene. This is carried out by attempting to model the intensity of the light that radiates from the surfaces in the scene, in order to filter out changes in the lighting from changes of the actual scene.
  • Picture elements or pixels can be said to be another name for elements in the matrix that represents the digital image.
  • the quotient is a measure that only depends on the reflectance r of a surface and is independent of the irradiance k.
  • a new image is created, the different element values of which only reflect the reflectance in the associated pixel, and then this image is compared with a reference image in which the reflectance of each pixel is calculated under the assumption that the changes in the lighting are proportionally linear. If the change in the lighting is the same in the whole image, the curve will look the same for all pixels.
  • the inclination of the curve represents the reflectance.
  • the quotient is thus independent of k.
  • Proportionally linear changes in the intensities that this model represents occur when the light is reflected against a Lambertian surface. This is a matt surface, which when it is illuminated, radiates equally in all directions and does not give rise to any reflection. With this modelling and this method, the probability is increased of a detected change being due to a change in the scene. However, many changes in the lighting are still detected as changes in the scene, which can cause costly false alarms. If the light intensity is measured, in reality a curve is obtained, which is not a proportionally linear curve.
  • NVD (Normalized Vector Distance)
  • the first block can have the value (30,30) and the second block (50,50). These vectors have the same direction and it is therefore decided that the change is due to a change in the lighting. If the direction differs over a particular threshold limit, it is decided that the change is a change in the scene.
  • a measure is obtained that is invariant for proportionally linear changes in the intensities.
  • invariant means in this connection that the angle between the vectors of the reference image and the current image is the same, irrespective of proportionally linear transformations of greyscale values in the current image.
  • problems with noise in dark areas also still remain, as a vector is defined with components consisting of the intensities in a square that comprises a number of pixels and then this is normalized. If the vector, for example, consists of a 4-dimensional vector with small components, for example (2,5,4,4) and the reference images contain noise with a maximum of 2 intensity levels, the direction of this vector can vary considerably, which may result in false alarms.
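  • As a minimal sketch of the NVD comparison and of the dark-area noise problem described above, the angle between two block vectors can be computed as follows. The helper name `angle_between`, the noisy vector (4, 3, 6, 2) and the bright comparison vectors are illustrative assumptions, not taken from the source.

```python
import math

def angle_between(u, v):
    """Angle in radians between two intensity vectors (hypothetical helper)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Clamp to [-1, 1] to guard against rounding before acos.
    cos_angle = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.acos(cos_angle)

# Proportional lighting change: same direction, angle essentially zero.
lighting_angle = angle_between((30, 30), (50, 50))

# Dark block: noise of at most 2 intensity levels per pixel (assumed sample).
dark_angle = angle_between((2, 5, 4, 4), (4, 3, 6, 2))

# The same noise added to a bright block (assumed values) barely moves the direction.
bright_angle = angle_between((102, 105, 104, 104), (104, 103, 106, 102))
```

  • In this sketch the dark block turns by a far larger angle than the bright block for the same noise, which is the effect that can cause false alarms.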
  • An object of the present invention is thus to provide a method of image processing that can detect changes in the scene with greater reliability. More specifically, the method can discriminate between changes caused by lighting conditions and changes caused by scene conditions.
  • this comprises a method in image processing for detection of a change in the scene in an area, comprising the steps of recording a digital image of the area with a sensor, transforming the recorded image by an operator that is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image, and comparing the transformed image with a reference image of the area in order to detect a difference that indicates a change in the scene.
  • the invention is based on an analysis of changes in the lighting that shows that changes in the intensities in the images do not only depend upon changes in the lighting of the scene and upon the reflective properties of the surfaces, but also upon how the sensor depicts intensities of greyscale values.
  • the number of false alarms is reduced, as these changes in the lighting are removed by the transformation before the comparison between the transformed recorded image and the reference image. False alarms are costly and with a reduction in the number of false alarms, the cost of the system that uses this method is also reduced.
  • a further advantage is that different sensor settings can be made, such as changes in amplification and aperture, without the risk that these will be detected as changes in the scenes in the area.
  • This method can be used for various types of monitoring, such as monitoring for intruders.
  • the method can also be used with manufacturing processes for inspecting various components in order to detect defects in the manufactured product.
  • the reference image is also transformed by said operator.
  • the transformed image is compared with a reference image that has been transformed according to the same method.
  • An advantage of this method is that the transformed reference image and the transformed recorded image will differ in the event of a change in the scene, but are the same for changes in lighting intensity for which the operator is modelled. In this way, the number of false alarms is reduced.
  • the operator is invariant with regard to transformations of said greyscale values in the recorded image.
  • the advantage of the operator being invariant is that the transformed image is the same even if there is a change in the greyscale values arising from a change in lighting, such as that for which the operator is modelled. This means that when the transformed recorded image is compared with a reference image transformed in the same way, the differences that are detected will indicate changes in the scene and not the above-mentioned changes in greyscale values resulting from a change in the lighting of the scene.
  • the operator is an affine invariant.
  • This embodiment is based on the idea of transforming the image and calculating an affine invariant measure that is constant over affine changes in intensity.
  • the advantage of using affine functions is that a measure is obtained that is unchanged by affine changes in intensity, but is changed by a change in the scene.
  • the operator is invariant for affine transformations.
  • the sensor can have range 0 - 255, which is a measurement range within which the sensor is set to record intensities and to convert these to numbers between 0 and 255. Intensities that are outside the measurement range are converted to either number 0 or 255. After a change in incident light intensity, the measurement range can be displaced and information can be lost. In order to prevent this, the measurement range can be moved. There are known algorithms for this movement of the measurement range.
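  • The loss of information outside the measurement range can be sketched as follows. This is only an illustration under assumed values: the function `sensor_response` and its `gain` and `offset` parameters are hypothetical stand-ins for the sensor settings and do not come from the source.

```python
def sensor_response(intensity, gain=1.0, offset=0.0):
    """Map an incident intensity to a greyscale value in 0..255 (assumed model)."""
    value = gain * intensity + offset
    # Intensities outside the measurement range are converted to 0 or 255.
    return max(0, min(255, round(value)))

incident = [50, 100, 150, 200]

# With a high gain, the two brightest intensities both saturate at 255.
saturated = [sensor_response(x, gain=2.0) for x in incident]

# Moving the measurement range (here by lowering the gain) keeps them distinct.
adjusted = [sensor_response(x, gain=1.0) for x in incident]
```

  • Note that the change of setting is itself an affine change of the greyscale values inside the range, which is the kind of change the operator is meant to be invariant to.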
  • the invention models both how the lighting affects the light emitted from a Lambertian surface and how the sensor depicts the incident intensity.
  • Another advantage of the invention is that it takes into account affine changes in intensity and can thus also handle different settings in the sensor, such as exposure and aperture. This has the result that the number of false alarms is reduced. This also reduces the cost of monitoring.
  • the invention can also handle changes in intensity in surfaces that radiate the same amount all the time, that is self-radiating surfaces, such as a lamp.
  • the invention can also handle surfaces that are intermediate between self-radiating surfaces and Lambertian surfaces as well as reflecting surfaces.
  • the invention can also handle Lambertian surfaces. The change in the lighting can be modelled as
  • $I_{\text{after}} = a\,I_{\text{before}}$. This is useful in the cases when b is small, which can occur for certain changes in the lighting.
  • the step of transforming comprises the steps of filtering the recorded image with a first filter, filtering the recorded image with a second filter, the first and the second filter differing from each other and both having the coefficient sum zero, and determining the quotient between the filtered images.
  • the advantage of using filters is that it reduces the sensitivity to noise.
  • the recorded image is filtered before the quotients between the different pixels are calculated, in order to reduce the sensitivity to noise.
  • the quotient between the images is calculated pixel by pixel.
  • the affine invariant measure can thus be written as
  • $$\frac{F_1 * I_e}{F_2 * I_e} = \frac{F_1 * (a I_f + b)}{F_2 * (a I_f + b)} = \frac{a\,F_1 * I_f}{a\,F_2 * I_f} = \frac{F_1 * I_f}{F_2 * I_f}$$
  • the first and the second filter are a derivative of the Gaussian function.
  • the advantage of the Gaussian function is that it is simple to implement and that it is based on well-documented Scale Space theory, which means that the smoothing and the noise reduction are optimal.
  • the first filter is a derivative of the Gaussian function in the x-direction and the second filter is a derivative of the same Gaussian function in the y-direction.
  • the Gaussian function of the same scale is advantageously used as this results in a simpler implementation.
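  • A minimal sketch of two such filters, sampled on a small grid, is given below. The function name, the scale `sigma=1.0` and the kernel `radius=3` are assumptions chosen for illustration; the source does not specify them.

```python
import math

def gaussian_derivative_kernels(sigma=1.0, radius=3):
    """Sampled x- and y-derivatives of a 2-D Gaussian of scale sigma."""
    coords = range(-radius, radius + 1)

    def g(x, y):
        return math.exp(-(x * x + y * y) / (2 * sigma ** 2)) / (2 * math.pi * sigma ** 2)

    # dG/dx = -x / sigma^2 * G and dG/dy = -y / sigma^2 * G
    gx = [[-x / sigma ** 2 * g(x, y) for x in coords] for y in coords]
    gy = [[-y / sigma ** 2 * g(x, y) for x in coords] for y in coords]
    return gx, gy

gx, gy = gaussian_derivative_kernels()

# Both kernels are antisymmetric, so their coefficients sum to (numerically) zero.
sum_gx = sum(map(sum, gx))
sum_gy = sum(map(sum, gy))
```

  • The zero coefficient sum is what removes the constant term b of an affine change in intensity when the image is filtered.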
  • the first and the second filter are two simple difference filters between the intensity in one pixel and the intensity in another pixel.
  • the first filter is a difference filter between two pixels horizontally adjacent to each other and the second filter is a difference filter between two pixels vertically adjacent to each other. Both filters are thus difference filters between adjacent pixels in two orthogonal directions.
  • a pixel from the difference in the vertical direction agrees with a pixel from the difference in the horizontal direction.
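  • The quotient of the two difference filters can be sketched as below, under the assumption that both differences share the pixel at position (i, j). The function name, the `eps` guard and the use of `None` for undefined quotients are illustrative choices, not part of the source.

```python
def difference_quotient(image, eps=1e-9):
    """Quotient of horizontal and vertical adjacent-pixel differences."""
    rows, cols = len(image), len(image[0])
    result = []
    for i in range(rows - 1):
        row = []
        for j in range(cols - 1):
            dx = image[i][j + 1] - image[i][j]   # horizontal difference
            dy = image[i + 1][j] - image[i][j]   # vertical difference
            # None marks pixels where the quotient is undefined (flat area).
            row.append(dx / dy if abs(dy) > eps else None)
        result.append(row)
    return result

before = [[10, 14, 20], [12, 18, 26], [17, 25, 30]]

# Affine change in intensity with a = 2 and b = 5 (a pure lighting change).
after = [[2 * p + 5 for p in row] for row in before]

# One pixel replaced by a different surface (a change in the scene).
changed = [[10, 14, 20], [12, 40, 26], [17, 25, 30]]

unchanged_by_lighting = difference_quotient(before) == difference_quotient(after)
altered_by_scene = difference_quotient(before) != difference_quotient(changed)
```

  • In this example the transformed image is identical before and after the affine change, while the altered pixel changes the quotient.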
  • the operator is a modification of normalized vector distance, in which the step of transforming comprises the steps of calculating the mean value of the intensity in a subset of the recorded image, subtracting the mean value in each pixel and carrying out a normalized vector distance calculation.
  • the step of comparing comprises the step of calculating the difference between respective vectors in the transformed recorded image and the transformed reference image.
  • the difference can be calculated by calculating the angle between the vectors or the distance between the vectors.
  • a predetermined threshold value can be set. If the distance exceeds this level, it can be decided that there has been a change in the scene. Alternatively, it can be said that it is the angle between the vectors that is compared with the threshold value. If the angle is essentially zero, no change in the scene is said to have occurred and if the angle exceeds a predetermined value it can be decided that there has been a change in the scene.
  • the method comprises the step of adapting at least two parameters to the operator.
  • problems can arise when two filtered images are divided by each other, for example when the intensity is constant in an area, which implies that the corresponding elements in the filtered image will be near zero. This can be avoided by adapting coefficients instead of comparing the quotient of filtered images.
  • the coefficients can be calculated by solving a least-squares problem.
  • An advantage of this embodiment is that it is less sensitive to noise in certain situations.
  • An example of such a situation is when a part of the image with great variation in intensity changes into an area of almost constant intensity, for example when a lamp is switched off.
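  • A minimal sketch of adapting the two coefficients by least squares follows. It assumes that the current image is fitted to the reference image as a·I_ref + b and that the remaining squared error is what is compared with a threshold; the function name, the direction of the fit and the sample values are assumptions.

```python
def fit_affine(reference, current):
    """Least-squares a, b minimising sum((a*r + b - c)^2) over pixel pairs."""
    n = len(reference)
    mean_r = sum(reference) / n
    mean_c = sum(current) / n
    var_r = sum((r - mean_r) ** 2 for r in reference)
    cov = sum((r - mean_r) * (c - mean_c) for r, c in zip(reference, current))
    a = cov / var_r if var_r > 0 else 0.0  # flat reference: only b can be fitted
    b = mean_c - a * mean_r
    residual = sum((a * r + b - c) ** 2 for r, c in zip(reference, current))
    return a, b, residual

reference = [10, 20, 30, 40, 50, 60]

# Pure lighting change: every pixel follows 1.5 * I + 8.
lighting = [1.5 * r + 8 for r in reference]

# Same lighting change, but two pixels now show a different surface.
scene = [23, 38, 120, 10, 83, 98]

a_l, b_l, res_lighting = fit_affine(reference, lighting)
a_s, b_s, res_scene = fit_affine(reference, scene)
```

  • The lighting change is explained almost exactly by the fitted a and b, whereas the scene change leaves a large residual. No division by a filtered image is needed, which is why flat areas are less of a problem here.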
  • said parameters are adapted in such a way that they cover all transformations of said greyscale values arising for changes of the lighting of the scene.
  • the operator is an affine transformation.
  • the parameters are adapted locally in the recorded image.
  • the fact that the adaptation is carried out locally means that the recorded image is divided into different parts and the adaptation of parameters is carried out in each part of the image.
  • An advantage of this embodiment is that local changes in the lighting that only occur in a small part of the area can be handled better.
  • the local adapting can be regarded as if the method for adapting parameters described above was applied to a part of the image.
  • Another embodiment according to the invention comprises in addition the step of filtering both the recorded image and the reference image with a smoothing filter.
  • the advantage of filtering is that the sensitivity to noise is reduced.
  • a smoothing filter is a filter that creates a weighted mean value of the intensities locally, that is in a part of the image. The creation of the weighted mean value means that the variance of the noise in the filtered image is less than in the original image.
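  • The reduction in noise variance can be illustrated with a one-dimensional weighted mean. The weights (0.25, 0.5, 0.25), the noise level and the sample size are assumptions made for this sketch only.

```python
import random
import statistics

def smooth(signal, weights=(0.25, 0.5, 0.25)):
    """Weighted local mean of a 1-D signal (valid region only)."""
    half = len(weights) // 2
    return [
        sum(w * signal[i + k - half] for k, w in enumerate(weights))
        for i in range(half, len(signal) - half)
    ]

random.seed(0)
noise = [random.gauss(0.0, 2.0) for _ in range(10000)]

var_before = statistics.pvariance(noise)
var_after = statistics.pvariance(smooth(noise))
```

  • For independent noise, the variance after filtering is the original variance multiplied by the sum of the squared weights, which for these weights is 0.375.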
  • the smoothing filter is a
  • this comprises a computer program that is stored on a computer-readable memory medium that comprises instructions for causing a computer to carry out the method according to any one of claims 1-17.
  • this comprises the use of an operator in image processing for detection of a change in the scene in an image of an area recorded by a sensor, which operator is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image.
  • this comprises a system for monitoring an area comprising at least one sensor for recording images of the area and at least one processing unit in which the computer program according to claim 18 is stored.
  • the method can be used with an automatic door-opener.
  • a sensor unit can be arranged to continually record images of a monitored area in front of a door.
  • the door can, for example, be a revolving door or a sliding door.
  • a processing unit can be arranged to carry out the above-mentioned method. If a person moves into the monitored area in front of the door, the person is detected as an object and a decision can be taken concerning whether the detected object is to cause the door to open.
  • the image processing that is used as the basis for the decision concerning the opening of the door can have different degrees of intelligence level. This means that the image processing can be very simple and the decision that the door is to be opened can be made for all objects that cause movement to be detected.
  • a signal that the door is to be opened can be transmitted to a door-opening device, that physically opens the door.
  • Automatic door-openers are, for example, very common at the main entrances to various companies. Just inside the door there is usually a manned reception area. If the door is opened frequently, this affects the temperature inside the reception area, with resultant often costly heat losses. In addition, the people working there are exposed to draughts and cold air. It is therefore important that the door is not opened in error.
  • the risk is reduced of the door being opened in error, in, for example, difficult weather conditions, such as snow and rain, and different lighting and shade conditions that can arise when, for example, the sun goes behind a cloud.
  • the automatic door-opener is also reliable when the monitored area is dark, as with the method above it is able to identify persons moving in the monitored area more effectively and can thus decide in a reliable way whether the door is to open.
  • Fig. 1 is a block diagram and shows schematically assumed sources of changes in intensity according to prior-art technique
  • Fig. 2 is a diagram and shows schematically according to prior-art technique that the changes in the intensities are linearly proportional
  • Fig. 3 is a diagram in which by way of experiment the intensities have been recorded before and after a change in the lighting of the scene
  • Fig. 4 is a schematic diagram and shows vectors according to NVD
  • Fig. 5 is a diagram and shows schematically a modelling of the sensor
  • Fig. 6 is a diagram which shows schematically the location of the coordinate system in an image for description of a filter
  • Fig. 7 is a schematic flow diagram of an embodiment according to the present invention.
  • Fig. 8 is a schematic flow diagram of another embodiment according to the present invention
  • Fig. 9 is a partially schematic perspective view and shows a monitoring system according to the present invention
  • Fig. 10 is a schematic block diagram for hardware in a sensor unit according to one embodiment.
  • Fig. 1 discloses, according to prior-art technique, that an intensity incident upon the sensor is dependent upon the scene and the light that falls on the scene. Thus both the incident light and the scene contribute to the intensity incident upon the sensor.
  • the incident intensity may be recorded as greyscale values in the sensor.
  • Fig. 2 is a diagram and shows the relation between the intensity before a light change and after a light change. As appears from Fig. 2, the relationship is considered to be linear according to prior-art technique.
  • Fig. 4 is a diagram disclosing that adjacent pixels designated as "30,30" and "50,50" may be grouped into a vector, which is depicted in the diagram.
  • Fig. 5 shows how the sensor is adapted to different light conditions.
  • the abscissa shows the intensity before the adjustment of the sensor and the ordinate shows the intensity after the adaptation of the sensor.
  • V denotes the space of digital images ($\mathbb{Z}^{n \times m}$, where $\mathbb{Z}$ denotes the set of integers, normally integers between 0 and 255, n denotes the number of rows and m the number of columns)
  • B is an element in V (that is a digital image)
  • V' is some other (linear) space
  • B' is an element in V'.
  • Normally V' also consists of digital images of some size, but also a more general space can be used, for example vector-based images.
  • the group can be the one-dimensional affine group, parameterised by two parameters, a and b, according to
  • W denotes the space of digital images of the same size as the images in V, but with real intensities.
  • the metric measures distances in the space V' and relates these to the original image where we can read off changes in the image that do not originate from the group action. What we have now done can be described as if we are considering the equivalence classes in V that arise from the group action from G and then introducing a distance measure of these equivalence classes.
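  • One way to write this construction down, as a sketch only, is given below. The notation $g_{a,b}$ for a group element, the condition $a \neq 0$, the symbol $d$ for the distance measure and the choice of a norm on the transformed images are assumptions introduced here; F denotes the operator that maps an image B to its transformed image B'.

```latex
g_{a,b} : I(x,y) \mapsto a\,I(x,y) + b, \qquad a \neq 0,
\\
F(g_{a,b} \cdot B) = F(B) \quad \text{for all } g_{a,b} \in G,
\\
d(B_1, B_2) = \lVert F(B_1) - F(B_2) \rVert .
```

  • Under these assumptions, two images that differ only by the group action have distance zero, so a non-zero distance indicates a change that does not originate from the group action.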
  • the monitoring system comprises at least one light-sensitive sensor unit 1 that monitors a monitored area 2.
  • the monitored area can be an area in which no object, such as a person, should be found.
  • the sensor unit 1 continually records digital images of the monitored area 2 in order to detect whether, for example, a person 3 is within the monitored area 2.
  • a digital image can be said to be a matrix in which elements at the position (i,j) tell what light intensity has been detected at that point. If a person 3 is detected within the monitored area 2, the sensor unit can output an alarm signal, that is sent to an alarm centre 4.
  • the alarm signal that is sent to the alarm centre 4 can consist of only a signal that a movement has been detected, but it can also comprise a recorded image or an image of only the moving object that caused the alarm. This image can be displayed on a screen in the alarm centre 4 and a person in the alarm centre 4 can then carry out a further check on what caused the alarm.
  • the alarm centre 4 can be a device that emits a sound signal when it receives an alarm signal from the sensor unit 1.
  • Fig. 10 discloses a block diagram of the hardware in the sensor unit 1.
  • the sensor unit 1 is supplied with a voltage at a voltage connection 10.
  • the sensor unit 1 comprises a powerful processing unit 11.
  • the sensor unit 1 comprises a communication unit 12.
  • the communication unit can be arranged to send an alarm signal to the alarm centre 4 in the event of detection of a movement.
  • the sensor unit 1 comprises a light-sensitive sensor 13, for example a CMOS or CCD sensor, for recording images.
  • the sensor 13 is integrated on a chip and it also has a lens arrangement 14.
  • the sensor unit 1 comprises a volatile memory or RAM memory 15.
  • the sensor unit 1 uses an operating system and can carry out advanced image processing.
  • the sensor unit 1 also comprises a permanent memory 16 for processing code and other data that must be saved in a non-volatile memory. All the components in the sensor unit 1 are advantageously integrated on a circuit board. The advantage of this is that the sensor unit 1 is very robust, that is to say that it is less sensitive to sources of interference and has fewer points where sabotage can be carried out.
  • the algorithms that are used are stored in the permanent memory 16.
  • the sensor 13 records (steps 100, 200) an image of the monitored area 2.
  • the image is transformed by an affine invariant measure being calculated.
  • the affine invariant measure is the same for all affine changes in the lighting and all sensor settings. It is calculated by the following method.
  • the recorded image is filtered (step 210) with two arbitrary linear filters $F_1(x,y)$ and $F_2(x,y)$ with the property that their coefficients sum to zero, $\sum_{x,y} F_i(x,y) = 0$ for $i = 1, 2$.
  • the filtering is carried out in order to reduce the sensitivity to noise.
  • the derivative of the Gaussian function, $G_\sigma$, can be used as filter
  • a filter may be, for example
  • the filtering can be carried out in a number of different ways. For example, we can also use derivatives in the same direction, but with different scales ($\sigma$).
  • an affine invariant measure is calculated (steps 110, 220) in accordance with the following
  • the reference image has been processed in the same way as the recorded image, that is it has been filtered using the same linear filter as the recorded image and the affine invariant measure has been calculated.
  • the affine invariant measure of the recorded processed image is compared (steps 120, 230) with the affine invariant measure of the reference image. If a difference is detected (steps 130, 240), this is said to originate from a change in the scene and it is decided that an alarm situation exists (steps 140, 250).
  • this embodiment corresponds to the operator F depicting the image B on a quotient of filtered images B', which is also an image, but with real numbers as elements.
  • the filter that is used can be of various types.
  • the filter is a linear position-invariant operator that is represented by a matrix and operates on the digital image.
  • the filter is a difference between adjacent pixels.
  • an affine invariant is calculated from the intensities in three adjacent pixels, $I_1$, $I_2$ and $I_3$. This can, for example, be calculated in accordance with the following:
  • a modified NVD (Normalized Vector Distance)
  • the sensor records an image of the monitored area.
  • the image is divided into a number of squares.
  • Each square contains a number of pixels, which can, for example, be 8x8 pixels.
  • a mean value of the intensity in each of the squares is calculated. This mean value is then subtracted from each pixel in the respective square.
  • vectors are calculated based on the intensities of the squares. These vectors are normalized.
  • the vectors are affine invariant measures.
  • the vectors in the transformed recorded image are compared with the vectors in the reference image transformed in the same way.
  • a measure is obtained that is invariant for affine changes in the intensities. If the distance between the vectors is zero, there has been no change in the scene, that is if the angle between the vectors is zero, there has been no change in the scene.
  • a threshold value is often set that means that the angular difference must be a particular minimum size in order for it to be determined that there has been a change in the scene. This is because there is often a certain amount of noise in the image.
  • this modification of NVD can be obtained by letting the space V denote vector-value images with a lower resolution than the original image. Each matrix element in the image B' in V is then a vector. The generalized measure is then the normalized distance between the vectors related to the original image.
  • the above method can advantageously be modified by filtering both the reference image and the current image with any smoothing filter before the parameters a and b are calculated. This method means that the sensitivity to noise is reduced and it is particularly preferable to use a Gaussian filter.
  • the modification of NVD can be handled by adapting coefficients a and b for each block instead of calculating the normalized distance.
  • a threshold value is set, such that if the above minimizing exceeds this value it is determined that there has been a change in the scene.
  • New constants a and b must be calculated for each new image that is recorded by the sensor.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

A method for detection of a change in the scene in an area. The method comprises the steps of recording (100) a digital image of the area using a sensor and transforming (110) the recorded image by an operator. The operator is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image. The method further comprises the step of comparing (120) the transformed image with a reference image of the area in order to detect a difference that indicates a change in the scene.

Description

INVARIANT FILTERS
Field of the Invention
The present invention relates to a method for detection of a change in the scene in an area. It also relates to a computer program, a monitoring system and use of an operator.
Background Art
Image processing is used today to detect various types of objects within a large number of applications, such as in monitoring to detect whether there is an object, such as a person, inside a monitored area. A sensor records images of the monitored area.
There are strong requirements for reliable results from the image processing of these images, as incorrect evaluations can be costly.
For example, the image processing can be used in a monitoring system to prevent break-ins. If an intruder is falsely detected, resulting in a false alarm, it can be very costly if, for example, the police or other security personnel are informed and come to the site as a result of the false alarm.
In order to detect whether there is a foreign object within a monitored area, a sensor records the incident intensity as greyscale values in a digital image of the monitored area. The recorded image is then compared with a reference image. The reference image can, for example, be the immediately preceding image or an image taken at a time when there was no foreign object within the area.
If there is a difference between the compared images, this can be due to a change in the scene or to a change in the lighting of the scene. The monitored area can be said to be a scene that consists of a number of surfaces with reflectance properties. A change in the scene results in a change of the set of surfaces of the recorded image, for example by an object coming into the monitored area or moving in the monitored area, between when the reference image was recorded and when the current image was recorded. A change in the lighting means that the incident light in the scene is changed while the set of surfaces is unchanged, for example by the sun going behind a cloud or by a lamp being switched on.
It is normally only the change in the scene that is of interest while a change in the lighting of the scene should be neglected. This is a problem, as it is very difficult to distinguish between a change in the scene and a change in the lighting.
US 5,956,424 and US 5,937,092 describe a method in video monitoring in which an attempt is made to separate the changes in the lighting from the changes in the scene. This is carried out by attempting to model the intensity of the light that radiates from the surfaces in the scene, in order to filter out changes in the lighting from changes of the actual scene.
In the method according to US 5,956,424 and US 5,937,092, it is assumed that the intensity that radiates from a surface, $I_{out}$, is directly proportional to the incident intensity $I_{in}$, that is $I_{out} = r \cdot I_{in}$, where r is the reflectance of a surface. If a change in the lighting occurs, it is assumed that this is linearly proportional, that is $I_{out}$ after the change in the light $= k \cdot I_{out}$ before the change in the light $= k \cdot r \cdot I_{in}$ before the change in the light, where k is a change in the light factor or irradiance. The method according to US 5,956,424 is based on calculating quotients between greyscale values of adjacent pixels, also called picture elements. Picture elements or pixels can be said to be another name for elements in the matrix that represents the digital image. The quotient is a measure that only depends on the reflectance r of a surface and is independent of the irradiance k. A new image is created, the different element values of which only reflect the reflectance in the associated pixel, and then this image is compared with a reference image in which the reflectance of each pixel is calculated under the assumption that the changes in the lighting are proportionally linear. If the change in the lighting is the same in the whole image, the curve will look the same for all pixels. The inclination of the curve represents the reflectance. The quotient between two adjacent pixels may be calculated pixel by pixel, assuming that the lighting of the scene is the same for adjacent areas, that is $I_{in}(x+1,y) = I_{in}(x,y)$, by the equation:
Figure imgf000005_0001
The quotient is thus independent of k. Thus, a change in the lighting can be discriminated from a change in the scene, since at a change of the lighting, the ratio between adjacent pixels in the present image and the ratio of the same adjacent pixels in the reference image is constant and independent of a change in the irradiance.
Proportionally linear changes in the intensities that this model represents occur when the light is reflected against a Lambertian surface. This is a matt surface, which when it is illuminated, radiates equally in all directions and does not give rise to any reflection. With this modelling and this method, the probability is increased of a detected change being due to a change in the scene. However, many changes in the lighting are still detected as changes in the scene, which can cause costly false alarms. If the light intensity is measured, in reality a curve is obtained, which is not a proportionally linear curve.
The fact that the curve is not proportionally linear is due primarily to the fact that the sensor does not depict the incident intensities proportionally linearly in greyscale values, but as an affine function. It is also partially due to the fact that certain surfaces in an area monitored by the sensor do not fulfil the requirement of being a Lambertian surface. By an affine representation is meant that $I_{after} = a I_{before} + b$.
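The difference between the two models can be illustrated with a small numerical sketch. The pixel values, the factor 0.5 and the offset 16 below are made-up assumptions chosen for illustration and do not come from the cited patents; the helper name `adjacent_quotients` is likewise hypothetical.

```python
def adjacent_quotients(row):
    """Quotient between each pixel and its right-hand neighbour."""
    return [row[i + 1] / row[i] for i in range(len(row) - 1)]

# Made-up greyscale values along one image row (assumption).
before = [40.0, 80.0, 20.0, 50.0]

# Proportionally linear change of the lighting: I_after = k * I_before, k = 0.5.
scaled = [0.5 * v for v in before]

# Affine change: I_after = a * I_before + b, with a = 0.5 and b = 16.
affine = [0.5 * v + 16.0 for v in before]

q_before = adjacent_quotients(before)
q_scaled = adjacent_quotients(scaled)
q_affine = adjacent_quotients(affine)

print("before:", q_before)
print("scaled:", q_scaled)   # same quotients as before the change
print("affine:", q_affine)   # quotients differ once an offset b is present
```

In this sketch the quotients survive the purely proportional change but not the affine one, which is the weakness of the quotient method that the text describes.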
Another problem with the method according to US 5,956,424 is that the calculation of the quotient between the intensities of adjacent pixels means that the system is more sensitive to noise. The sensitivity to noise arises, for example, in very dark areas in the image, or at edges where one side is dark and the other is light. Assume that the quotient is calculated between two pixels where the intensities in the reference image are 5 and 20 respectively, that is the quotient is 20/5 = 4. If the current image of the monitored area is recorded by a sensor that contains noise in each pixel of a maximum of 2 intensity levels, this quotient can vary between 22/3 ≈ 7.3 and 18/7 ≈ 2.6, which can be compared with the reference image's quotient of 4 (+83% to -36%).
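The worst-case range in this example can be checked with a few lines of arithmetic. This is only a sketch of the calculation described in the text; the variable names are assumptions.

```python
ref_dark, ref_light = 5, 20   # reference intensities from the example
noise = 2                     # maximum noise per pixel, in intensity levels

ref_quotient = ref_light / ref_dark

# Largest quotient: light pixel reads high, dark pixel reads low (22 / 3).
highest = (ref_light + noise) / (ref_dark - noise)
# Smallest quotient: light pixel reads low, dark pixel reads high (18 / 7).
lowest = (ref_light - noise) / (ref_dark + noise)

for label, q in (("highest", highest), ("lowest", lowest)):
    change = 100.0 * (q - ref_quotient) / ref_quotient
    print(f"{label}: {q:.2f} ({change:+.0f}% relative to {ref_quotient:.0f})")
```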
Another known technique for attempting to solve the problem of changes in the lighting being detected as changes in the scene is a technique called NVD, "Normalized Vector Distance", described in Matsuyama, Ohya, Habe: "Background subtraction for non-stationary scene", Proceedings of the fourth Asian conference on computer vision 2000, pp 662-667. In this article there is an attempt, precisely as above, to solve the problem of changes in the lighting in the image by modelling them as proportionally linear changes in the intensities, $I_{out,after} = k \cdot r \cdot I_{in}$, where k is a change in the lighting factor. In NVD the image is divided into blocks. The size of the blocks can be chosen according to the application. For example, the blocks can be 2 pixels in size. The first block can have the value (30,30) and the second block (50,50). These vectors have the same direction and it is therefore decided that the change is due to a change in the lighting. If the direction differs over a particular threshold limit, it is decided that the change is a change in the scene. By considering angles between vectors defined on the basis of intensities, forming the elements of the vectors, in partial areas, a measure is obtained that is invariant for proportionally linear changes in the intensities. By invariant is meant in this connection that the angle between the vectors of the reference image and the current image is the same, irrespective of proportionally linear transformations of greyscale values in the current image.
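The direction comparison in NVD can be sketched as follows. This is a minimal illustration and not the implementation from the cited article; the function name `angle_between` and the third block `(50, 30)` are assumptions added for contrast, and the sketch assumes that neither vector is the zero vector.

```python
import math

def angle_between(u, v):
    """Angle in radians between two non-zero intensity vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Clamp to [-1, 1] to guard against rounding just outside the acos domain.
    cos = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.acos(cos)

reference = (30.0, 30.0)   # block in the reference image (from the text)
brighter = (50.0, 50.0)    # same block after a proportional lighting change
changed = (50.0, 30.0)     # hypothetical block after a change in the scene

angle_lighting = angle_between(reference, brighter)
angle_scene = angle_between(reference, changed)

print(f"lighting change: {math.degrees(angle_lighting):.2f} degrees")
print(f"scene change:    {math.degrees(angle_scene):.2f} degrees")
```

A threshold on the angle would then separate the two cases; its value depends on the noise level and is not specified here.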
Using NVD, there are the same disadvantages as mentioned above, which lead to a change in the lighting being able to be interpreted as a change in the scene. The problems with noise in dark areas also still remain, as a vector is defined with components consisting of the intensities in a square that comprises a number of pixels and then this is normalized. If the vector, for example, consists of a 4-dimensional vector with small components, for example (2,5,4,4) and the reference images contain noise with a maximum of 2 intensity levels, the direction of this vector can vary considerably, which may result in false alarms.
Summary of the Invention
An object of the present invention is thus to provide a method of image processing that can detect changes in the scene with greater reliability. More specifically, the method can discriminate between changes caused by lighting conditions and changes caused by scene conditions.
According to a first aspect of the present invention, this comprises a method in image processing for detection of a change in the scene in an area, comprising the steps of recording a digital image of the area with a sensor, transforming the recorded image by an operator that is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image, and comparing the transformed image with a reference image of the area in order to detect a difference that indicates a change in the scene.
The invention is based on an analysis of changes in the lighting that shows that changes in the intensities in the images do not only depend upon changes in the lighting of the scene and upon the reflective properties of the surfaces, but also upon how the sensor depicts intensities as greyscale values. By modelling for these changes in the lighting, the number of false alarms is reduced, as these changes in the lighting are removed by the transformation and therefore do not show up in the comparison between the transformed recorded image and the reference image. False alarms are costly and with a reduction in the number of false alarms, the cost of the system that uses this method is also reduced. A further advantage is that different sensor settings can be made, such as changes in amplification and aperture, without the risk that these will be detected as changes in the scenes in the area.
This method can be used for various types of monitoring, such as monitoring for intruders. The method can also be used with manufacturing processes for inspecting various components in order to detect defects in the manufactured product.
In another embodiment, the reference image is also transformed by said operator.
The transformed image is compared with a reference image that has been transformed according to the same method. An advantage of this method is that the transformed reference image and the transformed recorded image will differ in the event of a change in the scene, but are the same for changes in lighting intensity for which the operator is modelled. In this way, the number of false alarms is reduced. In a further embodiment, the operator is invariant with regard to transformations of said greyscale values in the recorded image.
The advantage of the operator being invariant, is that the transformed image is the same even if there is a change in the greyscale values arising from a change in lighting, such as that for which the operator is modelled. This means that when the transformed recorded image is compared with a reference image transformed in the same way, the differences that are detected will indicate changes in the scene and not the above-mentioned changes in greyscale values resulting from a change in the lighting of the scene.
In a still further embodiment, the operator is an affine invariant.
This embodiment is based on the idea of transforming the image and calculating an affine invariant measure that is constant over affine changes in intensity. The advantage of using affine functions is that a measure is obtained that is unchanged by affine changes in intensity, but is changed by a change in the scene. The operator is invariant for affine transformations.
If it is not taken into account that the greyscale values that are recorded in the digital image are due both to the incident intensity and to how the sensor converts these intensities into greyscale values, image information can be lost that may be important for the continued image processing. The sensor can have range 0 - 255, which is a measurement range within which the sensor is set to record intensities and to convert these to numbers between 0 and 255. Intensities that are outside the measurement range are converted to either number 0 or 255. After a change in incident light intensity, the measurement range can be displaced and information can be lost. In order to prevent this, the measurement range can be moved. There are known algorithms for this movement of the measurement range. If only quotients are used that take into account linearly proportional changes in the intensities, this movement affects the transformed image. If, on the other hand, the affine invariant measure is used, the movement of the measurement range does not affect the transformed image. The invention models both how the lighting affects the light emitted from a Lambertian surface and how the sensor depicts the incident intensity.
With a proportionally linear transformation it may not be possible to capture in a reliable way the variations in the intensity that arise, as these occur within a limited greyscale range. By instead using an affine transformation and by moving the measurement range, as in our invention, the variations that arise in a particular range can be captured more accurately. The appearance of this affine transformation, that is where it ends up in the coordinate system, depends among other things on the exposure time and aperture of the sensor. The settings of the sensor change the position of the greyscale range. An advantage of our invention is thus that a change in the measurement range is not detected as a change in the scene.
Another advantage of the invention is that it takes into account affine changes in intensity and can thus also handle different settings in the sensor, such as exposure and aperture. This has the result that the number of false alarms is reduced. This also reduces the cost of monitoring.
An additional advantage is that the invention can also handle changes in intensity in surfaces that radiate the same amount all the time, that is self-radiating surfaces, such as a lamp. In addition, the invention can also handle surfaces that are intermediate between self-radiating surfaces and Lambertian surfaces as well as reflecting surfaces. Of course, the invention can also handle Lambertian surfaces. The change in the lighting can be modelled as
$$I_{\text{after change in the light}} = c \, k \, I_{\text{before change in the light}} + b$$
where c is a constant that depends on the sensor and on the reflectance and b is a constant that depends on the sensor. Below, the designation $a = c \cdot k$ is used. The advantage of this modelling is that it also takes into account the settings of the sensor. By modelling in this way, account is taken of light change functions $I_{after} = a I_{before} + b$ in which $a = 1$, that is a function according to the equation $I_{after} = I_{before} + b$. This is particularly useful in the cases when b is large, which may occur for certain settings of the sensor.
An additional advantage is that proportionally linear changes in the lighting can also be handled, that is $I_{after} = a I_{before} + b$ with $b = 0$, which gives $I_{after} = a I_{before}$. This is useful in the cases when b is small, which can occur for certain changes in the lighting.
In another embodiment of the method according to the invention, the step of transforming comprises the steps of filtering the recorded image with a first filter, filtering the recorded image with a second filter, the first and the second filter differing from each other and both having the coefficient sum zero, and determining the quotient between the filtered images.
The advantage of using filters is that it reduces the sensitivity to noise. The recorded image is filtered before the quotients between the different pixels are calculated, in order to reduce the sensitivity to noise. The quotient between the images is calculated pixel by pixel.
The affine invariant measure can thus be written as
$$m_I = \frac{F_1 * I}{F_2 * I}$$
where $F_1$ and $F_2$ denote different filters with the coefficient sum zero, $*$ denotes convolution and $I = I(x,y)$ denotes the intensities in the image. From this formula, it can be seen that affine transformations of the intensities in the images do not affect the measure $m_I$:
$$m_I = \frac{F_1 * I_e}{F_2 * I_e} = \frac{F_1 * (a I_f + b)}{F_2 * (a I_f + b)} = \frac{a F_1 * I_f + F_1 * b}{a F_2 * I_f + F_2 * b} = \frac{F_1 * I_f}{F_2 * I_f}$$
where $I_e$ is the intensity after the change in the lighting and $I_f$ is the intensity before the change in the lighting. The last quotient is obtained as $F_1 * b = F_2 * b = 0$, which is due to the fact that the coefficient sum for $F_1$ and $F_2$ is zero. In the special case mentioned above, where $I_{after} = a I_{before} + b$ and $a = 1$, that is a function according to the equation $I_{after} = I_{before} + b$, it is sufficient to use the approximate invariant $m_I(x,y) = F * I$, where F denotes an arbitrary filter with the coefficient sum 0.
In the case where $I_{after} = a I_{before} + b$ with $b = 0$, which gives $I_{after} = a I_{before}$, it is sufficient to use the approximate invariant
$$m_I = \frac{F_1 * I}{F_2 * I}$$
where $F_1$ and $F_2$ denote arbitrary filters, since
$$m_I = \frac{F_1 * I_e}{F_2 * I_e} = \frac{F_1 * (a I_f)}{F_2 * (a I_f)} = \frac{a F_1 * I_f}{a F_2 * I_f} = \frac{F_1 * I_f}{F_2 * I_f}$$
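The invariance of the quotient of two zero-sum filter responses can be checked numerically. The sketch below assumes the simplest possible zero-sum filters, namely differences between adjacent pixels in the horizontal and vertical directions; the image values, the constants a = 0.5 and b = 16, the `eps` guard and all function names are assumptions made for illustration.

```python
def diff_x(img, x, y):
    """Horizontal difference filter (coefficients -1, +1; sum zero)."""
    return img[y][x + 1] - img[y][x]

def diff_y(img, x, y):
    """Vertical difference filter (coefficients -1, +1; sum zero)."""
    return img[y + 1][x] - img[y][x]

def affine_invariant(img, eps=1e-9):
    """Pixel-by-pixel quotient of the two filtered images.

    Positions where the denominator is (almost) zero are marked None,
    since the quotient is not defined there.
    """
    rows, cols = len(img), len(img[0])
    out = []
    for y in range(rows - 1):
        row = []
        for x in range(cols - 1):
            den = diff_y(img, x, y)
            row.append(diff_x(img, x, y) / den if abs(den) > eps else None)
        out.append(row)
    return out

# Made-up 3x3 greyscale image (assumption).
before = [[10.0, 14.0, 30.0],
          [18.0, 40.0, 22.0],
          [25.0, 31.0, 60.0]]

# Affine change of the intensities: I_after = a * I_before + b.
a, b = 0.5, 16.0
after = [[a * v + b for v in row] for row in before]

m_before = affine_invariant(before)
m_after = affine_invariant(after)

print(m_before)
print(m_after)   # expected to agree with m_before up to rounding
```

The `None` entries reflect that the quotient breaks down where the denominator response vanishes, for example in areas of constant intensity.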
In one embodiment, the first and the second filter are a derivative of the Gaussian function.
The advantage of the Gaussian function is that it is simple to implement and that it is based on well-documented Scale Space theory, which means that the smoothing and the noise reduction is optimal. In an additional embodiment according to the invention, the first filter is a derivative of the Gaussian function in the x-direction and the second filter is a derivative of the same Gaussian function in the y-direction.
The Gaussian function of the same scale is advantageously used as this results in a simpler implementation.
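A sampled derivative of a Gaussian has the coefficient sum zero, which is the property required of the two filters. The following sketch builds a one-dimensional kernel to illustrate this; the scale `sigma = 1.5`, the kernel radius of 5 pixels and the function name are assumptions, and a two-dimensional filter would in practice combine such a kernel in one direction with Gaussian smoothing in the other.

```python
import math

def gaussian_derivative_kernel(sigma, radius):
    """Samples of the first derivative of a Gaussian at integer positions."""
    norm = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return [-(x / sigma ** 2) * norm * math.exp(-x * x / (2.0 * sigma ** 2))
            for x in range(-radius, radius + 1)]

kernel = gaussian_derivative_kernel(sigma=1.5, radius=5)

# The kernel is antisymmetric about its centre, so the coefficients cancel.
coefficient_sum = sum(kernel)
print(f"coefficient sum: {coefficient_sum:.3e}")

# Applied to a constant signal (a pure offset b), the response is therefore zero.
constant_response = sum(k * 16.0 for k in kernel)
print(f"response to constant offset: {constant_response:.3e}")
```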
In one embodiment according to the invention, the first and the second filter are two simple difference filters between the intensity in one pixel and the intensity in another pixel.
The advantage of this embodiment is that it is very simple to implement and also very fast. In another embodiment according to the invention, the first filter is a difference filter between two pixels horizontally adjacent to each other and the second filter is a difference filter between two pixels vertically adjacent to each other. Both filters are thus difference filters between adjacent pixels in two orthogonal directions. Advantageously, a pixel from the difference in the vertical direction agrees with a pixel from the difference in the horizontal direction. In another embodiment according to the invention, the operator is a modification of normalized vector distance, in which the step of transforming comprises the steps of calculating the mean value of the intensity in a subset of the recorded image, subtracting the mean value in each pixel and carrying out a normalized vector distance calculation.
The advantage of this modified NVD is that, unlike the standard NVD, it takes into account affine changes in the intensities. In still another embodiment according to the invention, the step of comparing comprises the step of calculating the difference between respective vectors in the transformed recorded image and the transformed reference image.
The difference can be calculated by calculating the angle between the vectors or the distance between the vectors.
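The modified NVD can be sketched for a single block as follows. The block values, the constants a = 0.5 and b = 16 and the function names are assumptions made for illustration; the block is given as a flat list of pixel intensities, and the sketch assumes a positive factor a.

```python
import math

def centred_unit_vector(block):
    """Subtract the block mean, then normalize to unit length."""
    mean = sum(block) / len(block)
    centred = [v - mean for v in block]
    norm = math.sqrt(sum(c * c for c in centred))
    if norm == 0.0:
        return None   # constant block: the direction is undefined
    return [c / norm for c in centred]

def vector_distance(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))

reference = [12.0, 40.0, 25.0, 31.0]          # made-up block intensities
relit = [0.5 * v + 16.0 for v in reference]   # affine change, a = 0.5, b = 16
moved = [40.0, 12.0, 31.0, 25.0]              # hypothetical change in the scene

ref_vec = centred_unit_vector(reference)
d_lighting = vector_distance(ref_vec, centred_unit_vector(relit))
d_scene = vector_distance(ref_vec, centred_unit_vector(moved))

print(f"distance after affine lighting change: {d_lighting:.6f}")
print(f"distance after change in the scene:    {d_scene:.6f}")
```

Subtracting the mean removes the offset b and the normalization removes the factor a, so only the change in the scene leaves a distance clearly above zero in this sketch.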
A predetermined threshold value can be set. If the distance exceeds this level, it can be decided that there has been a change in the scene. Alternatively, it can be said that it is the angle between the vectors that is compared with the threshold value. If the angle is essentially zero, no change in the scene is said to have occurred and if the angle exceeds a predetermined value it can be decided that there has been a change in the scene. In an embodiment according to the invention, the method comprises the step of adapting at least two parameters to the operator.
In certain situations, problems can arise when two filtered images are divided by each other. This happens, for example, when the intensity is constant in an area, which implies that the corresponding elements in the filtered image will be near zero. This can be avoided by adapting coefficients instead of comparing the quotient of filtered images. The coefficients can be calculated by solving a least-squares problem.
An advantage of this embodiment is that it is less sensitive to noise in certain situations. An example of such a situation is when a part of the image with great variation in intensity changes into an area of almost constant intensity, for example when a lamp is switched off.
In another embodiment, said parameters are adapted in such a way that they cover all transformations of said greyscale values arising for changes of the lighting of the scene.
In this way, changes in intensity that arise as a result of changes in the lighting or as a result of how the sensor depicts changes in intensity will not be detected as changes in the scene and will therefore not give rise to false alarms.
In still another embodiment according to the invention, the operator is an affine transformation.
In an additional embodiment according to the present invention, the parameters are adapted locally in the recorded image.
The fact that the adaptation is carried out locally means that the recorded image is divided into different parts and the adaptation of parameters is carried out in each part of the image. The adaptation of parameters can be carried out by adapting the recorded image to the reference image, in accordance with the following: $a I_{\text{recorded image}} + b = I_{\text{reference image}}$.
An advantage of this embodiment is that local changes in the lighting that only occur in a small part of the area can be handled better. The local adapting can be regarded as if the method for adapting parameters described above was applied to a part of the image.
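The adaptation of a and b for one part of the image can be sketched as an ordinary least-squares fit. This is an illustration under stated assumptions, not the patented implementation: the block values, the function name `fit_affine` and the threshold value 50 are made up, and the blocks are given as flat lists of pixel intensities.

```python
def fit_affine(recorded, reference):
    """Least-squares fit of a * recorded + b = reference for one block.

    Returns (a, b, residual), where residual is the remaining sum of
    squared differences after the fit.
    """
    n = len(recorded)
    mean_x = sum(recorded) / n
    mean_y = sum(reference) / n
    sxx = sum((x - mean_x) ** 2 for x in recorded)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(recorded, reference))
    a = sxy / sxx if sxx > 0.0 else 0.0   # constant block: only b can be fitted
    b = mean_y - a * mean_x
    residual = sum((a * x + b - y) ** 2 for x, y in zip(recorded, reference))
    return a, b, residual

reference = [12.0, 40.0, 25.0, 31.0]          # made-up reference block
relit = [2.0 * v + 3.0 for v in reference]    # same scene, affine lighting change
changed = [60.0, 60.0, 20.0, 75.0]            # hypothetical change in the scene

THRESHOLD = 50.0   # assumed value; in practice it depends on the noise level

fit_relit = fit_affine(relit, reference)
fit_changed = fit_affine(changed, reference)

for name, (a, b, residual) in (("relit", fit_relit), ("changed", fit_changed)):
    print(f"{name}: a={a:.3f} b={b:.3f} residual={residual:.2f} "
          f"change in scene: {residual > THRESHOLD}")
```

Because no division by a filtered image is involved, a block of constant intensity does not cause a division by a value near zero in this formulation.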
Another embodiment according to the invention comprises in addition the step of filtering both the recorded image and the reference image with a smoothing filter. The advantage of filtering is that the sensitivity to noise is reduced. A smoothing filter is a filter that creates a weighted mean value of the intensities locally, that is in a part of the image. The creation of the weighted mean value means that the variance of the noise in the filtered image is less than in the original image. In one embodiment, the smoothing filter is a Gaussian function.
Different breadths for the Gaussian function may be chosen, and in this way different levels of smoothing can be obtained. According to a third aspect of the invention, this comprises a computer program that is stored on a computer-readable memory medium that comprises instructions for causing a computer to carry out the method according to any one of claims 1-17.
According to a fourth aspect of the invention, this comprises the use of an operator in image processing for detection of a change in the scene in an image of an area recorded by a sensor, which operator is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image. According to a fifth aspect of the invention, this comprises a system for monitoring an area comprising at least one sensor for recording images of the area and at least one processing unit in which the computer program according to claim 18 is stored. The advantages of these aspects of the invention are apparent from the discussions above.
In a seventh aspect of the invention, the method can be used with an automatic door-opener.
The method is particularly advantageous for use in an automatic door-opener. A sensor unit can be arranged to continually record images of a monitored area in front of a door. The door can, for example, be a revolving door or a sliding door. A processing unit can be arranged to carry out the above-mentioned method. If a person moves into the monitored area in front of the door, the person is detected as an object and a decision can be taken concerning whether the detected object is to cause the door to open. The image processing that is used as the basis for the decision concerning the opening of the door can have different degrees of intelligence level. This means that the image processing can be very simple and the decision that the door is to be opened can be made for all objects that cause movement to be detected. It can also be very advanced and only cause the door to open in the event that the detected object has, for example, a particular shape, size or direction of movement. If it is decided that the door is to be opened, a signal that the door is to be opened can be transmitted to a door-opening device, that physically opens the door.
Automatic door-openers are, for example, very common at the main entrances to various companies. Just inside the door there is usually a manned reception area. If the door is opened frequently, this affects the temperature inside the reception area, with resultant often costly heat losses. In addition, the people working there are exposed to draughts and cold air. It is therefore important that the door is not opened in error. By the use of the above-mentioned method, the risk is reduced of the door being opened in error, in, for example, difficult weather conditions, such as snow and rain, and different lighting and shade conditions that can arise when, for example, the sun goes behind a cloud. The automatic door-opener is also reliable when the monitored area is dark, as with the method above it is able more effectively to identify persons moving in the monitored area and can thus decide in a reliable way whether the door is to open.
Brief Description of the Drawings
Further objects, features and advantages of the invention will appear from the detailed description given below with reference to the accompanying drawings, in which
Fig. 1 is a block diagram and shows schematically assumed sources of changes in intensity according to prior-art technique, Fig. 2 is a diagram and shows schematically according to prior-art technique that the changes in the intensities are linearly proportional,
Fig. 3 is a diagram in which by way of experiment the intensities have been recorded before and after a change in the lighting of the scene,
Fig. 4 is a schematic diagram and shows vectors according to NVD, Fig. 5 is a diagram and shows schematically a modelling of the sensor,
Fig. 6 is a diagram which shows schematically the location of the coordinate system in an image for description of a filter,
Fig. 7 is a schematic flow diagram of an embodiment according to the present invention,
Fig. 8 is a schematic flow diagram of another embodiment according to the present invention, Fig. 9 is a partially schematic perspective view and shows a monitoring system according to the present invention, and
Fig. 10 is a schematic block diagram for hardware in a sensor unit according to one embodiment .
Description of Embodiments of the invention
The invention will next be described first in abstract mathematical language and thereafter by means of a number of embodiments. Fig. 1 discloses, according to prior-art technique, that an intensity incident upon the sensor is dependent upon the scene and the light that falls on the scene. Thus both the incident light and the scene contribute to the intensity incident upon the sensor. The incident intensity may be recorded as greyscale values in the sensor.
Fig. 2 is a diagram and shows the relation between the intensity before a light change and after a light change. As appears from Fig. 2, the relationship is considered to be linear according to prior-art technique.
In reality, as shown in Fig. 3, there is an offset in the relationship between intensity before and after the light change, whereupon the relationship is essentially linear. Fig. 4 is a diagram disclosing that adjacent pixels designated as "30,30" and "50,50" may be grouped into a vector, which is depicted in the diagram. Fig. 5 shows how the sensor is adapted to different light conditions. Thus, the abscissa shows the intensity before the adjustment of the sensor and the ordinate shows the intensity after the adaptation of the sensor.
Mathematical Description of the Invention
The basic idea of the present invention may be formulated in abstract mathematical language.
In order to model the physical reality and how the sensor records the physical world, operators are used.
The operators that we want to find are functions, F, that depict a digital image in some other space, that is
F : V s B ^ B' e V
Here V denotes the space of digital images (Z11™, where Z denotes the number of integers - normally integers between 0 and 255 -, n denotes the number of rows and m the number of columns) , B is an element in V (that is a digital image) , V is some other (linear) space and B' is an element in V . Normally V also consists of digital images of some size, but also a more general space can be used, for example vector-based images.
The transformation of the images in V that we want to "filter" out can be regarded as some group action over V. Introduce the notation G for this group:
$$g \in G : V \ni B \mapsto g(B) \in V$$
where an element $g$ in the group $G$ transforms one image $B$ into another image $g(B)$ (in the same space). For example, the group can be the one-dimensional affine group, parameterised by two parameters, $a$ and $b$, according to
$$g(a,b) \in G : V \ni I(x,y) \mapsto a I(x,y) + b \in V$$
where $I(x,y)$ denotes the intensity in position $(x,y)$.
The operators, $F$, that we are seeking are those that are invariant for the group action with the group $G$, which means that
$$F(B) = F(g(B)), \quad \forall g \in G$$
For the affine group, this means that the image $B$ is depicted on the same element in $V'$ as the image $g(B)$, that is we get the same element in $V'$ irrespective of affine transformations of the intensity.
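The invariance condition can be checked numerically for a candidate operator. The following is a minimal sketch, not the operator of the invention: `min_max` is a hypothetical stand-in operator (rescaling intensities to the range 0 to 1), `identity` is a counter-example, and the sampled pairs (a, b) are arbitrary illustrative values with a > 0.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]


def affine(image: Image, a: float, b: float) -> Image:
    """Group element g(a, b): map every intensity I(x, y) to a*I(x, y) + b."""
    return [[a * v + b for v in row] for row in image]


def min_max(image: Image) -> Image:
    """Hypothetical operator F: rescale intensities to the range [0, 1]."""
    flat = [v for row in image for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:  # constant image: no contrast to normalise
        return [[0.0 for _ in row] for row in image]
    return [[(v - lo) / (hi - lo) for v in row] for row in image]


def identity(image: Image) -> Image:
    """Operator that is not invariant; used as a counter-example."""
    return [list(row) for row in image]


def is_invariant(op: Callable[[Image], Image], image: Image,
                 params: Sequence[Tuple[float, float]], tol: float = 1e-9) -> bool:
    """Check F(B) == F(g(B)) for every sampled group element g(a, b)."""
    expected = op(image)
    for a, b in params:
        actual = op(affine(image, a, b))
        for row_e, row_a in zip(expected, actual):
            if any(abs(e - v) > tol for e, v in zip(row_e, row_a)):
                return False
    return True


B = [[10.0, 20.0], [30.0, 50.0]]
SAMPLES = [(2.0, 5.0), (0.5, -3.0), (1.7, 40.0)]  # illustrative values, all with a > 0
```

Sampling a few group elements cannot prove invariance, but it quickly exposes an operator, such as `identity`, that is not invariant.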
We can drive the formalism a step further by considering a generalized measure (metric) of the space V
[Equation image imgf000020_0001, defining the generalized measure, is not reproduced in this text.]
where W denotes the space of digital images of the same size as the images in V, but with real intensities. The metric measures distances in the space V and relates these to the original image where we can read off changes in the image that do not originate from the group action. What we have now done can be described as if we are considering the equivalence classes in V that arise from the group action from G and then introducing a distance measure of these equivalence classes.
Application in a Monitoring System
Using a number of embodiments, it will now be described how the mathematical description above can be applied in practice for image processing in a monitoring system, as shown in Fig. 9. The monitoring system comprises at least one light-sensitive sensor unit 1 that monitors a monitored area 2. The monitored area can be an area in which no object, such as a person, should be found. The sensor unit 1 continually records digital images of the monitored area 2 in order to detect whether, for example, a person 3 is within the monitored area 2. A digital image can be said to be a matrix in which the element at position (i,j) gives the light intensity that has been detected at that point. If a person 3 is detected within the monitored area 2, the sensor unit can output an alarm signal that is sent to an alarm centre 4. The alarm signal can consist of only a signal that a movement has been detected, but it can also comprise a recorded image or an image of only the moving object that caused the alarm. This image can be displayed on a screen in the alarm centre 4 and a person in the alarm centre 4 can then carry out a further check on what caused the alarm. In a very simple case, the alarm centre 4 can be a device that emits a sound signal when it receives an alarm signal from the sensor unit 1.
Fig. 10 discloses a block diagram of the hardware in the sensor unit 1. The sensor unit 1 is supplied with a voltage at a voltage connection 10. In addition, the sensor unit 1 comprises a powerful processing unit 11. The sensor unit 1 comprises a communication unit 12. The communication unit can be arranged to send an alarm signal to the alarm centre 4 in the event of detection of a movement. In addition, the sensor unit 1 comprises a light-sensitive sensor 13, for example a CMOS or CCD sensor, for recording images. The sensor 13 is integrated on a chip and it also has a lens arrangement 14.
In addition, the sensor unit 1 comprises a volatile memory or RAM memory 15. The sensor unit 1 uses an operating system and can carry out advanced image processing. The sensor unit 1 also comprises a permanent memory 16 for processing code and other data that must be saved in a non-volatile memory. All the components in the sensor unit 1 are advantageously integrated on a circuit board. The advantage of this is that the sensor unit 1 is very robust, that is to say that it is less sensitive to sources of interference and has fewer points where sabotage can be carried out.
The algorithms that are used are stored in the permanent memory 16.
Filtering
A first embodiment will now be explained with reference to the flow diagrams in Figs 7 and 8. The sensor 13 records 100, 200 an image of the monitored area 2. The image is transformed by calculating an affine invariant measure. The affine invariant measure is the same for all affine changes in the lighting and all sensor settings. It is calculated by the following method.
The recorded image is filtered 210 with two arbitrary linear filters $F_1(x,y)$ and $F_2(x,y)$ with the property that
$$\sum_{x,y} F_i(x,y) = 0, \quad i = 1, 2$$
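The filtering is carried out in order to reduce the sensitivity to noise. For example, the derivative of the Gaussian function, $G_a$, can be used as filter
[Equation image imgf000022_0001, defining $G_a$, is not reproduced in this text.]
where $a$ denotes the breadth (the scale). Thus, a filter may be, for example
$$F_1(x,y) = \frac{\partial}{\partial x}\,\frac{1}{2\pi a^2}\, e^{-(x^2+y^2)/(2a^2)}$$
[Equation image imgf000022_0002 is not reproduced in this text.]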
The filtering can be carried out in a number of different ways. For example, we can also use derivatives in the same direction, but with different scales (a) .
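As an illustration of the zero coefficient sum, the sketch below samples the x- and y-derivatives of a Gaussian on a small square grid. It assumes the standard two-dimensional Gaussian with normalisation 1/(2πa²), since the defining equation is not reproduced here; the function names, the scale 1.0 and the grid radius 3 are hypothetical choices.

```python
import math
from typing import List, Tuple

Kernel = List[List[float]]


def gaussian_derivative_kernels(a: float, radius: int) -> Tuple[Kernel, Kernel]:
    """Sample dG_a/dx and dG_a/dy on a (2*radius + 1) x (2*radius + 1) grid."""
    def g(x: int, y: int) -> float:
        # Assumed form: standard 2-D Gaussian with breadth (scale) a.
        return math.exp(-(x * x + y * y) / (2.0 * a * a)) / (2.0 * math.pi * a * a)

    coords = range(-radius, radius + 1)
    # Rows are indexed by y, columns by x.
    fx = [[-x / (a * a) * g(x, y) for x in coords] for y in coords]
    fy = [[-y / (a * a) * g(x, y) for x in coords] for y in coords]
    return fx, fy


def coefficient_sum(kernel: Kernel) -> float:
    """Sum of all filter coefficients; zero for an affine-invariant quotient."""
    return sum(sum(row) for row in kernel)


FX, FY = gaussian_derivative_kernels(1.0, 3)
```

Because each sampled derivative is antisymmetric about the grid centre, the coefficients cancel in pairs and both sums vanish up to rounding, which is the property required of the two filters.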
After the filtering, an affine measure is calculated 110, 220 in accordance with the following
$$m_I = \frac{F_1 * I_e}{F_2 * I_e} = \frac{F_1 * (a I_f + b)}{F_2 * (a I_f + b)} = \frac{a F_1 * I_f + F_1 * b}{a F_2 * I_f + F_2 * b} = \frac{F_1 * I_f}{F_2 * I_f}$$
since $F_1 * b = F_2 * b = 0$, which is due to the fact that the coefficient sum for $F_1$ and $F_2$ is zero. Given the properties of the filters, the measure will be independent of the constants $a$ and $b$. The image is filtered before the quotients between different pixels are calculated, in order to reduce the sensitivity to noise.
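The independence of the quotient of two zero-sum filter responses from a and b can be checked on a small synthetic image. This is a sketch under stated assumptions: the 3x3 central-difference kernels, the test image and the values a = 1.8, b = 25 are illustrative, and `filter_valid` applies the kernel without flipping it (correlation rather than convolution), which does not affect the invariance.

```python
from typing import List, Optional

Image = List[List[float]]


def filter_valid(image: Image, kernel: Image) -> Image:
    """Apply a linear filter at every position where the kernel fits entirely."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for r in range(len(image) - kh + 1):
        row = []
        for c in range(len(image[0]) - kw + 1):
            row.append(sum(kernel[i][j] * image[r + i][c + j]
                           for i in range(kh) for j in range(kw)))
        out.append(row)
    return out


def affine_invariant_measure(image: Image, f1: Image, f2: Image,
                             eps: float = 1e-12) -> List[List[Optional[float]]]:
    """Quotient (F1 * I) / (F2 * I); None where the denominator is near zero."""
    num = filter_valid(image, f1)
    den = filter_valid(image, f2)
    return [[n / d if abs(d) > eps else None for n, d in zip(nr, dr)]
            for nr, dr in zip(num, den)]


# Two zero-sum filters: horizontal and vertical central differences.
F1 = [[0, 0, 0], [-1, 0, 1], [0, 0, 0]]
F2 = [[0, -1, 0], [0, 0, 0], [0, 1, 0]]

# Synthetic image (indexed [y][x]) and an affinely relit copy.
I_F = [[x * x + 3 * y + x * y for x in range(5)] for y in range(5)]
I_E = [[1.8 * v + 25 for v in row] for row in I_F]

M_F = affine_invariant_measure(I_F, F1, F2)
M_E = affine_invariant_measure(I_E, F1, F2)
```

Positions where the denominator is near zero are returned as `None`, which reflects the division problem in areas of constant intensity.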
The reference image has been processed in the same way as the recorded image, that is it has been filtered using the same linear filter as the recorded image and the affine invariant measure has been calculated. The affine invariant measure of the recorded processed image is compared 120, 230 with the affine invariant measure of the reference image. If a difference is detected 130, 240, this is said to originate from a change in the scene and it is decided that an alarm situation exists 140, 250.
In the formalism of the mathematical description, this embodiment corresponds to the operator F depicting the image B on a quotient of filtered images B' , which is also an image, but with real numbers as elements.
Special Case of Filtering
Another embodiment can be regarded as a special case of filtering as above. The filter that is used can be of various types. The filter is a linear position-invariant operator that is represented by a matrix and operates on the digital image. In one example, the filter is a difference between adjacent pixels. In this embodiment, an affine invariant is calculated from the intensities in three adjacent pixels, $I_1$, $I_2$ and $I_3$. This can, for example, be calculated in accordance with the following:
[Equation image imgf000023_0001 is not reproduced in this text.]
In particular, the pixel to the right and the pixel below can be used, which gives
$$\frac{I(x+1,y) - I(x,y)}{I(x,y+1) - I(x,y)}$$
In this way, we obtain by transformation a "new" image. The same transformation is carried out on the reference image. The affine invariant measure of the recorded image is compared with the affine invariant measure of the reference image. If there is a difference between the two images, a change in the scene is said to have taken place.
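A minimal sketch of this special case follows, using the quotient of the right-neighbour difference and the below-neighbour difference and comparing the result with that of a reference image. The function names, the tolerance and the sample intensities are hypothetical, and skipping pixels with a zero denominator is an assumption made for the sketch, not something the text prescribes.

```python
from typing import List, Optional

Image = List[List[float]]


def difference_quotient(image: Image, eps: float = 1e-12) -> List[List[Optional[float]]]:
    """(I(x+1, y) - I(x, y)) / (I(x, y+1) - I(x, y)), with image indexed [y][x]."""
    out = []
    for y in range(len(image) - 1):
        row = []
        for x in range(len(image[0]) - 1):
            right = image[y][x + 1] - image[y][x]
            below = image[y + 1][x] - image[y][x]
            row.append(right / below if abs(below) > eps else None)
        out.append(row)
    return out


def scene_changed(recorded: Image, reference: Image, tol: float = 1e-6) -> bool:
    """Compare the transformed recorded image with the transformed reference."""
    for row_r, row_f in zip(difference_quotient(recorded), difference_quotient(reference)):
        for u, v in zip(row_r, row_f):
            if u is None or v is None:
                continue  # flat area: quotient undefined, skipped in this sketch
            if abs(u - v) > tol:
                return True
    return False


REFERENCE = [[10.0, 14.0, 21.0], [13.0, 20.0, 30.0], [19.0, 29.0, 45.0]]
RELIT = [[0.6 * v + 12.0 for v in row] for row in REFERENCE]  # lighting change only
INTRUDED = [row[:] for row in RELIT]
INTRUDED[1][1] = 40.0  # one pixel altered by a change in the scene
```

With these values, `scene_changed(RELIT, REFERENCE)` reports no change, whereas the single altered pixel in `INTRUDED` is detected.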
There are many types of filter that can be used
Figure imgf000024_0001
where the axes are in accordance with Fig. 6. Which filter is used, depends on what requirements are imposed relating to the sensitivity to noise and what processing power is available. The larger the filter, the more complex the calculations that are required, but on the other hand, a more robust system is obtained that is not so sensitive to interference.
Modified NVD
In another embodiment, a modified NVD (Normalized Vector Distance) is used. The sensor records an image of the monitored area. The image is divided into a number of squares. Each square contains a number of pixels, which can, for example, be 8x8 pixels. A mean value of the intensity in each of the squares is calculated. This mean value is then subtracted from each pixel in the respective square. Following this, vectors are calculated based on the intensities of the squares. These vectors are normalized. The vectors are affine invariant measures. The vectors in the transformed recorded image are compared with the vectors in the reference image transformed in the same way. By considering the angles between the vectors in the recorded image and the reference image, a measure is obtained that is invariant for affine changes in the intensities. If the distance between the vectors is zero, that is if the angle between the vectors is zero, there has been no change in the scene. A threshold value is often set, which means that the angular difference must be of a particular minimum size in order for it to be determined that there has been a change in the scene. This is because there is often a certain amount of noise in the image.
In the formalism above, this modification of NVD can be obtained by letting the space V denote vector-value images with a lower resolution than the original image. Each matrix element in the image B' in V is then a vector. The generalized measure is then the normalized distance between the vectors related to the original image.
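The modified NVD can be sketched as follows. The sketch assumes that each square contributes one vector made up of its mean-subtracted pixel intensities, which is one reading of the description; the function names, the 2x2 block size and the sample intensities are hypothetical.

```python
import math
from typing import List, Optional

Image = List[List[float]]


def block_vectors(image: Image, size: int) -> List[Optional[List[float]]]:
    """Per block: subtract the block mean, then normalise to unit length."""
    vectors = []
    for r0 in range(0, len(image) - size + 1, size):
        for c0 in range(0, len(image[0]) - size + 1, size):
            block = [image[r][c]
                     for r in range(r0, r0 + size) for c in range(c0, c0 + size)]
            mean = sum(block) / len(block)
            centred = [v - mean for v in block]
            norm = math.sqrt(sum(v * v for v in centred))
            # A constant block has no direction; mark it as undefined.
            vectors.append([v / norm for v in centred] if norm > 0 else None)
    return vectors


def block_angles(recorded: Image, reference: Image, size: int) -> List[Optional[float]]:
    """Angle in radians between corresponding block vectors."""
    angles = []
    for u, v in zip(block_vectors(recorded, size), block_vectors(reference, size)):
        if u is None or v is None:
            angles.append(None)
            continue
        dot = sum(p * q for p, q in zip(u, v))
        angles.append(math.acos(max(-1.0, min(1.0, dot))))  # clamp rounding error
    return angles


REFERENCE = [[10.0, 20.0, 5.0, 9.0],
             [30.0, 50.0, 7.0, 2.0],
             [1.0, 2.0, 8.0, 3.0],
             [4.0, 9.0, 6.0, 1.0]]
RELIT = [[2.0 * v + 7.0 for v in row] for row in REFERENCE]  # lighting change only
CHANGED = [row[:] for row in RELIT]
CHANGED[0][0] = 120.0  # scene change inside the first block
```

Comparing each angle with a small threshold, rather than with zero, leaves room for image noise, as described above.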
Adapting Coefficients
In certain situations, problems can arise when two filtered images are divided by each other, for example when the intensity is constant in an area, which implies that the corresponding elements in the filtered image will be near zero. This can be avoided by matching coefficients $a$ and $b$ so that $a I_r(x,y) + b$ is as near to the recorded image $I(x,y)$ as possible, where $I_r$ denotes the reference image, instead of comparing the quotient of filtered images. The coefficients are most suitably calculated by solving the least square problem
$$\min_{a,b} \sum_{(x,y) \in \Omega} \left( a I_r(x,y) + b - I(x,y) \right)^2$$
where Ω denotes a suitable subset in the image. This least square problem is very simple to solve by writing down the standard equations and inverting a 2x2 matrix. Let x denote a vector that contains all intensities within Ω for the reference image ordered in some suitable way and let y denote corresponding intensities for the current image. The standard equations can now be written as
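$$y_i = a x_i + b, \quad i = 1, \ldots, n,$$
where $n$ denotes the number of pixels in the area Ω. In matrix form, with $x$ and $y$ written as row vectors and $\mathbf{1}$ denoting a row vector of $n$ ones, the above can be written as
$$y = a x + b \mathbf{1} = \begin{bmatrix} a & b \end{bmatrix} \begin{bmatrix} x \\ \mathbf{1} \end{bmatrix}$$
If we now multiply by the transpose of the last matrix on both sides, we get
$$y \begin{bmatrix} x^T & \mathbf{1}^T \end{bmatrix} = \begin{bmatrix} a & b \end{bmatrix} \begin{bmatrix} x \\ \mathbf{1} \end{bmatrix} \begin{bmatrix} x^T & \mathbf{1}^T \end{bmatrix} = \begin{bmatrix} a & b \end{bmatrix} \begin{bmatrix} x x^T & x \mathbf{1}^T \\ \mathbf{1} x^T & \mathbf{1} \mathbf{1}^T \end{bmatrix}$$
where $\mathbf{1} \mathbf{1}^T = n$, which in turn gives
$$\begin{bmatrix} a & b \end{bmatrix} = y \begin{bmatrix} x^T & \mathbf{1}^T \end{bmatrix} \begin{bmatrix} x x^T & x \mathbf{1}^T \\ \mathbf{1} x^T & \mathbf{1} \mathbf{1}^T \end{bmatrix}^{-1}$$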
In the next step, these parameters $a$ and $b$ are used to transform the reference image into a new image by defining $I_r^{new} = a I_r + b$. $I_r^{new}$ is then compared with the current image. For example, the difference image can be considered and then thresholded in order to detect changes in the scene. The above method can advantageously be modified by filtering both the reference image and the current image with any smoothing filter before the parameters $a$ and $b$ are calculated. This reduces the sensitivity to noise, and it is particularly preferable to use a Gaussian filter.
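The coefficient fit and the thresholded difference image can be sketched as follows, with Ω taken to be the whole image and the 2x2 system inverted in closed form. The function names, the threshold of 20 and the sample intensities are hypothetical, and the optional smoothing step is omitted.

```python
from typing import List, Tuple

Image = List[List[float]]


def fit_affine(reference: Image, recorded: Image) -> Tuple[float, float]:
    """Least-squares a, b minimising the sum of (a*x_i + b - y_i)^2 over all pixels."""
    x = [v for row in reference for v in row]
    y = [v for row in recorded for v in row]
    n = len(x)
    sxx = sum(v * v for v in x)
    sx = sum(x)
    sxy = sum(p * q for p, q in zip(x, y))
    sy = sum(y)
    det = sxx * n - sx * sx  # determinant of [[sxx, sx], [sx, n]]
    if det == 0:
        raise ValueError("reference is constant over the region; a is undetermined")
    a = (n * sxy - sx * sy) / det
    b = (sxx * sy - sx * sxy) / det
    return a, b


def changed_pixels(reference: Image, recorded: Image, threshold: float) -> List[List[bool]]:
    """Threshold the difference between a*I_r + b and the recorded image."""
    a, b = fit_affine(reference, recorded)
    return [[abs(a * r + b - c) > threshold for r, c in zip(rr, cr)]
            for rr, cr in zip(reference, recorded)]


REFERENCE = [[10, 20, 30], [40, 50, 60], [70, 80, 90]]
RELIT = [[25, 30, 35], [40, 45, 50], [55, 60, 65]]  # equals 0.5 * REFERENCE + 20
A_FIT, B_FIT = fit_affine(REFERENCE, RELIT)

INTRUDED = [row[:] for row in RELIT]
INTRUDED[1][1] = 95  # one pixel altered by a change in the scene
MASK = changed_pixels(REFERENCE, INTRUDED, 20.0)
```

Because the altered pixel also enters the fit, it shifts b slightly for every pixel; the threshold therefore has to be larger than this shift, which is one reason to fit over a region with many pixels.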
In the same way, the modification of NVD can be handled by adapting coefficients a and b for each block instead of calculating the normalized distance. A threshold value is set, such that if the minimum of the above least square problem exceeds this value, it is determined that there has been a change in the scene. New constants a and b must be calculated for each new image that is recorded by the sensor.
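The per-block variant can be sketched by computing, for each block, the minimum of the least square problem directly and comparing it with a threshold. The sketch uses the closed-form residual of a straight-line fit; the function names, the threshold of 50 and the sample blocks (given as flat lists of pixel intensities) are hypothetical.

```python
from typing import List, Sequence


def min_residual(ref_block: Sequence[float], cur_block: Sequence[float]) -> float:
    """Minimum over a, b of the sum of (a*x_i + b - y_i)^2 for one block."""
    n = len(ref_block)
    mx = sum(ref_block) / n
    my = sum(cur_block) / n
    sxx = sum((x - mx) ** 2 for x in ref_block)
    syy = sum((y - my) ** 2 for y in cur_block)
    sxy = sum((x - mx) * (y - my) for x, y in zip(ref_block, cur_block))
    if sxx == 0.0:
        return syy  # constant reference block: only the offset b can be fitted
    return max(0.0, syy - sxy * sxy / sxx)  # guard against rounding below zero


def blocks_changed(ref_blocks: Sequence[Sequence[float]],
                   cur_blocks: Sequence[Sequence[float]],
                   threshold: float) -> List[bool]:
    """Flag each block whose best affine fit still leaves a large residual."""
    return [min_residual(r, c) > threshold for r, c in zip(ref_blocks, cur_blocks)]


REF_BLOCKS = [[10.0, 20.0, 30.0, 40.0], [5.0, 5.0, 9.0, 1.0]]
CUR_BLOCKS = [[23.0, 43.0, 63.0, 83.0],  # 2 * reference + 3: lighting change only
              [40.0, 5.0, 9.0, 1.0]]     # first pixel altered by a scene change
FLAGS = blocks_changed(REF_BLOCKS, CUR_BLOCKS, 50.0)
```

With these values the first block is fitted exactly and is not flagged, while the second block keeps a large residual and is flagged.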
Even though several embodiments of the invention have been described above, it is obvious to those skilled in the art that many alternatives, modifications and variations are feasible in the light of the above description. The invention is only limited by the appended patent claims.

Claims

1. A method for detection of a change in a scene in an area, comprising the steps of recording (100) a digital image of the area using a sensor, transforming (110) the recorded image by an operator that is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image, and comparing (120) the transformed image with a reference image of the area in order to detect a difference that indicates a change in the scene.
2. A method according to claim 1, in which the reference image is transformed by said operator.
3. A method according to any one of the preceding claims, in which the operator is invariant with regard to transformations of said greyscale values in the recorded image.
4. A method according to any one of the preceding claims, in which the operator is an affine invariant.
5. A method according to any one of the preceding claims, in which the step of transforming comprises the steps of filtering the recorded image with a first filter, filtering the recorded image with a second filter, the first and the second filter differing from each other and both having the coefficient sum zero, and determining the quotient between the filtered images.
6. A method according to claim 5, in which the first and the second filter are a derivative of the Gaussian function.
7. A method according to claim 6, in which the first filter is a derivative of the Gaussian function in the x-direction and the second filter is a derivative of the same Gaussian function in the y-direction.
8. A method according to claim 5, in which the first and the second filter are two simple difference filters between the greyscale value in one pixel and the greyscale value in another pixel.
9. A method according to claim 8, in which the first filter is a difference filter between two pixels horizontally adjacent to each other and the second filter is a difference filter between two pixels vertically adjacent to each other.
10. A method according to any one of claims 1-4, in which the operator is a modification of normalized vector distance, the step of transforming comprising the steps of calculating the mean value of the greyscale value in a subset of the recorded image, subtracting the mean value in each pixel, and carrying out a normalized vector distance calculation.
11. A method according to claim 10, in which the step of comparing comprises the step of calculating the difference between respective vectors in the transformed recorded image and the transformed reference image.
12. A method according to claim 1, further comprising the step of adapting at least two parameters to the operator.
13. A method according to claim 12, in which said parameters are adapted in such a way that they cover all transformations of said greyscale values.
14. A method according to claim 13, in which the operator is an affine transformation.
15. A method according to any one of claims 12-14, in which the step of adapting said parameters to the operator is carried out locally in the recorded image.
16. A method according to any one of claims 13-15, further comprising the step of filtering both the recorded image and the reference image with a smoothing filter.
17. A method according to claim 16, in which the smoothing filter is a Gaussian function.
18. A computer program that is stored on a computer-readable memory medium that comprises instructions for causing a computer to carry out the method according to any one of claims 1-17.
19. Use of an operator in image processing for the detection of a change in the scene in an image of an area recorded by a sensor, which operator is based on a previous modelling of changes in the lighting in the area and on a modelling of how the sensor depicts the area in greyscale values in the recorded image.
20. Use according to claim 19, in which the operator is invariant with regard to the transformations of the greyscale values that arise as a result of the changes in the lighting in the area and as a result of how the sensor depicts these changes in the lighting in greyscale values in the digital image.
21. Use according to claim 20, in which the operator is an affine invariant.
22. Use of a method according to any one of claims 1-17 for controlling an automatic door-opener.
23. A system for monitoring an area, comprising at least one sensor for recording images of the area, a memory in which the computer program according to claim 18 is stored and a processing unit.
24. A system for detection of a change in a scene in an area, comprising: a sensor (1) for recording a digital image of the area, and a processing means (11) for transforming the recorded image by an operator that is based on a previous modeling of changes in the lighting in the area and on a modeling of how the sensor depicts the area in grayscale values in the recorded image, and comparing the transformed image with a reference image of the area in order to detect a difference that indicates a change in the scene.
25. A system according to claim 24, in which the operator is invariant with regard to transformations of said grayscale values in the recorded image.
26. A system according to claim 25, in which the operator is an affine invariant.
PCT/SE2002/001187 2001-06-21 2002-06-19 Invariant filters WO2003001810A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
SE0102209A SE519279C2 (en) 2001-06-21 2001-06-21 Custom filters for scene change detection
SE0102209-4 2001-06-21

Publications (1)

Publication Number Publication Date
WO2003001810A1 true WO2003001810A1 (en) 2003-01-03

Family

ID=20284563

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2002/001187 WO2003001810A1 (en) 2001-06-21 2002-06-19 Invariant filters

Country Status (2)

Country Link
SE (1) SE519279C2 (en)
WO (1) WO2003001810A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111611930A (en) * 2020-05-22 2020-09-01 华域汽车系统股份有限公司 Parking space line detection method based on illumination consistency

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0720114A2 (en) * 1994-12-28 1996-07-03 Siemens Corporate Research, Inc. Method and apparatus for detecting and interpreting textual captions in digital video signals
US5561718A (en) * 1992-01-17 1996-10-01 U.S. Philips Corporation Classifying faces
DE19623524A1 (en) * 1996-06-13 1998-01-02 Pintsch Bamag Ag Monitoring unit for danger area at railway level crossing
US5719959A (en) * 1992-07-06 1998-02-17 Canon Inc. Similarity determination among patterns using affine-invariant features
US5767922A (en) * 1996-04-05 1998-06-16 Cornell Research Foundation, Inc. Apparatus and process for detecting scene breaks in a sequence of video frames
EP0932115A2 (en) * 1998-01-23 1999-07-28 Seiko Epson Corporation Apparatus and method for pattern recognition
WO2001048696A1 (en) * 1999-12-23 2001-07-05 Wespot Ab Method, device and computer program for monitoring an area

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111611930A (en) * 2020-05-22 2020-09-01 华域汽车系统股份有限公司 Parking space line detection method based on illumination consistency
CN111611930B (en) * 2020-05-22 2023-10-31 华域汽车系统股份有限公司 Parking space line detection method based on illumination consistency

Also Published As

Publication number Publication date
SE0102209D0 (en) 2001-06-21
SE519279C2 (en) 2003-02-11
SE0102209L (en) 2002-12-22

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ CZ DE DE DK DK DM DZ EC EE EE ES FI FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP