EP2790165A1 - Quality determination in data acquisition - Google Patents

Quality determination in data acquisition Download PDF

Info

Publication number
EP2790165A1
EP2790165A1 EP13162865.3A EP13162865A EP2790165A1 EP 2790165 A1 EP2790165 A1 EP 2790165A1 EP 13162865 A EP13162865 A EP 13162865A EP 2790165 A1 EP2790165 A1 EP 2790165A1
Authority
EP
European Patent Office
Prior art keywords
detection signal
error
model
sensor
prediction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP13162865.3A
Other languages
German (de)
French (fr)
Inventor
Ralf Junghans
Karl Grüner
Kyandoghere Kyamakya
Alireza Fasih
Fadi Al Machot
Ahmad Haj Mosa
Mouhannad Ali
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SWARCO Traffic Systems GmbH
Original Assignee
SWARCO Traffic Systems GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SWARCO Traffic Systems GmbH filed Critical SWARCO Traffic Systems GmbH
Priority to EP13162865.3A priority Critical patent/EP2790165A1/en
Publication of EP2790165A1 publication Critical patent/EP2790165A1/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/015Detecting movement of traffic to be counted or controlled with provision for distinguishing between two or more types of vehicles, e.g. between motor-cars and cycles
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/042Detecting movement of traffic to be counted or controlled using inductive or magnetic detectors
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/07Controlling traffic signals

Definitions

  • the present invention relates to quality determination in data acquisition.
  • the present invention relates to a method, an apparatus and a computer program product for quality determination in data acquisition.
  • Inductive loops and other well-known data suppliers like video cameras with virtual loops, infrared sensors, radar sensors and the like are called traffic detectors or sensors, respectively, and are especially used in traffic control systems for delivering a good picture of the traffic situation based on the cross section and lane-related measurement principle of traffic parameters.
  • each vehicle driving over/through these physical/virtual sections on given lanes is recognized and causes the electronic detector component to generate a pulse for each vehicle presence in form of digital information.
  • inductive loop detectors generate digital information out of the analogue signal response.
  • the analogue and digital circuits of the detector component together with the connected loop-wire located in the road concrete/bitume may also be capable of using special parameters as measurement mechanism.
  • the obtained digital information is provided to the traffic control software within the intersection traffic controller and to any traffic monitoring system.
  • this problem is solved by picking up the digital pulse information directly at the sensor output.
  • a sensor reliability data model is trained offline to gather all significant characteristics of the sensor component, the physical object in the road surface/concrete, environmental influences, all this for enabling a reliable assessment of the data quality of the single sensor.
  • mutual interferences are also trained in the sensor reliability data model.
  • the resulting model or rather the set of models offers for the first time the possibility of monitoring (in real- or quasi real-time and proactively) both the current state and the evolution of sensor data quality especially for sensors delivering a digital pulse stream caused induced by the presence of vehicles in traffic.
  • the present invention presents a global and comprehensive concept for a robust "traffic detectors' quality management".
  • the overall concept relates to all traffic sensors types, independently of whether they are point-sensors or area-sensors.
  • sensors types are: loop detectors systems, video-based systems, radar-based systems (all forms of radars are meant: radio-wave based; infrared-light-based, light-based, sound-based), and (earth) magnet fields based systems.
  • a signal processing apparatus for processing at least one detection signal, the apparatus comprising:
  • a method for processing at least one detection signal comprising:
  • the method further comprises
  • a computer program product comprising computer-executable computer program code which, when the program is run on a computer (e.g. a computer of an apparatus according to any one of the aforementioned apparatus-related exemplary aspects of the present invention), is arranged to cause the computer to carry out the method according to any one of the aforementioned method-related exemplary aspects of the present invention.
  • a computer e.g. a computer of an apparatus according to any one of the aforementioned apparatus-related exemplary aspects of the present invention
  • Such computer program product may comprise or be embodied as a (tangible) computer-readable (storage) medium or the like on which the computer-executable computer program code is stored, and/or the program may be directly loadable into an internal memory of the computer or a processor thereof.
  • Fig. 1 is an overview of a scenario to which the present invention is applicable.
  • Fig. 1 shows an example of a crossroad that is equipped with various traffic lights and sensors.
  • reference signs TS1 to TS4 indicate traffic signal lights and reference signs D1 to D10 indicate inductive loop detectors.
  • reference sign TC denotes a traffic controller for traffic light signaling including a traffic control module 10. As shown in Fig. 1 , analogue signal from the various inductive loop detectors are input into the traffic control module 10.
  • Fig. 2 is a block diagram illustrating the traffic control module 10 shown in Fig. 1 .
  • the traffic control module 10 comprises a detection module 11, a RoSiT (Robust Sensors in Traffic) module 20 and traffic programming module 12.
  • analogue signals from the inductive loop detectors D1 to D10 are input to the detection module 11 and the detection module 11 outputs digitized signals to the RoSiT module 20. Further, the RoSiT module 20 outputs an indication regarding loop data quality to the traffic programming module 12.
  • the RoSiT module 20 comprises a pattern recognition module 21, an environmental influences module 22, an error classification module 23, a quality assessment and forecast module 24, an error correction module 25, a detector parameter recommendation module 26, and a truck detection module 27.
  • the pattern recognition module 21, the environmental influences module 22, the error classification module 23, the error correction module 25, the detector parameter recommendation module 26, and the truck detection module 27 each input data to the quality assessment and forecast module 24, which outputs loop data quality information to the traffic programmers module 12.
  • the present invention comprises the following aspects. Each of the below aspects will be described in more details later on.
  • the first aspect concerns the from and granularity of the detector's signal raw data observation level. That is, according to a first aspect, solely microscopic detector data is used for quality management.
  • a third aspect deals with the issue of how to differentiate and classify the error types observed. That is, according to the third aspect, in case of errors/faults presence, these faults are classified. In this regard, it is proposed to robustly classify and identify OCCURING error/faults types, the error types depending on the underlying sensor type.
  • the type of vehicle observed is classified. That is, from the pattern collected out of non-erroneous data, the presence of either a car or a truck is classified (this classification is independent of the sensor type).
  • a fifth aspect concerns the issue of how to perform erroneous detector data correction, whenever possible.
  • the fifth aspect while involving sensor type related error types knowledge, sensor topology, environmental information, and observed error types, the errors are robustly corrected and a cleaned data stream is generated.
  • the sixth aspect relates to how to build a black box model of the error rate related detector's behavior in dependence of both environmental information (weather and temperature) and traffic volume.
  • the seventh aspect it is described how to perform short, middle and long term detector error rate performance prediction while involving the black-box model. That is, based on archived pattern history and environmental conditions information we do predict detector faults rate for short, middle and long term. This does also give indication about the remaining detector life time.
  • An eighth aspect deals with how to either automatically tune or recommend contextually appropriate detector 'sensitivity' values and/or 'measurement time' values and/or other equivalent or relevant settings in order to minimize detector's error rate performance.
  • a detector tuning recommendation that takes environmental information and a detector reliability physics model into consideration
  • the recommendation concerns measurement time and/or sensitivity level, and/or other equivalent detector settings.
  • the processing related to all of the above described 8 aspects is performed in real-time and does reach a detection rate above 96% according to very extensive tests that did involve real field data.
  • microscopic level it is referred to the level at which the detector raw data have been transformed into an information about presence or non-presence of a vehicle on the road.
  • the atomic information (or atomic data stream) considered in the present invention and on which all further processing steps are based is that about
  • the first step in the global detectors quality management is to transform the detector raw data into an "atomic data stream".
  • a pre-processing module for generating/guessing/synthesizing the related 'atomic data stream' can be designed.
  • Such a module to be calibrated once by real sample observations may eventually contain some form of stochastic intelligence.
  • the atomic data stream is to be observed and analyzed in order to determine whether it contains errors or not. In this point, the overall methodology of this determination is presented.
  • the output of this processing is information-related to the presence or not of errors in the observed atomic data stream. Either a single cross-section or a set of consecutive cross-sections is/are considered simultaneously depending on their logical interdependency as determined/fixed by the underlying sensor network topology.
  • a reference clean or errorless data stream of appropriate significant length is collected or generated. This corresponds to an observation from typical portions from several typical hours up to days.
  • the collected reference data stream is transformed into various images of different sizes (using sliding windows of different sizes) while involving the logical-topological information.
  • the different images are transformed though nonlinear processing (radon transformation and CNN processing) out of which a set of reference classes is determined and saved.
  • the discrete time CNN classification allows a comparison with the references classes.
  • Portions of the "atomic data stream" containing errors are identified in and marked as such.
  • marked atomic data stream portions are processed and errors pattern are differentiated and then clustered.
  • a semi-supervised labeling involving for example rough-set theory is then performed offline by a human expert. This labeling depends on the underlying sensor type and is done once. For loop detectors, for example and just for illustrative purposes, the labeling will differentiate 6 error classes: weak-signal, pulse break-up, chattering, splash-over, over-counting and under-counting.
  • the error classes are of course not limited to the above mentioned 6 error classes.
  • the labeling will depend on their inherent physics and on the sensor network topological information.
  • the offline phase 1 is used to record reference "error containing data streams or data stream pairs" of different lengths while taking the following into consideration: reference logical-topological contexts, weather and environmental conditions and sensor type or sensor types combinations.
  • a clustering of the different error types is performed by involving for example rough-set theory.
  • a human expert is used once in order to label the different types/classes of errors observed/identified/known. The latter is done once for every sensor type or sensors type combinations.
  • corresponding signatures are fixed/extracted by using either a nonlinear feature extractor like CNN or by a statistical measure performed over sliding windows.
  • the signatures related to each error type are saved for further use in the online phase.
  • This OFFLINE phase is performed once and the output is the set of signatures for all relevant error types for a given sensor type or sensor types combination. These signatures will then be used in phase 2 (ONLINE) in order to detect/ classify errors (types).
  • the phase 2 is an online process. It assumes that the signatures of all possible errors are known (while considering related logical-topological and weather + environment information).
  • CNN CNN based, SVM based, or other
  • Phase 1 of Aspect 3 has been done once and thus has already provided the signatures of all possible errors, to skip all steps of Aspect 2 coming after Aspect 2/Phase2/(1).
  • the online data streams of different lengths would then be collected from here and processed for error classification.
  • Aspect 2 is merely de facto skipped and considered or integrated as an offline part of Aspect 3.
  • This Online phase 2 can be directly performed on the online data-stream while skipping most of the online phase (Phase 2) of Aspect 2 as explained in the note "Alternative to (3)".
  • Clean portions of the "atomic data stream" (i.e. not containing errors), originated from a single detector system, are processed in order to classify the type of vehicle observed.
  • This classification relates to vehicle length and respective headways as sole differentiating features. Thus, it is mainly differentiated between trucks and cars. Further differentiations are possible so far they are related to vehicle length.
  • the main classification is based on the features sets constituted by the presence data pulse lengths and the related distribution of the respective headways.
  • a shock wave pattern is then extracted and involved in a support vector machine based classification.
  • the classification output is whether the vehicle is a car or a truck, and whether it has been in movement or stopping, the latter for example due to red traffic light phase.
  • This phase 1 is an offline process. It is done once and intends to produce the signatures of a truck presence within the atomic data streams while taking related context information into consideration.
  • the reference data streams involved here must be error free.
  • the obtained various signatures of the truck's presence in a traffic stream are then saved for use in the online truck detection process.
  • This offline phase results in a set of signatures indicating the presence and the position of a truck within a traffic stream from different perspectives.
  • This phase 2 is an online process.
  • the online data streams are assumed to be error free. If they are not error free, the stream should be taken after error correction or the portions containing errors should be skipped for the truck detection.
  • a CNN based classifier is preferred due to the high performance. Other classifiers can however also been used.
  • This online phase detects the presence of a truck within a traffic stream and gives also the exact position.
  • the atomic 'presence pulse' corresponding to a truck will be identified.
  • Error correction is based on a reasoning based on some prior knowledge formulated logically by a human expert and that depends on error classes related to a given detector type and on fixed sensor network topological configurations.
  • the topological information is related to how detectors lay as a group in a common cross-section or are situated with a given spatial separation within a direct longitudinal neighborhood.
  • the prior knowledge that expresses the essence of the rules for the real-time, logic-programming-based reasoning correction is based on a plausibility matrix fixing the possible either simultaneous or consecutive error occurrences depending on the underlying topology and while considering the detector-type related variety of error classes.
  • the output of the correction process is, for each given error (detected by the concept of Aspect 3), that:
  • This phase 1 is an offline process.
  • Aspect 2 and Aspect 3 knowledge from Aspect 2 and Aspect 3 is used by a human expert to formulate logic-programming rules that are then integrated in a logic-programming reasoner to be used in real-time for error correction.
  • the reasoner will involve an appropriate ontology that includes the context information indicated in Aspect 2: logical-topology, weather and environment, data streams lengths, error types, etc.
  • the ontology and the reasoner will be both designed during this offline phase.
  • ASP Answer Set Programming
  • This phase 2 is an online process.
  • the reasoner will determine whether the error can be corrected. If yes, it will correct it. If not it will either state a default recommendation or state that neither default recommendation nor correction is possible due to lack of information.
  • Each default recommendation should be endued with a probability value. This latter value is obtained offline as explained in Phase (1) of Aspect 5.
  • the reasoner will preferably involve a solver based on ASP (Answer Set Programming) to ensure a fast speed under real-time constraints.
  • ASP Automatic Set Programming
  • This Online phase involves the use of an ultrafast logic-programming reasoner for online error corrections and/or for default recommendations for cases a sure correction is not possible.
  • a black-box model of the detector's performance is constructed with the objective of probabilistically describing the error rate related detector's behavior. In the essence, the intention is to determine the detector's reliability physics in dependence of both internal and external parameters.
  • Internal detector parameters are different settings determining and/or related to its core functioning principle. External parameters are traffic volume, detector's network topology, weather and environmental related ones.
  • Potential/candidate internal parameters are for example: sensor sensitivity level, measurement time level, etc. These parameters may differ depending on the specific sensor system type.
  • Potential/candidates for external parameters are: weather, temperature, traffic level, proportion of trucks in the traffic, etc.
  • a robust detectors' quality management requires the capability to predict the detector's reliability in three time horizons: short term (range of some minutes up to one hour); middle term (range of some hours, up to some days, and up to a week); long term (range of several weeks, up to several months, and up to some years).
  • the short term prediction is of relevance for managing the impact of detector quality on traffic management.
  • the middle term prediction is mainly of relevance for short term traffic infrastructure maintenance activities.
  • the long term prediction is mainly of relevance to predict the operationally acceptable lifetime of a given detector or detector group.
  • the prediction does involve the prior knowledge that is compressed in the black-box model obtained in Aspect 6. Further, for both short-term and middle term detector's-performance prediction, a concept involving mainly hidden markov models and the black-box model is used. For both short and middle term, all external parameters are either known or can be easily predicted (e.g. traffic volume) or obtained from external sources (e.g. weather information).
  • the available detector's data history to be involved in the long-term prediction should be at least two times longer than the future time-frame ahead for prediction. For example, to predict for the next three weeks (or months), the available observed history should be longer than 6 weeks (or months). Thus, the appropriate level of data aggregation should be selected in a way that the available history should satisfy this length requirement.
  • the envelope's prediction involves two major steps: a) time series filtering through a quadratic trend estimation and a consecutive smoothing; and b) estimate a cubic trend of the later-resulting time series and use it for future prediction.
  • Step 1 Offline Short Term Prediction
  • this process is aiming at predicting the error rate in order to evaluate the SHORT TERM performance of the sensor system.
  • the aggregation time unit is hours; one predict for the next couple of hours (1 to 4 hours).
  • the black box reliability physics model is taken unto consideration.
  • the short term prediction considers the history of error rates associated with the conditions (weather, temperature and traffic volume).
  • State space models stocastic and probabilistic are proposed for this prediction. For instance, hidden Markov model (HMM), Bayesian network and Kalman filter.
  • the sensor system internal parameter tuning control system (see Aspect 8) will need the prediction result as one of the inputs if a tuning assessment for the corresponding time horizon is needed.
  • this process is aiming at predicting the error rate in order to evaluate the MIDDLE TERM performance of the sensor.
  • the aggregation time unit is days and weeks. According to these aggregations, the error rate time series is smoother than in short term, and the environmental conditions have due to the aggregation a relatively smaller impact on error rates.
  • the proposed middle term prediction uses a stochastic state space model (e.g. Auto Regression Moving Average Model, Local Trend Model, etc.).
  • the sensor system internal parameter tuning control system (see Aspect 8) will need the prediction result as one of the inputs if a tuning assessment for the corresponding time horizon is needed.
  • this process is aiming at LONG TERM predicting the error rate in order to estimate amongst others the lifetime of a given sensor.
  • This threshold should be sensor system type specific. If for example an average of 10% error rate (@ 500 vehicles) is no more acceptable, then it will be fixed as a threshold on which basis the potential remaining life time of a sensor can be estimated based on the long term prediction functionality.
  • a future 'time horizon' of 3 months and more is taken, for example.
  • the Long term prediction model estimates the trend of the worst cases of error rates. It involves similar performance worst cases observed in the recent related history of the historical data.
  • This online process enables a future prediction of the sensor system performance. It allows to know in advance whether the sensor system may reach performance regions that are unacceptable.
  • the reliability physics model is an input used to extensively train the new sensor system internal parameter tuning control model of Fig. 4 .
  • the tuning can be calculated for the current time or for a future time interval. Respective related inputs will be then needed.
  • This online sensor system tuning model controller is in essence a REASONER that builds on knowledge from the sensor system reliability physics model. It ensures an adaptivity of the sensor system to keep the error rates performance the lowest possible at any moment.
  • a point sensor is typically stationed/placed at a fixed location along/on/under a roadway and watches (i.e. counts, records, classify, etc.) vehicles passing at this particular location in the time domain.
  • the fixed location may be:
  • point sensors are: loop detectors, radars, ultrasonic radars, Lidars (light detection and ranging) and video cameras.
  • area sensors are: aerial photography or satellite imagery, and mobile sensors such as automatic vehicle location and global positioning system.
  • the inductive-loop detector is the most utilized sensor in a traffic management system.
  • the principal components of an inductive-loop detector are:
  • Cars passing over or that stop within the detection area of an inductive-loop detector do decrease the inductance of the loop.
  • the electronics unit senses this event as a decrease in frequency and sends a pulse to the controller signifying the passage or presence of a vehicle.
  • Video-based traffic sensor systems :
  • Video-based detectors use advanced image processing schemes running on an appropriate microprocessor to analyze a video image input. Different approaches are used by video detection sensors. Some analyze the video image of a target area on the pavement. A change in the image of the target area as a vehicle passes through the target area is analyzed. Another approach does rather identify when a target vehicle enters the camera field of view and tracks the target vehicle through this field of view. Finally, other video sensors use a combination of these two approaches.
  • a radar operates according to the following principle: " The radar dish or antenna transmits pulses of radio waves or microwaves which bounce off any object in their path. The object returns a tiny part of the wave's energy to a dish or antenna which is usually located at the same site as the transmitter" (Source: http://en.wikipedia.org/wiki/Radar; latest access: Sept. 10th, 2012 ). Through an appropriate processing of the reflected energy the radar can be used to determine presence, range, altitude, speed and direction of targeted objects. In traffic detection, the radar is used as a point sensor which can detect both presence and speed of a vehicle in a fixed sport (or cross-section) on the road.
  • detector's signal raw data is meant the time series generated by a point detector. It is constituted of the succession of vehicle presence detection pulses. The time series contains beginnings and ends of the successive pulses, which are separated by so-called time headways. An area sensor will generate simultaneously many of such time series for fixed areas or locations.
  • a black box model can be defined as follows.
  • a black box is a module with known inputs, known outputs, a known function but with an unknown internal mechanism.
  • a black box module a) acts predictably, b) can be used without knowledge of its internal details, c) hides information from the rest of the system.
  • a black box is defined by "what" it does and not by “how” it does it.
  • For building a black box model on has to start from measurements of the behavior of the system and the related external influences (inputs to the system) and try to determine a mathematical relation between them without going into the details of what is actually happening inside the system.
  • Atomic information refers to refers to a minimal amount of information which makes it possible to distinguish two particular items. Atomic information is mono valuate and does refer to only one thing. Failure to capture atomic information means inaccurate, unreliable or missing data.
  • a location is a generally rectangular area on the lane surface of approx. 1m x 2m of The selection of these positions serve different needs, for example a) provide a trigger or parameter on which a vehicle actuated traffic management system is based for extending the current green time or for giving green to a particular approach; b) measure queue length; etc.
  • HMM is defined as follows:
  • HMM Hidden Markov Model
  • Bayesian network (or - other naming of the same: Bayes network, belief network, Bayes(ian) model, probabilistic directed acyclic graphical model) is a probabilistic graphical model (a type of statistical model) that represents a set of random variables and their conditional dependencies via a directed acyclic graph.
  • a Bayesian network could represent the probabilistic relationships between diseases and symptoms. Given symptoms, the network can be used to compute the probabilities of the presence of various diseases (Source: Wikipedia).
  • Bayesian nets are a network-based framework for representing and analyzing models involving uncertainty. They are different from other knowledge-based systems concepts because uncertainty is handled in mathematically rigorous yet efficient and simple way.
  • the general probabilistic inference problem here is in fact to find the probability of an event given a set of evidences. This can be done in Bayesian nets with sequential applications of Bayes Theorem.
  • Support vector machines are concepts used for classification. They belong to the class of supervised learning models with associated learning algorithms that analyze data and recognize patterns. The fundamental SVM takes a set of input data and predicts, for each given input, which of two possible classes forms the output. Thus, SVM is a non-probabilistic binary linear classifier.
  • an SVM training algorithm Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that assigns new examples into one category or the other.
  • An SVM model is a representation of provided examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on.
  • SVMs can efficiently perform non-linear classification using what is called a kernel function, and thereby implicitly mapping their inputs into high-dimensional feature spaces.
  • Fuzzy logic is an approach to computing based on "degrees of truth” rather than the usual “true or false” (1 or 0) Boolean logic on which the modern computer is based. Fuzzy logic includes 0 and 1 as extreme cases of truth (or “the state of matters” or “fact”) but also includes the various states of truth in between (i.e. very unlikely' (gray areas of probability)). Software based on application of fuzzy-logic (as compared with that based on Formal Logic) allows computers to mimic human reasoning more closely, so that decisions can be made with incomplete or uncertain data.
  • a fuzzy inference system is a system that uses fuzzy set theory to map inputs (features in the case of fuzzy classification) to outputs (classes in the case of fuzzy classification).
  • a fuzzy inference is an extended form of formal inferences that enables qualitative and/or quantitative evaluation of the degree of confidential level for a given causality on the basis of fuzzy expressions.
  • the presence detection of a vehicle at a fixed location is generally expressed at the detector/sensor output in form of a quadratic pulse in time.
  • the width of this quadratic pulse does generally correlate with the speed of the vehicle which presence is detected.
  • a fast vehicle will generate a pulse with a smaller width; a long vehicle like a truck will result in a longer pulse width.
  • the generated 'presence pulse' (from a given sensor/detector) is named "weak signal” if its width is too small/short and thereby does significantly deviates from what should be expected normally (i.e. according to the current context related to maximum speed, current speed and average vehicle length).
  • pulse breakup if instead of one pulse, two consecutive pulses are generated whereby one or both of the following situations are additionally true:
  • the generated 'presence pulse' or rather 'pulse-series' is named "chattering" if instead of one pulse, multiple (more than three) consecutive pulses are generated whereby both of the following situations are additionally true:
  • the generated 'presence pulse' (from a given sensor/detector) is named “splash-over" if both of the following are true:
  • the sensor/detector does generate a single presence pulse for 2 or more consecutive vehicles passing by.
  • the resulting pulse length will be higher than normal and an undercounting of the effective number of vehicles will be a further consequence. This generally happens when the headways between slowly moving vehicles is relatively small, as it is for example the case during traffic jams.
  • Rough set theory is a relatively new mathematical tool for imperfect data analysis.
  • the foundation of the rough set philosophy is the assumption that to every object of the universe of discourse some information (data, knowledge) is associated. Objects characterized by the same information are indiscernible (similar) in view of the available information about them.
  • the indiscernibility relation generated in this way is the mathematical basis of rough set theory.
  • Rough set theory can be approached as an extension of the classical set theory, for use when representing incomplete knowledge.
  • Rough sets can be considered as sets with fuzzy boundaries; that is, sets that cannot be precisely characterized using the available set of attributes.
  • the basic concept of the rough set theory is the notion of approximation space.
  • the signal considered is the one formed by the succession of the different consecutive presence pulses over time over a given location at a cross-section.
  • Traffic waves also called stop waves or traffic shocks, are travelling disturbances in the distribution of cars on a highway. Traffic waves usually travel backwards in relation to the motion of the cars themselves, or "upstream”.
  • the shock wave pattern is a particular distribution of both consecutive pulse-lengths and headways that do reflect the existence of a traffic wave in the underlying traffic stream.
  • a method for filtering a time series data is a tool to clean as much noise from it as possible as the data should be made compatible for further analysis.
  • T t ⁇ 0 + ⁇ 1 ⁇ t + ⁇ 2 ⁇ t 2 + ... + ⁇ p ⁇ t p
  • p is a positive integer.
  • the coefficients ⁇ i are obtained from a regression process over the underlying time series.
  • a sensor's sensitivity indicates how much the sensor's output changes when the measured quantity changes. For many sensor systems, the possibility exists of varying or setting the sensitivity, either manually or automatically.
  • a high measurement time will result in a lower measurement update frequency.
  • traffic sensing microscopic data refer to single-vehicle related data. In this case here, these will be the time series of individual vehicle presence pulses generated by the traffic detectors.
  • macroscopic data refer to an aggregation of single-vehicle related data. In this case here, these will be the time series of aggregated numbers of individual vehicle presences per time unit, for example per hour.
  • the macroscopic quantities are the flow rate (the number of cars passing a cross section per unit of time, q), the density (the number of cars per unit of distance, k), and the space mean speed.
  • characteristics of individual vehicles can be obtained for cross sections. Then the characteristics of the distributions of the measured data can be examined (mesoscopic traffic flow characteristics) which also differ by flow condition. For example, the distributions of time headways are well studied because they can, for instance, be used to estimate capacity.
  • a pattern is a particular signature that is expressed by 'elementary features' of an object.
  • the object is the time series of consecutive detector pulse signals.
  • the 'elementary features' are for example a part or all microscopic data attributes or a combination of them.
  • a pattern is defined by the common denominator among the multiple instances of an entity.
  • the pattern of relevance here are for example the different faulty signal types.
  • detector systems do offer the possibility of setting, manually or dynamically/automatically two key parameters of the sensor system or their related equivalent: sensitivity, measurement time, and eventually some more, depending on the specific sensor system.
  • Tuning refers to setting (manually or dynamically or a recommendation) these key parameters in ranges that put the sensor systems in the best sensing conditions resulting in the best sensor performance and consequently in the lowest error/faults rate.
  • the classification error is a measure of the misclassification rate or proportions.
  • a measurement error is the difference between the true value of a measurement and the value obtained during the measurement process.
  • a classification error is a type of measurement error by which the respondent does not provide a true response to a survey item. This can occur in one of two ways: a false negative response or a false positive response. A 'false negative response' corresponds to when the measurement system indicates that an event did not occur although it did.
  • a sensor reliability data model that is trained offline to gather all significant characteristics of the sensor component, the physical object in the road surface/concrete, environmental influences, all this for enabling a reliable assessment of the data quality of the single sensor.
  • the topological information including lane direction, signaling and the relative position of the sensors to each other at a complete intersection, mutual interferences are also trained in the sensor reliability data model.
  • the resulting model or rather the set of models offers for the first time the possibility of monitoring (in real- or quasi real-time and proactively) both the current state and the evolution of sensor data quality especially for sensors delivering a digital pulse stream caused induced by the presence of vehicles in traffic.

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Traffic Control Systems (AREA)

Abstract

The present invention relates to a method, apparatus and a computer program product for quality determination in data acquisition. The present invention includes a signal processing apparatus and method for processing at least one detection signal, the apparatus comprising a receiver configured to receive the at least one detection signal from at least one sensor, a model generator configured to generate a model of signal sequences based on previously received detection signals while taking into consideration a topology of the sensors and/or environmental condition in the vicinity of the at least one sensor, comparing means configured to compare the at least one detection signal with the model of signal sequences generated by the model generator, and a prediction means configured to predict quality of the at least one detection signal based on a comparison of the at least one detection signal with the model generated by the model generator.

Description

    Field of the invention
  • The present invention relates to quality determination in data acquisition. In particular, the present invention relates to a method, an apparatus and a computer program product for quality determination in data acquisition.
  • Background
  • Inductive loops and other well-known data suppliers like video cameras with virtual loops, infrared sensors, radar sensors and the like are called traffic detectors or sensors, respectively, and are especially used in traffic control systems for delivering a good picture of the traffic situation based on the cross section and lane-related measurement principle of traffic parameters.
  • Ideally, each vehicle driving over/through these physical/virtual sections on given lanes is recognized and causes the electronic detector component to generate a pulse for each vehicle presence in form of digital information.
  • For example, inductive loop detectors generate digital information out of the analogue signal response. The analogue and digital circuits of the detector component together with the connected loop-wire located in the road concrete/bitume may also be capable of using special parameters as measurement mechanism.
  • The obtained digital information is provided to the traffic control software within the intersection traffic controller and to any traffic monitoring system.
  • The operability and data quality of the measurement system has been so far (in the state-of-the-art) solely roughly determined through a simple evaluation considering aggregated or interval based vehicle counts on a higher system level. Some types of traffic detectors provide also a binary good/bad statement about their overall functionality.
  • An essential prerequisite for being able to cope with the increasing traffic volume and the resulting challenging traffic situations in terms of traffic control quality is a high data quality at the data acquisition level and it is further much more important to be informed constantly about the trend of the data quality short-, middle- and long-term. The evaluation and constant monitoring of both the reliability and the data quality of the data acquisition infrastructure in operation are playing a prominent role in modern traffic management concepts.
  • Normally, an eventually decreasing quality of data acquisition systems takes place slowly and gradually. This cannot be detected appropriately by existing traffic control systems and leads continuously to a deterioration of the traffic actuated control algorithm both in traffic controllers and traffic management systems at the strategic decision-making level. These decisive factors (related to sensor quality deterioration) are hardly ever taken into account in present traffic management solutions.
  • Summary of the invention
  • In view of the above, it is an object of the present invention to provide a method, an apparatus and a computer program product in order to overcome such problems.
  • According to the present invention, this problem is solved by picking up the digital pulse information directly at the sensor output. A sensor reliability data model is trained offline to gather all significant characteristics of the sensor component, the physical object in the road surface/concrete, environmental influences, all this for enabling a reliable assessment of the data quality of the single sensor. Considering the topological information including lane direction, signaling and the relative position of the sensors to each other at a complete intersection, mutual interferences are also trained in the sensor reliability data model.
  • The resulting model or rather the set of models offers for the first time the possibility of monitoring (in real- or quasi real-time and proactively) both the current state and the evolution of sensor data quality especially for sensors delivering a digital pulse stream caused induced by the presence of vehicles in traffic.
  • The present invention presents a global and comprehensive concept for a robust "traffic detectors' quality management". The overall concept relates to all traffic sensors types, independently of whether they are point-sensors or area-sensors. Examples of sensors types are: loop detectors systems, video-based systems, radar-based systems (all forms of radars are meant: radio-wave based; infrared-light-based, light-based, sound-based), and (earth) magnet fields based systems.
  • According to an aspect of the present invention there is provided a signal processing apparatus for processing at least one detection signal, the apparatus comprising:
    • a receiver configured to receive the at least one detection signal from at least one sensor,
    • a model generator configured to generate a model of signal sequences based on previously received detection signals while taking into consideration a topology of the sensors and/or environmental condition in the vicinity of the at least one sensor,
    • comparing means configured to compare the at least one detection signal with the model of signal sequences generated by the model generator; and
    • a prediction means configured to predict quality of the at least one detection signal based on a comparison of the at least one detection signal with the model generated by the model generator.
  • According to further refinements of the present invention as set out under the above aspects,
    • the apparatus further comprises
    • determining means configured to determine whether or not the at least one detection signal comprises an error, and
    a classification means configured to classify the error into a plurality of predetermined error classes, if it is determined that the detection signal comprises an error,
    • correcting means configured to correct the error, if it is determined that the detection signal comprises an error,
    • storing means configured to store a history of the detection signal, and
      prediction means configured to generate a detection signal error rate prediction based on the stored history;
    • the prediction is at least one of a short term prediction, a middle term prediction, and a long term prediction;
    • the apparatus further comprises determining means configured to determine settings of the sensor based on the detection signal error rate prediction such that the detection signal error rate becomes minimum; and
    • the quality of the detection signal includes at least one of an overall error rate of the detection signal, and an error rate of each of the error classes of the detection signal.
  • According to another aspect of the present invention there is provided a method for processing at least one detection signal, the method comprising:
    • receiving the at least one detection signal from at least one sensor,
    • generating a model of signal sequences based on previously received detection signals while taking into consideration a topology of the sensors and/or environmental condition in the vicinity of the at least one sensor,
    • comparing the at least one detection signal with the model of signal sequences generated by the model generator; and
    • predicting quality of the at least one detection signal based on a comparison of the at least one detection signal with the model generated by the model generator.
  • According to further refinements of the present invention as set out under the above aspects, the method further comprises
    • determining whether or not the at least one detection signal comprises an error, and
      classifying the error into a plurality of predetermined error classes, if it is determined that the detection signal comprises an error;
    • correcting the error, if it is determined that the detection signal comprises an error;
    • storing a history of the detection signal, and
      generating a detection signal error rate prediction based on the stored history;
    • the prediction is at least one of a short term prediction, a middle term prediction, and a long term prediction;
    • the method further comprises determining settings of the sensor based on the detection signal error rate prediction such that the detection signal error rate becomes minimum;
    • the quality of the detection signal includes at least one of an overall error rate of the detection signal, and an error rate of each of the error classes of the detection signal;
  • According to an exemplary aspect of the present invention, there is provided a computer program product comprising computer-executable computer program code which, when the program is run on a computer (e.g. a computer of an apparatus according to any one of the aforementioned apparatus-related exemplary aspects of the present invention), is arranged to cause the computer to carry out the method according to any one of the aforementioned method-related exemplary aspects of the present invention.
  • Such computer program product may comprise or be embodied as a (tangible) computer-readable (storage) medium or the like on which the computer-executable computer program code is stored, and/or the program may be directly loadable into an internal memory of the computer or a processor thereof.
  • Advantageous further developments or modifications of the aforementioned exemplary aspects of the present invention are derivable from the following detailed description.
  • Brief Description of the Drawings
  • For a more complete understanding of exemplary embodiments of the present invention, reference is now made to the following description taken in connection with the accompanying drawings in which:
    • Fig. 1 is an overview of an exemplary application of the assessment and prediction of the data quality of inductive loop systems according to certain embodiments of the present invention;
    • Fig. 2 is an overview illustrating the traffic control module shown in Fig. 1;
    • Fig. 3 is a diagram illustrating the reliability physics of a given traffic sensor system type according to certain embodiments of the present invention;
    • Fig. 4 is a diagram illustrating an internal parameter tuning model concept according to certain embodiments of the present invention;
    • Fig. 5 is a diagram illustrating the schematic overall concept of the invention according to certain embodiments.
    Description of exemplary embodiments of the invention
  • Exemplary aspects of the present invention will be described herein below. More specifically, exemplary aspects of the present are described hereinafter with reference to particular non-limiting examples and to what are presently considered to be conceivable embodiments of the present invention. A person skilled in the art will appreciate that the invention is by no means limited to these examples, and may be more broadly applied. Further, it is noted that each of the below described aspects or embodiments can be arbitrarily combined.
  • Fig. 1 is an overview of a scenario to which the present invention is applicable. In particular, Fig. 1 shows an example of a crossroad that is equipped with various traffic lights and sensors. In Fig. 1, reference signs TS1 to TS4 indicate traffic signal lights and reference signs D1 to D10 indicate inductive loop detectors. Further, reference sign TC denotes a traffic controller for traffic light signaling including a traffic control module 10. As shown in Fig. 1, analogue signal from the various inductive loop detectors are input into the traffic control module 10.
  • Fig. 2 is a block diagram illustrating the traffic control module 10 shown in Fig. 1. The traffic control module 10 comprises a detection module 11, a RoSiT (Robust Sensors in Traffic) module 20 and traffic programming module 12. As shown in Fig. 2, analogue signals from the inductive loop detectors D1 to D10 are input to the detection module 11 and the detection module 11 outputs digitized signals to the RoSiT module 20. Further, the RoSiT module 20 outputs an indication regarding loop data quality to the traffic programming module 12.
  • The RoSiT module 20 comprises a pattern recognition module 21, an environmental influences module 22, an error classification module 23, a quality assessment and forecast module 24, an error correction module 25, a detector parameter recommendation module 26, and a truck detection module 27. The pattern recognition module 21, the environmental influences module 22, the error classification module 23, the error correction module 25, the detector parameter recommendation module 26, and the truck detection module 27 each input data to the quality assessment and forecast module 24, which outputs loop data quality information to the traffic programmers module 12.
  • In short, the present invention comprises the following aspects. Each of the below aspects will be described in more details later on.
  • [Aspect 1]
  • The first aspect concerns the from and granularity of the detector's signal raw data observation level. That is, according to a first aspect, solely microscopic detector data is used for quality management.
  • [Aspect 2]
  • According to a second aspect, out of the microscopic data, and while considering weather data and sensor topology information we do extract pattern for detector quality assessment and management.
  • Then, it is determined whether the sensor data contain errors or not. That is, based on extracted pattern, it is assessed whether the data contain errors or not.
  • [Aspect 3]
  • A third aspect deals with the issue of how to differentiate and classify the error types observed. That is, according to the third aspect, in case of errors/faults presence, these faults are classified. In this regard, it is proposed to robustly classify and identify OCCURING error/faults types, the error types depending on the underlying sensor type.
  • [Aspect 4]
  • According to a fourth aspect, the type of vehicle observed is classified. That is, from the pattern collected out of non-erroneous data, the presence of either a car or a truck is classified (this classification is independent of the sensor type).
  • [Aspect 5]
  • A fifth aspect concerns the issue of how to perform erroneous detector data correction, whenever possible. According to the fifth aspect, while involving sensor type related error types knowledge, sensor topology, environmental information, and observed error types, the errors are robustly corrected and a cleaned data stream is generated.
  • [Aspect 6]
  • The sixth aspect relates to how to build a black box model of the error rate related detector's behavior in dependence of both environmental information (weather and temperature) and traffic volume.
  • [Aspect 7]
  • According to the seventh aspect, it is described how to perform short, middle and long term detector error rate performance prediction while involving the black-box model. That is, based on archived pattern history and environmental conditions information we do predict detector faults rate for short, middle and long term. This does also give indication about the remaining detector life time.
  • [Aspect 8]
  • An eighth aspect deals with how to either automatically tune or recommend contextually appropriate detector 'sensitivity' values and/or 'measurement time' values and/or other equivalent or relevant settings in order to minimize detector's error rate performance.
  • Based on detector faults rate behavior, a detector tuning recommendation (that takes environmental information and a detector reliability physics model into consideration) is calculated and provided. The recommendation concerns measurement time and/or sensitivity level, and/or other equivalent detector settings.
  • The processing related to all of the above described 8 aspects is performed in real-time and does reach a detection rate above 96% according to very extensive tests that did involve real field data.
  • In the following, the aspects of the present invention will be described in more detail.
  • [Aspect 1]
  • Concerning the granularity of the observation of the raw data originating from the detector, solely the microscopic level is considered. With "microscopic level", it is referred to the level at which the detector raw data have been transformed into an information about presence or non-presence of a vehicle on the road. Thus, the atomic information (or atomic data stream) considered in the present invention and on which all further processing steps are based is that about
    • a) vehicle presence within detector's range or region of interest, and
    • b) relation to the respective timings, i.e. "star" and "end" of each vehicle presence.
  • No further form of aggregation is considered, contrarily to approaches in the related state-of-the-art, which mainly rely on aggregated data.
  • Therefore, the first step in the global detectors quality management is to transform the detector raw data into an "atomic data stream". For (rare) systems that do not explicitly generate an "atomic data stream" a pre-processing module for generating/guessing/synthesizing the related 'atomic data stream' can be designed. Such a module to be calibrated once by real sample observations may eventually contain some form of stochastic intelligence.
  • [Aspect 2]
  • The atomic data stream is to be observed and analyzed in order to determine whether it contains errors or not. In this point, the overall methodology of this determination is presented.
  • There is involved a pattern recognition concept that involves the following considerations: sensor type, sensor network topology, weather information and traffic volume information. Hidden markov models, bayesian networks and support vector machines are involved in the overall process.
  • The output of this processing is information-related to the presence or not of errors in the observed atomic data stream. Either a single cross-section or a set of consecutive cross-sections is/are considered simultaneously depending on their logical interdependency as determined/fixed by the underlying sensor network topology.
  • In the following, a detailed specific example will be given. However, the present invention is not limited to the specific example.
  • Offline process Phase 1
    • 1. Errorless reference data Stream collection or generation
    • 2. Logical-topological correlation between neighbouring loops and sections + weather & environment information
  • A reference clean or errorless data stream of appropriate significant length is collected or generated. This corresponds to an observation from typical portions from several typical hours up to days.
  • Besides, all logical and topological correlation between neighboring loops and sections are recorded for a set of reference topologies or sensor network configurations.
  • This is done once for a given sensor type for different weather conditions.
  • Offline process Phase 2
    1. 1. Transformation of reference streams into a set of images/pictures of different sizes while considering the logical-topological information
    2. 2. Transformation of the differently sized various images of (1) using Radon transformation (*)
    3. 3. Features extraction using a cellular neural network (CNN) processor (**)
    4. 4. Use a discrete-time CNN processor (***) or PCA (4*) to create reference classes for the various images sizes
  • The collected reference data stream is transformed into various images of different sizes (using sliding windows of different sizes) while involving the logical-topological information.
  • The different images are transformed though nonlinear processing (radon transformation and CNN processing) out of which a set of reference classes is determined and saved.
  • This is done once for any given sensor type.
    • (*): Reference to radon transformation: http://www.tmrfindia.org/ijcsa/v7i34.pdf, last access Sept 14th, 2012
    • (**): Example of CNN based feature extraction, see: http://www.itk.ppke.hu/Szlavik_diss_ENG.pdf, last access Sept 14th, 2012
    • (***): Example of a discrete time CNN classifier: http://www.ieice.org/proceedings/NOLTA2005/HTMLS/paper/7141.pdf
    • (4*): Example of PCA based classifier: http://www.tmrfindia.org/ijcsa/v7i34.pdf, last access Sept 14th, 2012
    Online Phase 3
    • 1. Online data stream collection
    • 2. Remember the current logical-topological correlation between neighbouring loops and sections (from phase 1)
    • 3. Remember the results of all processes of Phase 2
    Online Phase 4
    • 1. Transformation of ONLINE streams into a set of images/pictures of different sizes while considering the logical-topological information
    • 2. Transformation of the differently sized various images of (1/Phase 4) using Radon transformation
    • 3. Features extraction using a cellular neural network (CNN) processor system
    • 4. Use a discrete-time CNN processor system or PCA to classify and compare with the REFERENCE CLASSES of Phase 2
  • All processes of Phase 2 are done, but now on real-time/online collected data streams.
  • The discrete time CNN classification allows a comparison with the references classes.
 IF
        all online classes fit to the reference classes,
        THEN
         the data stream is error free;
         OTHERWISE
          There is one or more errors in certain identified
          portions of the data stream.
          The portions containing errors are kept in view
          for further processing.
[Aspect 3]
  • Portions of the "atomic data stream" containing errors are identified in and marked as such. In this point, marked atomic data stream portions are processed and errors pattern are differentiated and then clustered. A semi-supervised labeling involving for example rough-set theory is then performed offline by a human expert. This labeling depends on the underlying sensor type and is done once. For loop detectors, for example and just for illustrative purposes, the labeling will differentiate 6 error classes: weak-signal, pulse break-up, chattering, splash-over, over-counting and under-counting. The error classes are of course not limited to the above mentioned 6 error classes.
  • For other sensor types, the labeling will depend on their inherent physics and on the sensor network topological information.
  • The following concepts are involved in the error type classification: signal energy, statistical analysis, support vector machine and rough-set theory.
  • In the following, a detailed specific example will be given. However, the present invention is not limited to the specific example.
  • Offline Phase 1
    • 1. Record and/or generate a set of atomic data streams of different lengths and containing all possible known types of errors. Using "Aspect 2" to observe real traffic can help recordings from the field
    • 2. Consider thereby different logical-topological correlations between neighbouring loops and sections + weather and environment information
    • 3. Use, once, a semi-supervised labeling of the different error types by a human expert. The possible error types depend on the current sensor type (or sensor types combinations)
    • 4. Extract the signatures of all identified error types by using SVM or better a CNN processing. An alternative is a statistical measure over sliding windows.
    • 5. Save/record the extracted signatures for use online.
  • The offline phase 1 is used to record reference "error containing data streams or data stream pairs" of different lengths while taking the following into consideration: reference logical-topological contexts, weather and environmental conditions and sensor type or sensor types combinations.
  • A clustering of the different error types is performed by involving for example rough-set theory.
  • A human expert is used once in order to label the different types/classes of errors observed/identified/known. The latter is done once for every sensor type or sensors type combinations.
  • The reference data streams being known for each labeled error types, corresponding signatures are fixed/extracted by using either a nonlinear feature extractor like CNN or by a statistical measure performed over sliding windows.
  • The signatures related to each error type are saved for further use in the online phase.
  • This OFFLINE phase is performed once and the output is the set of signatures for all relevant error types for a given sensor type or sensor types combination. These signatures will then be used in phase 2 (ONLINE) in order to detect/ classify errors (types).
  • Online Phase 2
    1. 1. Identify the error containing sets of atomic data streams detected in (Aspect 2/Phase 4)
    2. 2. If not yet done, extract relevant features by involving preferably CNN or some other appropriate feature extractor
    3. 3. Use a robust classifier (e.g. CNN based or SVM baser or other) to detect/identify the underlying error type
    4. 4. Alternative to (3):
      • After the error signatures are known, this error type classifier can be run directly on the online data stream obtained after Aspect 2/Phase 2/(1)
  • The phase 2 is an online process. It assumes that the signatures of all possible errors are known (while considering related logical-topological and weather + environment information).
  • Thus, a robust classifier (CNN based, SVM based, or other) is used to detect the underlying error within a given data-stream portion.
  • If the data stream portion has been provided by Aspect 2/Phase 4, the related features have been already extracted by Aspect 2 and can be provided to Aspect 3/Phase 2.
  • An alternative is, provided Phase 1 of Aspect 3 has been done once and thus has already provided the signatures of all possible errors, to skip all steps of Aspect 2 coming after Aspect 2/Phase2/(1). The online data streams of different lengths would then be collected from here and processed for error classification. In this case Aspect 2 is merely de facto skipped and considered or integrated as an offline part of Aspect 3.
  • This Online phase 2 can be directly performed on the online data-stream while skipping most of the online phase (Phase 2) of Aspect 2 as explained in the note "Alternative to (3)".
  • [Aspect 4]
  • Clean portions of the "atomic data stream" (i.e. not containing errors), originated from a single detector system, are processed in order to classify the type of vehicle observed. This classification relates to vehicle length and respective headways as sole differentiating features. Thus, it is mainly differentiated between trucks and cars. Further differentiations are possible so far they are related to vehicle length.
  • The main classification is based on the features sets constituted by the presence data pulse lengths and the related distribution of the respective headways. A shock wave pattern is then extracted and involved in a support vector machine based classification.
  • The classification output is whether the vehicle is a car or a truck, and whether it has been in movement or stopping, the latter for example due to red traffic light phase.
  • In the following, a detailed specific example will be given, however, the present invention is not limited to the specific example.
  • Offline Phase 1
    • 1. Select explicitly or generate, similar to Aspect 2/Phase 1, error free reference data streams of different lengths, where, through human supervision, one knows there are trucks inside those streams.
    • 2. Different contexts should be considered hereby. Additionally to those of Aspect 2, the consideration of the presence or not of stops at red traffic lights and that of different traffic conditions is mandatory
    • 3. Then apply the processes described in Phase 2 of Aspect 2 and obtain the various signatures of the truck's presence in data streams of various lengths
    • 4. Save the obtained signatures for use in the online truck detection of Phase 2
  • This phase 1 is an offline process. It is done once and intends to produce the signatures of a truck presence within the atomic data streams while taking related context information into consideration. The reference data streams involved here must be error free.
  • The obtained various signatures of the truck's presence in a traffic stream are then saved for use in the online truck detection process.
  • Since sliding windows are used, the exact position of the truck within the stream will also be determined. That is, the presence pulse corresponding to a truck will be explicitly identified. The signatures are formed accordingly.
  • This offline phase results in a set of signatures indicating the presence and the position of a truck within a traffic stream from different perspectives.
  • Online Phase 2
    1. 1. Extract current online data streams of different lengths. These atomic data streams should be error free.
    2. 2. Extract relevant features by involving an appropriate nonlinear feature extractor, preferably CNN
    3. 3. Then use a classify involving the different truck signatures obtained in Aspect 4/Phase 1. Any classifier can be used, but preferably a CNN based one
    4. 4. The result will be:
      • a. That the data stream contains a truck or not
      • b. In the positive case, the position of the truck
  • This phase 2 is an online process.
  • The online data streams are assumed to be error free. If they are not error free, the stream should be taken after error correction or the portions containing errors should be skipped for the truck detection.
  • After
    • a) the selection of appropriate data streams,
    • b) the related features extractions (preferably through CNN), and
    • c) a classification involving the pre-stored signatures obtained from Phase 1 (of Aspect 4),
  • it will be assessed whether a given data stream contain a truck of not. In the positive case, even the position will be determined. Thus, the exact presence pulse corresponding to a truck will be identified. A CNN based classifier is preferred due to the high performance. Other classifiers can however also been used.
  • This online phase detects the presence of a truck within a traffic stream and gives also the exact position. The atomic 'presence pulse' corresponding to a truck will be identified.
  • This can be done in real-time within less than a second and can thus be considered, amongst others, in highly precise adaptive future traffic control systems
  • [Aspect 5]
  • Error correction is based on a reasoning based on some prior knowledge formulated logically by a human expert and that depends on error classes related to a given detector type and on fixed sensor network topological configurations. The topological information is related to how detectors lay as a group in a common cross-section or are situated with a given spatial separation within a direct longitudinal neighborhood.
  • The prior knowledge that expresses the essence of the rules for the real-time, logic-programming-based reasoning correction is based on a plausibility matrix fixing the possible either simultaneous or consecutive error occurrences depending on the underlying topology and while considering the detector-type related variety of error classes.
  • The output of the correction process is, for each given error (detected by the concept of Aspect 3), that:
    1. a) whether a correction is possible or not;
    2. b) in cases a correction is possible, a corrected/clean "atomic data stream" is generated
  • In the following, a detailed specific example will be given; however, the present invention is not limited to the specific example.
  • Offline Phase 1
    • 1. The error correction is based on:
      • a) The logical-topological information (see Aspect 2)
      • b) The different error types identified in Aspect 3 and especially how they correlate with the logical-topological information
    • 2. Based on (1) of Aspect 5 and on information gathered from Aspect 2 and Aspect 3, a human expert will define logic-programming rules that will determine:
      • a) If a detected error can be SURELY corrected
      • b) In the positive case (i.e. a correction is possible), how it should be surely corrected
      • c) In the negative case:
        • i) Formulate a default recommendation (some examples for illustration: e.g.: delete the pulse(s);
          or set erroneous pulse to normal;
          or replace erroneous long pulse by 2 normal pulses, etc.)
        • ii) Or state that even a default recommendation is not possible due to lack of information
    • 3. The defined logic-programming rules will be integrated in a logic-programming REASONER to be used online for performing error corrections
  • This phase 1 is an offline process.
  • In essence, knowledge from Aspect 2 and Aspect 3 is used by a human expert to formulate logic-programming rules that are then integrated in a logic-programming reasoner to be used in real-time for error correction.
  • The reasoner will involve an appropriate ontology that includes the context information indicated in Aspect 2: logical-topology, weather and environment, data streams lengths, error types, etc. The ontology and the reasoner will be both designed during this offline phase.
  • Due to real-time constraints a solver based on Answer Set Programming (ASP) is recommended, although any other solver can be used.
  • How probable each default recommendation is, is determined by extensive offline observation tests.
  • This OFFLINE phase will result in a reasoner to be used for online corrections and/or default recommendations for cases a SURE correction is not possible
  • Online Phase 2
    • 1. Get the error containing atomic data stream portions (see Aspect 3)
    • 2. Use the REASONER designed in Phase 1 of Aspect 5 to determine:
      1. a) If a the detected error can be SURELY corrected
      2. b) In the positive case (i.e. a correction is possible), the reasoner will corrected it
      3. c) In the negative case:
        • i) Depending on current context information, the reasoner may formulate a default recommendation and assign it a probability value, which was obtained in Phase (1) of Aspect 5
        • ii) Or the reasoner will just state that even a default recommendation is not possible due to lack of information
    • 3. A new corrected data stream may be generated or the old one may be maintained but just augmented with the reasoner's output obtained in (2) of Aspect 5/Phase2. This means that related labels will be added to the erroneous portions. This may be of relevance for those functions using the sensor data upwards.
  • This phase 2 is an online process.
  • In essence the reasoner built in Phase 1 of Aspect 5 will be used online on error containing atomic data stream portions.
  • The reasoner will determine whether the error can be corrected. If yes, it will correct it. If not it will either state a default recommendation or state that neither default recommendation nor correction is possible due to lack of information.
  • Each default recommendation should be endued with a probability value. This latter value is obtained offline as explained in Phase (1) of Aspect 5.
  • The reasoner will preferably involve a solver based on ASP (Answer Set Programming) to ensure a fast speed under real-time constraints.
  • This Online phase involves the use of an ultrafast logic-programming reasoner for online error corrections and/or for default recommendations for cases a sure correction is not possible.
  • [Aspect 6]
  • A black-box model of the detector's performance is constructed with the objective of probabilistically describing the error rate related detector's behavior. In the essence, the intention is to determine the detector's reliability physics in dependence of both internal and external parameters.
  • Internal detector parameters are different settings determining and/or related to its core functioning principle. External parameters are traffic volume, detector's network topology, weather and environmental related ones.
  • To construct the detector's black-box model a combination of the following probabilistic instruments is involved: structural equation modeling, bayesian (belief) networks, and hidden markov movels.
  • In the following, a detailed specific example will be given; however, the present invention is not limited to the specific example.
    • 1. Collect a significant amount of real historic field data (containing also classified faults positions) according to parameters of Fig. 3, i.e. related inputs and outputs
    • 2. Use an appropriate statistical method to asses which of the inputs do have a significant impact on the error/fault rates of different types. Example of method: SEM (structural equation modeling), etc.
    • 3. Suppress from the model of Fig. 3 those input parameters with meaningless impact (according to results of (2)) on the rates of the outputs
    • 4. Use now an appropriate stochastic modeling instrument to build the probability reliability physics model involving the remaining input parameters of Fig. 3. This black box model building (training) involves both historic data (see (1) and the findings from (2). Examples of appropriate methods: bayesian networks, fuzzy logic, neural networks, (hidden) markov models, etc.
    • 5. The obtained black box model is saved for use latter in either occasional or online dynamic/automatic optimal setting of the sensor systems in dependence of current external and internal conditions (i.e. the remaining significant inputs of Fig. 3.)
    • 6. This process must be done once for every sensor system type, e.g.: inductive loops, video, radar, etc.
  • In essence, this is an offline process aiming at assessing the sensor's reliability physics in dependence of both internal and external parameters.
  • Potential/candidate internal parameters are for example: sensor sensitivity level, measurement time level, etc. These parameters may differ depending on the specific sensor system type.
  • Potential/candidates for external parameters are: weather, temperature, traffic level, proportion of trucks in the traffic, etc.
  • This offline process results into a sensor system type specific reliability physics black box model. This model will be used later for online detector tuning in dependance of both current internal and external parameter conditions
  • [Aspect 7]
  • A robust detectors' quality management requires the capability to predict the detector's reliability in three time horizons: short term (range of some minutes up to one hour); middle term (range of some hours, up to some days, and up to a week); long term (range of several weeks, up to several months, and up to some years).
  • The short term prediction is of relevance for managing the impact of detector quality on traffic management. The middle term prediction is mainly of relevance for short term traffic infrastructure maintenance activities. And the long term prediction is mainly of relevance to predict the operationally acceptable lifetime of a given detector or detector group.
  • Overall, the prediction does involve the prior knowledge that is compressed in the black-box model obtained in Aspect 6. Further, for both short-term and middle term detector's-performance prediction, a concept involving mainly hidden markov models and the black-box model is used. For both short and middle term, all external parameters are either known or can be easily predicted (e.g. traffic volume) or obtained from external sources (e.g. weather information).
  • For a long-term prediction, however, external parameters are hardly known. Thus, here another prediction approach is used. It does rather predict an envelope that fixes with a high reliability, as proven by extensive tests involving real data, the upper-bound of the error-rates' detector performance. Thus, the real performance lies below the envelope constituted by the upper-bound. The available detector's data history to be involved in the long-term prediction should be at least two times longer than the future time-frame ahead for prediction. For example, to predict for the next three weeks (or months), the available observed history should be longer than 6 weeks (or months). Thus, the appropriate level of data aggregation should be selected in a way that the available history should satisfy this length requirement.
  • The envelope's prediction involves two major steps: a) time series filtering through a quadratic trend estimation and a consecutive smoothing; and b) estimate a cubic trend of the later-resulting time series and use it for future prediction.
  • In the following, a detailed specific example will be given, however, the present invention is not limited to the specific example.
  • Phase 1 Short Term Prediction Step 1 Offline Short Term Prediction
    • 1. Collect a significant amount of real historic field data according to parameters of Fig. 3, i.e. related inputs and outputs
    • 2. Observe (see Aspect 6/(2) & (4)) and note how far historical data on traffic volume is associated with error rates
    • 3. Observe (see Aspect 6/(2) & (4)) and note how far historical data on weather and temperature are associated with error rates
    Step 2 Online Short Term Prediction
    • 1. Use a stochastic state space model (for instance Kalman or particle filter, etc.) to predict the traffic volume. Use the stored historical data of traffic volume to train the model
    • 2. Train a probabilistic model to model the dynamic error rate in dependence of traffic volume, weather and temperature.
    • 3. Use the trained probabilistic model (e.g. HMM) to predict the error rate using the predicted traffic volume (Online step 1) and the forecasted environmental conditions (temperature, weather)
    • 4. Future time horizon: a couple of hours: 1-4 hours in the future
  • In essence, this process is aiming at predicting the error rate in order to evaluate the SHORT TERM performance of the sensor system. The aggregation time unit is hours; one predict for the next couple of hours (1 to 4 hours). The black box reliability physics model is taken unto consideration. Thus, the short term prediction considers the history of error rates associated with the conditions (weather, temperature and traffic volume). State space models (stochastic and probabilistic) are proposed for this prediction. For instance, hidden Markov model (HMM), Bayesian network and Kalman filter.
  • The sensor system internal parameter tuning control system (see Aspect 8) will need the prediction result as one of the inputs if a tuning assessment for the corresponding time horizon is needed.
  • This online short term prediction model covers the next couple of hours. It takes the sensor system reliability black box model into consideration. State space predictions models are appropriate to implement this prediction. We do favor a HMM combined with Kalman Filter.
  • Phase 2 Middle Term Prediction Step 1 Offline Middle Term Prediction
  • 1. Collect a significant amount of real historic field data according to parameters of Fig. 3, i.e. related inputs and outputs
  • Step 2 Online Middle Term Prediction
    • 1. Estimate a logarithmic local trend for error rate historical data
    • 2. Estimate the trend error deviation
    • 3. Build a model that combines the logarithmic trend with the error deviation
    • 4. Use the estimated model to predict the upcoming error rate
    • 5. Time horizon: a couple of days or a couple of weeks; aggregation level: days or respectively weeks
  • In essence, this process is aiming at predicting the error rate in order to evaluate the MIDDLE TERM performance of the sensor. The aggregation time unit is days and weeks. According to these aggregations, the error rate time series is smoother than in short term, and the environmental conditions have due to the aggregation a relatively smaller impact on error rates.
  • The proposed middle term prediction uses a stochastic state space model (e.g. Auto Regression Moving Average Model, Local Trend Model, etc.).
  • The sensor system internal parameter tuning control system (see Aspect 8) will need the prediction result as one of the inputs if a tuning assessment for the corresponding time horizon is needed.
  • Phase 3 Long Term Prediction Step 1 Offline Long Term Prediction
    • 1. Collect a sufficient amount of real historic error rate data for a given sensor system
    Step 2 Online Long Term Prediction
    • 1. Extract from this the worst cases of error rate (i.e. all data points that are higher than the quadratic trend)
    • 2. Estimate the logarithmic trend of the worst cases from historical data constituted of the extracted points from (1)
    • 3. Predict the future of the worst case based on the logarithmic trend
    • 4. Future time horizon: 3 months and more; aggregation of data points: in days or in weeks
  • In essence, this process is aiming at LONG TERM predicting the error rate in order to estimate amongst others the lifetime of a given sensor.
  • There is a specific level of the predicted error rate for which the sensor is considered unreliable. This threshold should be sensor system type specific. If for example an average of 10% error rate (@ 500 vehicles) is no more acceptable, then it will be fixed as a threshold on which basis the potential remaining life time of a sensor can be estimated based on the long term prediction functionality.
  • A future 'time horizon' of 3 months and more is taken, for example.
  • The Long term prediction model estimates the trend of the worst cases of error rates. It involves similar performance worst cases observed in the recent related history of the historical data.
  • This online process enables a future prediction of the sensor system performance. It allows to know in advance whether the sensor system may reach performance regions that are unacceptable.
  • [Aspect 8]
  • The extensive study of the detectors' reliability physics has shown that their errors rate performance depends on both traffic and environmental conditions on one hand, and on internal detector settings on the other hand.
  • This close interdependency has been expressed within the black box model (Aspect 6).
  • The core question here is, however, the following: while knowing both environmental and traffic conditions, how to determine the appropriate internal detector settings in order to minimize the detector's error rate for a given short-term future observation window?
  • This question is equivalent to the inverse problem to the black box modeling.
  • This inverse problem is solved by involving mainly two complementary methods: bayesian networks and fuzzy logic programming.
  • Through this concept it is therefore possible to: a) formulate/determine in an offline mode appropriate recommendations for maintenance technicians about the optimal settings of internal detector's parameters depending on underlying conditions (traffic and environmental); and b) automatically, and in an online mode, set and control the detector settings through an embedded platform/scheme. The automatic adaptivity of internal detector's settings will significantly improve the detector's performance quality despite volatile/varying external underlying hard conditions.
  • In the following, a detailed specific example will be given; however, the present invention is not limited to the specific example.
  • Phase 1 Offline / Model Training
    • 1. Involve data from Aspect 6/(1)
    • 2. Take the black box model obtained in Aspect6/(5); also involve findings of Aspect 6/(2)
    • 3. Combine data from (1) and the models and findings in (2) to perform extensive stochastic evaluations and/or observations for various scenarios.
    • 4. The observations of (3) will be used to train the probabilistic black box model of Fig. 4. Possible appropriate instruments to realize/implement this black box model are: bayesian networks, (hidden) markov models, fuzzy logic controler, etc. Extensive experiences have shown that all instruments perform quite well; we do however prefer/favor the fuzzy logic controller
  • In essence, this process is a partial inverse problem of the black-box model reliability physics model of Aspect 6/(5). See Fig. 4.
  • The reliability physics model is an input used to extensively train the new sensor system internal parameter tuning control model of Fig. 4.
  • The tuning can be calculated for the current time or for a future time interval. Respective related inputs will be then needed.
  • This online sensor system tuning model controller is in essence a REASONER that builds on knowledge from the sensor system reliability physics model. It ensures an adaptivity of the sensor system to keep the error rates performance the lowest possible at any moment.
  • Explanations of key words
  • In the following, some key words and expressions used in the above description are explained for better understanding of the principles of the invention.
  • Point sensors:
  • A point sensor is typically stationed/placed at a fixed location along/on/under a roadway and watches (i.e. counts, records, classify, etc.) vehicles passing at this particular location in the time domain. The fixed location may be:
    • (a) a complete cross-section of the roadway at a given location, or
    • (b) just a single lane of the cross-section.
  • Examples of point sensors are: loop detectors, radars, ultrasonic radars, Lidars (light detection and ranging) and video cameras.
  • Area sensors or Space sensors:
    • An area sensor typically covers/watches a wide surface/area of a roadway or road junction. It is generally placed high in the sky/air over the ground surface and takes snapshots of traffic at an instant of time. It does therefore simultaneously cover/watches many cross-sections and many lanes.
  • Examples of area sensors are: aerial photography or satellite imagery, and mobile sensors such as automatic vehicle location and global positioning system.
  • Loop detectors systems:
  • The inductive-loop detector is the most utilized sensor in a traffic management system. The principal components of an inductive-loop detector are:
    • One or more turns of insulated loop wire wound in a shallow slot sawed in the pavement
    • Lead-in cable from the curbside pull box to the intersection controller cabinet
    • Detector unit: Electronics unit housed in a nearby controller cabinet
  • Cars passing over or that stop within the detection area of an inductive-loop detector do decrease the inductance of the loop. The electronics unit senses this event as a decrease in frequency and sends a pulse to the controller signifying the passage or presence of a vehicle.
  • Video-based traffic sensor systems:
  • Video-based detectors use advanced image processing schemes running on an appropriate microprocessor to analyze a video image input. Different approaches are used by video detection sensors. Some analyze the video image of a target area on the pavement. A change in the image of the target area as a vehicle passes through the target area is analyzed. Another approach does rather identify when a target vehicle enters the camera field of view and tracks the target vehicle through this field of view. Finally, other video sensors use a combination of these two approaches.
  • Radar-based detector system:
  • A radar operates according to the following principle: " The radar dish or antenna transmits pulses of radio waves or microwaves which bounce off any object in their path. The object returns a tiny part of the wave's energy to a dish or antenna which is usually located at the same site as the transmitter" (Source: http://en.wikipedia.org/wiki/Radar; latest access: Sept. 10th, 2012). Through an appropriate processing of the reflected energy the radar can be used to determine presence, range, altitude, speed and direction of targeted objects. In traffic detection, the radar is used as a point sensor which can detect both presence and speed of a vehicle in a fixed sport (or cross-section) on the road.
  • Detector's signal raw data:
  • By "detector's signal raw data" is meant the time series generated by a point detector. It is constituted of the succession of vehicle presence detection pulses. The time series contains beginnings and ends of the successive pulses, which are separated by so-called time headways. An area sensor will generate simultaneously many of such time series for fixed areas or locations.
  • Black-box model:
  • Commonly a black box model can be defined as follows. A black box is a module with known inputs, known outputs, a known function but with an unknown internal mechanism. A black box module: a) acts predictably, b) can be used without knowledge of its internal details, c) hides information from the rest of the system. Thus, a black box is defined by "what" it does and not by "how" it does it. For building a black box model on has to start from measurements of the behavior of the system and the related external influences (inputs to the system) and try to determine a mathematical relation between them without going into the details of what is actually happening inside the system.
  • Atomic information:
  • Atomic information refers to refers to a minimal amount of information which makes it possible to distinguish two particular items. Atomic information is mono valuate and does refer to only one thing. Failure to capture atomic information means inaccurate, unreliable or missing data.
  • Vehicle presence:
  • In traffic management, for the planning of a junction one needs to know about the presence or not of a vehicle at a set of fixed locations on the road. A location is a generally rectangular area on the lane surface of approx. 1m x 2m of The selection of these positions serve different needs, for example a) provide a trigger or parameter on which a vehicle actuated traffic management system is based for extending the current green time or for giving green to a particular approach; b) measure queue length; etc.
  • Hidden markov models:
  • According to "https//:cwiki.apache.org", HMM is defined as follows:
    • "A Hidden Markov Model (HMM) is a statistical model of a process consisting of two (in our case discrete) random variables O and Y, which change their state sequentially. The variable Y with states {y_1, ... , y_n} is called the "hidden variable", since its state is not directly observable. The state of Y changes sequentially with a so called Markov Property. This means, that the state change probability of Y only depends on its current state and does not change in time. "
  • Further, the Hidden Markov Model (HMM) is a variant of a finite state machine that does have a set of: a) hidden states, Q; b) an output alphabet(observations), O; c) transition probabilities, A; d) output (emission) probabilities, B; e) and initial state probabilities, M. The current state is not observable. Instead, each state produces an output with a certain probability (B). Usually the states, Q, and the outputs, O, are understood; thus an HMM is said to be a triple, ( A, B, M ).
  • Sources for detailed information:
  • Bayesian networks:
  • A Bayesian network (or - other naming of the same: Bayes network, belief network, Bayes(ian) model, probabilistic directed acyclic graphical model) is a probabilistic graphical model (a type of statistical model) that represents a set of random variables and their conditional dependencies via a directed acyclic graph. For example, a Bayesian network could represent the probabilistic relationships between diseases and symptoms. Given symptoms, the network can be used to compute the probabilities of the presence of various diseases (Source: Wikipedia).
  • In other words, Bayesian nets are a network-based framework for representing and analyzing models involving uncertainty. They are different from other knowledge-based systems concepts because uncertainty is handled in mathematically rigorous yet efficient and simple way.
  • The general probabilistic inference problem here is in fact to find the probability of an event given a set of evidences. This can be done in Bayesian nets with sequential applications of Bayes Theorem.
  • Support vector machines:
  • Support vector machines (SVM) are concepts used for classification. They belong to the class of supervised learning models with associated learning algorithms that analyze data and recognize patterns. The fundamental SVM takes a set of input data and predicts, for each given input, which of two possible classes forms the output. Thus, SVM is a non-probabilistic binary linear classifier.
  • Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that assigns new examples into one category or the other.
  • An SVM model is a representation of provided examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on.
  • Further, SVMs can efficiently perform non-linear classification using what is called a kernel function, and thereby implicitly mapping their inputs into high-dimensional feature spaces.
  • Fuzzy logic programming:
  • Fuzzy logic is an approach to computing based on "degrees of truth" rather than the usual "true or false" (1 or 0) Boolean logic on which the modern computer is based. Fuzzy logic includes 0 and 1 as extreme cases of truth (or "the state of matters" or "fact") but also includes the various states of truth in between (i.e. very unlikely' (gray areas of probability)). Software based on application of fuzzy-logic (as compared with that based on Formal Logic) allows computers to mimic human reasoning more closely, so that decisions can be made with incomplete or uncertain data.
  • A fuzzy inference system is a system that uses fuzzy set theory to map inputs (features in the case of fuzzy classification) to outputs (classes in the case of fuzzy classification). In other words, a fuzzy inference is an extended form of formal inferences that enables qualitative and/or quantitative evaluation of the degree of confidential level for a given causality on the basis of fuzzy expressions.
  • Weak-signal:
  • It is a particular type of faulty detection signal for vehicle presence detection. The presence detection of a vehicle at a fixed location is generally expressed at the detector/sensor output in form of a quadratic pulse in time. The width of this quadratic pulse does generally correlate with the speed of the vehicle which presence is detected. Thus, a fast vehicle will generate a pulse with a smaller width; a long vehicle like a truck will result in a longer pulse width.
  • The generated 'presence pulse' (from a given sensor/detector) is named "weak signal" if its width is too small/short and thereby does significantly deviates from what should be expected normally (i.e. according to the current context related to maximum speed, current speed and average vehicle length).
  • Pulse break-up:
  • It is a particular type of faulty detection signal for vehicle presence detection. The generated 'presence pulse' or rather 'pulse-series' (from a given sensor/detector) is named "pulse breakup" if instead of one pulse, two consecutive pulses are generated whereby one or both of the following situations are additionally true:
    • The time separation between the two pulses, we call it headway, does significantly deviate from what should be expected normally (i.e. according to the current context)
    • One of the two pulses is a 'weak signal'
    Chattering:
  • It is a particular type of faulty detection signal for vehicle presence detection.
  • The generated 'presence pulse' or rather 'pulse-series' (from a given sensor/detector) is named "chattering" if instead of one pulse, multiple (more than three) consecutive pulses are generated whereby both of the following situations are additionally true:
    • The time separation between two consecutive components of the pulse-series, the headway, does significantly deviate from what should be expected normally (i.e. according to the current context)
    • All or at least more than two of the pulse-series are 'weak signals'
    Splash-over:
  • It is a particular type of faulty detection signal for vehicle presence detection.
  • The generated 'presence pulse' (from a given sensor/detector) is named "splash-over" if both of the following are true:
    • It is generated rather on (or for) a directly/closely neighboring lane (in cross-section) and not on the lane on which the vehicle is effectively present or passing
    • It is either a "weak signal" or a "pulse break-up"
    Over-counting:
  • It is a particular type of faulty detection signal for vehicle presence detection.
  • It is a particular form of splash-over or in other words a "normal pulse" splash-over. The generated 'presence pulse' (from a given sensor/detector) is named "over-counting" if both of the following are true:
    • It is generated rather on (or for) a directly/closely neighboring lane (in cross-section) and not on the lane on which the vehicle is effectively present or passing
    • It is a normal signal (i.e. its width is normal according to the current relevant context) and neither "weak signal" nor "pulse breakup"
    Under-counting:
  • It is a particular type of faulty detection signal for vehicle presence detection.
  • It is a particular anomaly of the detection signal. Instead of generating a separate pulse for each individual vehicle passing a sensor, the sensor/detector does generate a single presence pulse for 2 or more consecutive vehicles passing by. The resulting pulse length will be higher than normal and an undercounting of the effective number of vehicles will be a further consequence. This generally happens when the headways between slowly moving vehicles is relatively small, as it is for example the case during traffic jams.
  • Rough-set theory:
  • Rough set theory is a relatively new mathematical tool for imperfect data analysis. The foundation of the rough set philosophy is the assumption that to every object of the universe of discourse some information (data, knowledge) is associated. Objects characterized by the same information are indiscernible (similar) in view of the available information about them. The indiscernibility relation generated in this way is the mathematical basis of rough set theory.
  • Rough set theory can be approached as an extension of the classical set theory, for use when representing incomplete knowledge. Rough sets can be considered as sets with fuzzy boundaries; that is, sets that cannot be precisely characterized using the available set of attributes. The basic concept of the rough set theory is the notion of approximation space.
  • Detailed information, see:
  • http://www.nit.eu/czasopisma/JTIT/2002/3/7.pdf (last access: Sep. 12th, 2012)
  • Signal energy:
  • The signal considered is the one formed by the succession of the different consecutive presence pulses over time over a given location at a cross-section. We call the energy of a signal (the above mentioned signal is meant) the area under the squared signal.
  • Shock wave pattern:
  • These pattern do reflect within the 'vehicle presence pulses' series over time the well-known traffic wave phenomenon. Traffic waves, also called stop waves or traffic shocks, are travelling disturbances in the distribution of cars on a highway. Traffic waves usually travel backwards in relation to the motion of the cars themselves, or "upstream".
  • The shock wave pattern is a particular distribution of both consecutive pulse-lengths and headways that do reflect the existence of a traffic wave in the underlying traffic stream.
  • Time series filtering:
  • Each time series data that reflects the output of an experiment or coming out as a signal from a dynamical system will expectedly contain some amount of noise that is embedded in it. The analysis of such data in presence of noise may often fail to give accurate information.
  • A method for filtering a time series data is a tool to clean as much noise from it as possible as the data should be made compatible for further analysis.
  • Quadratic trend estimation; cubic trend estimation:
  • Trends are patterns of a variable over time. In mathematics, the term quadratic describes something that pertains to squares, to the operation of squaring, to terms of the second degree, or equations or formulas that involve such terms.
  • If one looks at a time series, the first thing that stands out is the obvious tendency of the series to grow or to fall over time.
  • That is, it is immediately apparent from a time series plot that the average change in the series is either positive or negative. This tendency is the series's trend.
  • A general approximation of the trend of a time series is expressed by the following polynomial trend model: T t = β 0 + β 1 t + β 2 t 2 + + β p t p
    Figure imgb0001

    where p is a positive integer. The coefficients βi are obtained from a regression process over the underlying time series.
  • Here, of p=1, the trend is called "linear". However, if p=2, the trend is called "quadratic". And if p=3, the trend is called "cubic".
  • Detector sensitivity:
  • A sensor's sensitivity indicates how much the sensor's output changes when the measured quantity changes. For many sensor systems, the possibility exists of varying or setting the sensitivity, either manually or automatically.
  • Detector measurement time:
  • This is the effective time that the measurement process within a sensor or a detector is exposed to the physical environment from which a given parameter is measured . A high measurement time will result in a lower measurement update frequency.
  • Microscopic data:
  • In the context of traffic sensing microscopic data refer to single-vehicle related data. In this case here, these will be the time series of individual vehicle presence pulses generated by the traffic detectors.
  • Microscopic traffic analysis means analyzing the behavior of the individual driver. The characteristics of individual driving (i.e. microscopic data) can be described with the following variables: speed, acceleration, time headway, distance headway, relative speed between two consecutive vehicles. Macroscopic data:
  • In the context of traffic sensing, macroscopic data refer to an aggregation of single-vehicle related data. In this case here, these will be the time series of aggregated numbers of individual vehicle presences per time unit, for example per hour.
  • The macroscopic quantities are the flow rate (the number of cars passing a cross section per unit of time, q), the density (the number of cars per unit of distance, k), and the space mean speed.
  • Mesoscopic data:
  • Instead of time-aggregated data, characteristics of individual vehicles can be obtained for cross sections. Then the characteristics of the distributions of the measured data can be examined (mesoscopic traffic flow characteristics) which also differ by flow condition. For example, the distributions of time headways are well studied because they can, for instance, be used to estimate capacity.
  • Pattern recognition (in the context here):
  • A pattern is a particular signature that is expressed by 'elementary features' of an object. In the context here, the object is the time series of consecutive detector pulse signals. In this context the 'elementary features' are for example a part or all microscopic data attributes or a combination of them.
  • Informally, a pattern is defined by the common denominator among the multiple instances of an entity. The pattern of relevance here are for example the different faulty signal types.
  • Detector Tuning (in the context here):
  • Most detector systems do offer the possibility of setting, manually or dynamically/automatically two key parameters of the sensor system or their related equivalent: sensitivity, measurement time, and eventually some more, depending on the specific sensor system.
  • Tuning refers to setting (manually or dynamically or a recommendation) these key parameters in ranges that put the sensor systems in the best sensing conditions resulting in the best sensor performance and consequently in the lowest error/faults rate.
  • Classification error:
  • The classification error is a measure of the misclassification rate or proportions.
  • It is known a measurement error is the difference between the true value of a measurement and the value obtained during the measurement process. A classification error is a type of measurement error by which the respondent does not provide a true response to a survey item. This can occur in one of two ways: a false negative response or a false positive response. A 'false negative response' corresponds to when the measurement system indicates that an event did not occur although it did.
  • In summary, the quintessence of the overall concept is presented in Fig. 5.
  • Thus, in view of the above, there is presented a sensor reliability data model that is trained offline to gather all significant characteristics of the sensor component, the physical object in the road surface/concrete, environmental influences, all this for enabling a reliable assessment of the data quality of the single sensor. Considering the topological information including lane direction, signaling and the relative position of the sensors to each other at a complete intersection, mutual interferences are also trained in the sensor reliability data model.
  • The resulting model or rather the set of models offers for the first time the possibility of monitoring (in real- or quasi real-time and proactively) both the current state and the evolution of sensor data quality especially for sensors delivering a digital pulse stream caused induced by the presence of vehicles in traffic.
  • Thus, in view of the above, a global and comprehensive concept for a robust "traffic detectors' quality management" is presented.
  • Claims (15)

    1. A signal processing apparatus for processing at least one detection signal, the apparatus comprising:
      a receiver configured to receive the at least one detection signal from at least one sensor,
      a model generator configured to generate a model of signal sequences based on previously received detection signals while taking into consideration a topology of the sensors and/or environmental condition in the vicinity of the at least one sensor,
      comparing means configured to compare the at least one detection signal with the model of signal sequences generated by the model generator; and
      a prediction means configured to predict quality of the at least one detection signal based on a comparison of the at least one detection signal with the model generated by the model generator.
    2. The apparatus according to claim 1, further comprising
      determining means configured to determine whether or not the at least one detection signal comprises an error, and
      a classification means configured to classify the error into a plurality of predetermined error classes, if it is determined that the detection signal comprises an error.
    3. The apparatus according to claim 2, further comprising
      correcting means configured to correct the error, if it is determined that the detection signal comprises an error.
    4. The apparatus according to claim 2 or 3, further comprising:
      storing means configured to store a history of the detection signal, and
      prediction means configured to generate a detection signal error rate prediction based on the stored history.
    5. The apparatus according to claim 4, wherein the prediction is at least one of a short term prediction, a middle term prediction, and a long term prediction.
    6. The apparatus according to claim 4 or 5, further comprising:
      determining means configured to determine settings of the sensor based on the detection signal error rate prediction such that the detection signal error rate becomes minimum.
    7. The apparatus according to any one of claims 1 to 6, wherein
      the quality of the detection signal includes at least one of an overall error rate of the detection signal, and an error rate of each of the error classes of the detection signal.
    8. A method for processing at least one detection signal, the method comprising:
      receiving the at least one detection signal from at least one sensor,
      generating a model of signal sequences based on previously received detection signals while taking into consideration a topology of the sensors and/or environmental condition in the vicinity of the at least one sensor,
      comparing the at least one detection signal with the model of signal sequences generated by the model generator; and
      predicting quality of the at least one detection signal based on a comparison of the at least one detection signal with the model generated by the model generator.
    9. The method according to claim 8, further comprising
      determining whether or not the at least one detection signal comprises an error, and
      classifying the error into a plurality of predetermined error classes, if it is determined that the detection signal comprises an error.
    10. The method according to claim 9, further comprising
      correcting the error, if it is determined that the detection signal comprises an error.
    11. The method according to claim 10, further comprising:
      storing a history of the detection signal, and
      generating a detection signal error rate prediction based on the stored history.
    12. The method according to claim 11, wherein the prediction is at least one of a short term prediction, a middle term prediction, and a long term prediction.
    13. The method according to claim 11 or 12, further comprising:
      determining settings of the sensor based on the detection signal error rate prediction such that the detection signal error rate becomes minimum.
    14. The method according to any one of claims 8 to 13, wherein
      the quality of the detection signal includes at least one of an overall error rate of the detection signal, and an error rate of each of the error classes of the detection signal.
    15. A computer program product comprising computer-executable computer program code which, when the program is run on a computer, is arranged to cause the computer to carry out the method according to any one of claims 8 to 14.
    EP13162865.3A 2013-04-09 2013-04-09 Quality determination in data acquisition Withdrawn EP2790165A1 (en)

    Priority Applications (1)

    Application Number Priority Date Filing Date Title
    EP13162865.3A EP2790165A1 (en) 2013-04-09 2013-04-09 Quality determination in data acquisition

    Applications Claiming Priority (1)

    Application Number Priority Date Filing Date Title
    EP13162865.3A EP2790165A1 (en) 2013-04-09 2013-04-09 Quality determination in data acquisition

    Publications (1)

    Publication Number Publication Date
    EP2790165A1 true EP2790165A1 (en) 2014-10-15

    Family

    ID=48095611

    Family Applications (1)

    Application Number Title Priority Date Filing Date
    EP13162865.3A Withdrawn EP2790165A1 (en) 2013-04-09 2013-04-09 Quality determination in data acquisition

    Country Status (1)

    Country Link
    EP (1) EP2790165A1 (en)

    Cited By (8)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    DE102015203115A1 (en) * 2015-02-20 2016-09-08 Siemens Aktiengesellschaft Determine faulty configurations of detectors and signals from traffic lights
    CN107067114A (en) * 2017-04-20 2017-08-18 桂林电子科技大学 Tune machine paths planning method based on rebound strength curve program
    WO2020132134A1 (en) * 2018-12-18 2020-06-25 Bonny Banerjee Device and for environment state estimation and prediction
    GB2593482A (en) 2020-03-24 2021-09-29 Daimler Ag A method for associating a traffic light detection for an at least partially autonomous motor vehicle, as well as an assistance system
    EP3937151A1 (en) * 2020-07-08 2022-01-12 Siemens AG Österreich Device and method for controlling a traffic flow in a traffic network by means of an optimal signal phase plan
    CN114036990A (en) * 2022-01-06 2022-02-11 青岛鼎信通讯股份有限公司 Fuzzy enhancement method for topological signals of low-voltage transformer area
    CN116401604A (en) * 2019-05-13 2023-07-07 北京绪水互联科技有限公司 Method for classifying and detecting cold head state and predicting service life
    US11869358B2 (en) 2021-10-29 2024-01-09 Nortak Software Ltd. System and method for warning of a presence of a mobile target

    Citations (4)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    US20020154032A1 (en) * 1996-12-03 2002-10-24 Inductive Signature Technologies, Inc. Automotive vehicle classification and identification by inductive signature
    US20040252193A1 (en) * 2003-06-12 2004-12-16 Higgins Bruce E. Automated traffic violation monitoring and reporting system with combined video and still-image data
    AU2009236035A1 (en) * 2008-11-24 2010-06-10 Intelematics Australia Pty Ltd Method for Dynamically Integrating Traffic Data
    US20120161987A1 (en) * 2010-12-23 2012-06-28 Electronics And Telecommunications Research Institute Apparatus and method for detecting vehicles

    Patent Citations (4)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    US20020154032A1 (en) * 1996-12-03 2002-10-24 Inductive Signature Technologies, Inc. Automotive vehicle classification and identification by inductive signature
    US20040252193A1 (en) * 2003-06-12 2004-12-16 Higgins Bruce E. Automated traffic violation monitoring and reporting system with combined video and still-image data
    AU2009236035A1 (en) * 2008-11-24 2010-06-10 Intelematics Australia Pty Ltd Method for Dynamically Integrating Traffic Data
    US20120161987A1 (en) * 2010-12-23 2012-06-28 Electronics And Telecommunications Research Institute Apparatus and method for detecting vehicles

    Non-Patent Citations (1)

    * Cited by examiner, † Cited by third party
    Title
    LAWRENCE R. RABINER: "A tutorial on Hidden Markov Models and selected applications in speech recognition", PROCEEDINGS OF THE IEEE, vol. 77, no. 2, February 1989 (1989-02-01), pages 257 - 286

    Cited By (12)

    * Cited by examiner, † Cited by third party
    Publication number Priority date Publication date Assignee Title
    DE102015203115A1 (en) * 2015-02-20 2016-09-08 Siemens Aktiengesellschaft Determine faulty configurations of detectors and signals from traffic lights
    DE102015203115B4 (en) 2015-02-20 2022-06-15 Yunex Gmbh Determination of faulty configurations of detectors and signals from traffic lights
    CN107067114A (en) * 2017-04-20 2017-08-18 桂林电子科技大学 Tune machine paths planning method based on rebound strength curve program
    CN107067114B (en) * 2017-04-20 2020-07-28 桂林电子科技大学 Call routing method based on answer set program
    WO2020132134A1 (en) * 2018-12-18 2020-06-25 Bonny Banerjee Device and for environment state estimation and prediction
    US12094315B2 (en) 2018-12-18 2024-09-17 Bonny Banerjee Multi-sensor device for environment state estimation and prediction by sampling its own sensors and other devices
    CN116401604A (en) * 2019-05-13 2023-07-07 北京绪水互联科技有限公司 Method for classifying and detecting cold head state and predicting service life
    CN116401604B (en) * 2019-05-13 2024-05-28 北京绪水互联科技有限公司 Method for classifying and detecting cold head state and predicting service life
    GB2593482A (en) 2020-03-24 2021-09-29 Daimler Ag A method for associating a traffic light detection for an at least partially autonomous motor vehicle, as well as an assistance system
    EP3937151A1 (en) * 2020-07-08 2022-01-12 Siemens AG Österreich Device and method for controlling a traffic flow in a traffic network by means of an optimal signal phase plan
    US11869358B2 (en) 2021-10-29 2024-01-09 Nortak Software Ltd. System and method for warning of a presence of a mobile target
    CN114036990A (en) * 2022-01-06 2022-02-11 青岛鼎信通讯股份有限公司 Fuzzy enhancement method for topological signals of low-voltage transformer area

    Similar Documents

    Publication Publication Date Title
    EP2790165A1 (en) Quality determination in data acquisition
    Nguyen et al. Automatic classification of traffic incident's severity using machine learning approaches
    US20150206427A1 (en) Prediction of local and network-wide impact of non-recurrent events in transportation networks
    US20230267549A1 (en) Method of predicting the future accident risk rate of the drivers using artificial intelligence and its device
    CN113379099B (en) Machine learning and copula model-based highway traffic flow self-adaptive prediction method
    Bawaneh et al. Anomaly detection in smart city traffic based on time series analysis
    Tak et al. Development of AI‐Based Vehicle Detection and Tracking System for C‐ITS Application
    US20220262103A1 (en) Computer-implemented method for testing conformance between real and synthetic images for machine learning
    Graf et al. A learning concept for behavior prediction in traffic situations
    Mihaita et al. Traffic congestion anomaly detection and prediction using deep learning
    Gressenbuch et al. Predictive monitoring of traffic rules
    JP2017084268A (en) Accident occurrence forecast system and accident occurrence forecast method
    US10706720B2 (en) Predicting vehicle travel times by modeling heterogeneous influences between arterial roads
    Kumar et al. Bus travel time prediction: a log-normal auto-regressive (AR) modeling approach
    Bajic et al. Road roughness estimation using machine learning
    JP7313820B2 (en) TRAFFIC SITUATION PREDICTION DEVICE AND TRAFFIC SITUATION PREDICTION METHOD
    KR102545188B1 (en) Method for predicting travel time using spatio-temporal model and apparatus thereof
    Scalabrin et al. A Bayesian forecasting and anomaly detection framework for vehicular monitoring networks
    Lei et al. Identifying and correcting the errors of vehicle trajectories from roadside millimetre‐wave radars
    Li et al. Estimation and prediction of freeway traffic congestion propagation using tagged vehicle positioning data
    US20230075475A1 (en) Outlier Determination Among Models of Future Event Prediction
    Drost et al. Siamese networks for online map validation in autonomous driving
    KR20210125371A (en) Method and apparatus for analyzing traffic situation
    CN118366312B (en) Traffic detection system and method
    Choudhury et al. Real time speed trend analysis and hours of service forecasting using LSTM network

    Legal Events

    Date Code Title Description
    PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

    Free format text: ORIGINAL CODE: 0009012

    17P Request for examination filed

    Effective date: 20130409

    AK Designated contracting states

    Kind code of ref document: A1

    Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

    AX Request for extension of the european patent

    Extension state: BA ME

    R17P Request for examination filed (corrected)

    Effective date: 20150304

    RBV Designated contracting states (corrected)

    Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

    STAA Information on the status of an ep patent application or granted ep patent

    Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

    18D Application deemed to be withdrawn

    Effective date: 20171103