Multisensor Data Fusion in Car Safety
Multisensor Data Fusion in Car Safety
actuators,
Sensor Input Processing Output signals,
control
Tutorial 14:
Multisensor Data
Fusion
Part 14 in a series of tutorials in instrumentation and measurement
M
ultisensor data fusion is a multilevel, the past 15 years, several data fusion methods have
multifaceted process dealing with the been proposed.
automatic detection, association, correla- This article describes some data fusion models and
tion, estimation, and combination of data from single some applications to next-generation car safety and
and multiple information sources. The results of a data driver assistance systems. These applications are par-
fusion process help users make decisions in complicated ticularly suitable to provide an overview of multisensor
scenarios. Integration of multiple sensor data was data fusion starting from the plain detection of
originally needed for military applica- multiple objects around a given host
tions in ocean surveillance, air-to- vehicle to inferring (in order of
air and surface-to-air defense, increasing complexity):
14
or battlefield intelligence. ◗ The position and speed of
More recently, multisen- possible obstacles
sor data fusion has also
included the nonmilitary
fields of remote environ-
mental sensing, medi-
Fourteenth
in a
◗ The type of the objects in
the road environment
◗ The relative movement
and distribution of ob-
cal diagnosis, automated
monitoring of equipment,
robotics, and automotive
Series stacles over a given area
◗ The early detection of a
possible collision
systems. ◗ Possible suggestions for
Inputs to a multisensor data prompt and effective counter-
fusion system include raw sensor measures (e.g., sudden braking,
data, commands, model parameters, steering wheel adjustment, etc.).
and other a priori information. Output is much more For instance, the techniques developed within the
heterogeneous and can range from simple estimates European Union project titled “Preventive and Active
about the attributes of a certain entity (a physical Safety Applications Contribute to the Road Safety Goals
phenomenon, a military target, or a fault condition in on European Roads” (PReVENT) are able to create
a machine) to complex inferences about current or safety zones around vehicles by means of embedded
future relationships between multiple entities. During multisensor data fusion systems that sense the type and
Fig. 3. Block diagram of the ProFusion2 data fusion model specifically defined for car safety and driver assistance applications.
Fig. 4. Overview of the four issues associated to a generic parameter or attribute estimation problem.
lems involving vehicles in motion, the state variables are often complexity, and available resources. A linear model is usually
chosen to be the distance and the relative velocity of the ob- adequate for observations that are closely spaced in time.
stacles surrounding the host vehicle. The observation equations After defining the system model, the estimation process
are defined to predict the future observations using just the aims at finding the state vector enabling the “best fit” between
current state of the system. When dealing with vehicles, this the actual observed data and the values predicted by Equation
often means deriving equations to predict a vehicle’s future (1). If we refer to the residual vector as the N-long vector contain-
position and velocity. ing the differences between the observed and the predicted
Thus, if x(ti) represents the value of the state vector at time, values of ŷ(ti) at the times ti, with i = 1,…, N, the most widely
ti, then the predicted observation, ŷ(ti), at the same time is used optimization criteria are the following:
◗ Least square (LS) optimization: this approach relies on the
minimization of the sum of the squares of the residuals.
yˆ ti h x ti n ti (1) ◗ Weighted least square (WLS) optimization: this technique
is based on the minimization of the sum of the weighted
where h[x(ti)] is a function of the state vector returning the pre- squares of the residuals.
dicted observations at a time ti, and n(ti) represents the random ◗ Bayesian weighted least square (BWLS) optimization: it is
acquisition noise. If the state vector is constant over time, the similar to WLS, but in this case the sum of the weighted
estimation problem is called static. Conversely, if the state vec- squares of the residuals is constrained by the a priori
tor changes as a function of time the estimation problem is re- knowledge of the state vector x.
ferred to as dynamic, and an additional set of equations of motion ◗ Mean square error (MSE) optimization: this criterion seeks to
is required to propagate the state of a system from the initial minimize the expected value of the squared error vector.
time t0 to the time ti in which an observation is collected. ◗ Maximum likelihood estimate (MLE): in this case, the op-
The typical form of an equation of motion is the following: timal value of x maximizes the multivariate probability
distribution modeling the observation noise.
x ti F ti , t0 x t0 (2) Notice that any optimization criterion requires the minimi-
zation or maximization of a certain cost function. The choice
where F(·,·) is the state propagation function matrix. As a rule of the best function depends on the type of problem requir-
of thumb, the definition of the equations of motion results ing optimization as well as on the statistics of observational
from the trade-off between realism, accuracy, computational noise.
Fig. 7. Block diagram of a typical fusion process based on the Bayesian inference approach.
serving the ith obstacle with i = 1,…,M. Therefore, if the stand- For instance, if Bi is an elementary proposition, such as
alone hypothesis probabilities P(Hi) are known due to some A1, then Spt(Bi) = m(A1), whereas if Bi = A1∨A2∨A3, Spt(Bi) =
a priori information, the conditional probabilities [P(Hi| Dj) m(A1) + m(A2) + m(A3) + m(A1∨A2) + m(A2∨A3) + m(A1∨A3) +
… P(HM| Dj)] for any identity declaration can be determined m(A1∨A2∨A3).
using Equation (5). Accordingly, a set of M joint multisensor Similarly to the Bayes’ combination formula, in the D-S
probabilities of having detected a certain obstacle as a result method also some rules to combine the BPA functions related
of multiple declarations is given by the Bayes’ combination to independent sources that must be defined. In the simplest
rule, i.e.: case (i.e., when just two sources of information are consid-
ered) the basic Dempster’s combination rule states that the
P H i P D1|H i ...P DN |H i
P H i|D1 ,..., DN M
i=1,…,M (6) joint probability mass associated with a generic proposition Bi
P H i P D1|H j ...P DN |H j results from the product sum of all individual BPA functions
j 1
over the total probability associated to nonconflicting proposi-
Ultimately, the recognized obstacle (i.e., the result of the tions. In symbols, this means that:
identity fusion inference process) is most probably the hypoth-
esis whose joint probability function, Equation (6), is maxi- m1 X k m2 X j
B X ki X j
mum. This decisional rule is usually referred to as maximum a m12 Bi (8)
1 c
posteriori probability (MAP) criterion.
The Bayesian inference process for data fusion suffers from where Xk and Xj are two generic propositions of the power set
three main drawbacks. The first is that estimating the a priori of Bi and
probabilities, P(Hi), is not always feasible. The second is that
the observations collected by the sensors are not always sta- c m1 X k m2 X j (9)
Xk X j
tistically independent. The third is that we have to be sure that
the M hypotheses are mutually exclusive.
Some of these issues can be tackled by the so-called Demp- is the factor that take conflicts into account. Observe that if
ster-Shafer (D-S) method [5]. The D-S approach is formally a we have a large number of conflicts, the normalization factor
generalization of the Bayesian inference process, but it is closer (1 – c) may become very small, thus leading to a counterin-
to the human way of thinking. In particular, starting from a set tuitive increment of the joint probability mass. For this reason,
of hypotheses, the D-S method aims at estimating the probabil- more sophisticated combination rules have recently been
ity intervals associated with elementary or general propositions proposed [5].
resulting from the combination of different hypotheses. The The application of the D-S method in sensor fusion systems
elementary propositions consist of an individual hypothesis and is very similar to that of the Bayesian approach. For instance,
are mutually exclusive. On the other hand, the general proposi- within the project PReVENT, the D-S method is used to identi-
tions are obtained by combining the elementary propositions fy the driver’s maneuvers, thus triggering possible preventive
using the “OR” Boolean operator. As a consequence, they may actions if some danger is detected [1]. In this specific case study,
contain overlapping or even conflicting hypotheses. the set q of elementary propositions was chosen to be q = {“lane
If q = {A1, A2, …, An} is the set of n elementary propositions change”, “overtaking”, “free flow”, “cut-in”, “merging”, “follow-
and 2Q represents the power set of q containing all the possible ing vehicle in the same path”, “following vehicle in the next lane”,
general propositions, the D-S method defines the evidential in- “unknown maneuver”} [3]. Of course, a complex maneuver can
terval associated to a generic proposition Bi ∈2Q as IBi = [Spt(Bi), consist of multiple elementary maneuvers. After collecting
Pls(Bi)], where Spt(Bi) is the support for the proposition Bi (i.e., heterogeneous sets of estimates from various sources of infor-
a metric of its likelihood), whereas Pls(Bi) = 1 – Spt(B̄ i) is its plau- mation (e.g., the Time to Lane Crossing with Constant Velocity
sibility, namely the probability of not supporting the negated (TLC_CV), The Time to Lane Crossing with Constant Accelera-
proposition B̄ i. Due to the definition of support and plausibility, tion (TLC_CA), The Predicted Minimum Distance (PMD) and
the probability P(Bi) associated with the proposition Bi has to so on), it is possible to assign different BPA functions to each
lie within the corresponding evidential interval. By referring source of information for any of the maneuver types in q. Some
examples of BPA values associated to the TLC_CV parameter ◗ In the hypothesis evaluation (HE) step the entries of the as-
for various maneuver types are shown in Figure 8. sociation matrix are processed to assess the likelihood of
In this example, the TLC_CV parameter represents the a certain hypothesis from a quantitative point of view. In
estimated time for lane crossing when the host vehicle keeps practice, this means that all the entries of an association
on moving with constant velocity. Notice, that if the TLC_CV matrix are filled in with numerical values resulting from
is less than 1 second, the BPA of lane change (LC) and overtake the application of the chosen “closeness” function on the
(OV) is obviously high. Conversely, when the TLC_CV value measurement or predicted data.
increases, the BPA of other kinds of maneuvers tends to grow. ◗ Finally, in the hypothesis selection (HS) step the results of
By properly combining the various probability masses, at first the HE phase are sorted and processed to determine the
the support and plausibility boundaries for every maneuver best set of hypotheses explaining the incoming data.
type and source of information can be estimated, and then The data-entity association problem is critical because it
the joint probability masses can be calculated using Equation affects the accuracy of the whole data fusion process. In car
(8). Finally, a vector of evidential intervals related to all the safety applications, a wrong data-entity association could lead
possible maneuvers is built up. Some decision logic similar to to erroneous and potentially dangerous preventive actions. In
the logic adopted in the Bayesian approach will be used to es- fact, if the observations of a cluster of laser rangefinders are as-
tablish the result of the maneuver classification process. In this sociated to the wrong obstacle, the relative velocity of the cor-
way, some dangerous situations (e.g., when a driver is about to responding object could be grossly over- or underestimated,
overtake the in-front vehicle, but another car outside the back thus leading to either excessive or inadequate braking.
mirror field of view is in turn overtaking the host vehicle) can When the situation refinement performed by the system
be detected and promptly signaled to the driver. requires the analysis of very involved scenarios (e.g., to predict
Consider that the statistical inference techniques men- possible collisions with other vehicles on the road), the infer-
tioned above can be applied at different levels of processing ence mechanisms could rely not only on statistics but also on
in a data fusion system. For instance, at level 1 they can be ap- advanced abilities of reasoning and cognition such as planning,
plied to solve the well-known entity-data association problem. deduction, and induction. Emulating such abilities is the basic
Performing a data-entity association means mapping multiple objective of artificial intelligence techniques. An exhaustive de-
sets of data to the correct entity when multiple objects are in the scription of such techniques is beyond the scope of this paper.
same area at the same time. Usually, the term association refers In the specific field of data fusion techniques, interpreta-
to the attempt of estimating “the closeness” between two data tion of fused data for situation or threat analysis relies mostly
(e.g., two observations or two feature vectors) or between a on expert, or knowledge-based, systems (KBS) [8]. In general, the
new observation and an existing track containing some signifi- structure of an expert system consists of four logical parts:
cant parameters related to the same entity. ◗ A knowledge base (KB) containing all the basic information
Common association measures are correlation coefficients, representing the expertise of the system. Such knowledge
distance measures, and probabilistic similarity measures [4]. can be described through various techniques, including
From an operative viewpoint, the data-object association pro- production rules, which perform a certain action if some evi-
cess usually consists of three steps that are summarized in the dence exists; semantic nets, i.e., graph-based representations
following describing possible relationships between classes of objects
◗ In the hypothesis generation (HG) step the incoming data and related specific instances (e.g., the class of a vehicle with
are processed to create a so-called association matrix: a respect to the vehicle specific model); frames, namely data
table linking the input data and some hypotheses describ- records summarizing the properties of classes and objects;
ing how data could be related. and scripts, representing situations and events through acts,
scenes, settings, and actions, as in a theater play
◗ A global database including dynamic data changing at
run-time
◗ A control structure or inference engine that attempts to find
one or more rules to make an inference about the current
situation by starting from input dynamic data and the
knowledge base
◗ Finally, a human-machine interface that gives users con-
trol over the whole process and allows the user to make
decisions on the basis of the results obtained by the data
fusion process.
The combination of the KB, the database, and the infer-
ence engine enables the implementation of automated rea-
soning techniques. The basic inference process underlying
an expert system for car safety purposes is shown schemati-
cally in Figure 9. Starting from the dynamic input data (e.g.,
Fig. 8. Example of basic assignment probability (BPA) functions associated the velocity and direction of the host vehicle as well as the
with the Time to Lane Crossing with Constant Velocity (TLC_CV) parameter in a type of nearby vehicles, their position, and speed values)
car safety application.
Additionally, the accuracy of a data fusion system strongly Italy. His research interests include the design, implementa-
depends on the specific operating conditions under which the tion, and testing of embedded systems, with special emphasis
system itself is used. on wireless sensor networks.