We experimented with two grouping algorithms. The first is a greedy algorithm. It first assumes that each sub-trajectory given by the segmentation initially forms a group by itself. Let be this initial set of groups.
We want to know how to merge groups so that the resulting groups are sufficiently coherent that a Gaussian model is a good representation.
For every pair of elements of
, we compute the variance of the group
that is built using the sub-trajectories from both
and
. We select the pair of groups
that gives the lowest variance and merge those two groups to form only one group. We delete
and
from
and insert
instead.
We iterate the process until we reach a given number of clusters.
Unfortunately this algorithm becomes quite inefficient if we want to process a large number of sub-trajectories (several hundreds in practice). Its complexity is
where
is the total number of sub-trajectories.
For larger training sets of sub-trajectories, we use the normalised cuts algorithm of Shi and Malik [12]. This computes groups by analysing the eigenvectors of a similarity matrix. The computation of this matrix is in
where
is the complexity of the similarity measure. The complexity of the solution to the eigenvalue problem usually has a complexity of
but fortunately, for a sparse matrix, it can be reduced to a lower complexity, using the Lanczos algorithm.
We tried this algorithm with two similarity measures, one is the euclidian distance, and the other one is based on dynamic time warping. Both have a complexity of
which brings the complexity of the whole algorithm to
. The results are slightly better for the euclidian distance because the dynamic time warping measure is invariant to transformations such as rotations or translations. The principal component analysis used to compute equation 1 performs poorly if we use a similarity measure which is not strict enough on the allowed transformations.
The results, even if still acceptable, are visually less effective than those from the greedy algorithm. However, the normalised cuts method scales better for a large number of sub-trajectories.