In this section, the learning text is 70,000 letters long. The learnt VLMM has been tested on three texts:
The prediction capabilities (percentage of letters correctly predicted) are reported in Table 6.1. The results show similar performance on each text, but the Matusita measure gives markedly better predictions than the KL divergence (note that a flat prior would achieve about
).
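For reference, the two measures compared above can be sketched as follows. This is a minimal illustration that assumes the standard textbook definitions of the Kullback-Leibler divergence and the Matusita distance between two discrete next-letter distributions; the function names and the three-letter example distributions are hypothetical and are not taken from the experiments reported here.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in nats.

    Assumes q[i] > 0 wherever p[i] > 0; terms with p[i] == 0 contribute 0.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def matusita_distance(p, q):
    """Matusita distance: Euclidean distance between sqrt(p) and sqrt(q)."""
    return math.sqrt(
        sum((math.sqrt(pi) - math.sqrt(qi)) ** 2 for pi, qi in zip(p, q))
    )


# Two illustrative next-letter distributions over a hypothetical
# three-letter alphabet (values chosen only for the example).
p = [0.7, 0.2, 0.1]
q = [0.5, 0.3, 0.2]
print(kl_divergence(p, q), matusita_distance(p, q))
```

Under these definitions the Matusita distance is symmetric and bounded by the square root of 2, whereas the KL divergence is asymmetric and unbounded, which may be one reason the two measures behave differently when used to compare distributions.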
The small differences between the percentages of correct guesses on the three texts suggest that the VLMM did not ``over learn'' the learning text, as its performance on the quite dissimilar test set was close to that on the training set.
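The comparison described above can be sketched as a small scoring routine. This is only an illustration under stated assumptions: `percent_correct`, `flat_prior_percent`, and `toy_predict` are hypothetical names, the toy predictor is a stand-in and not a VLMM, and the short strings are made-up data rather than the texts used in this section.

```python
def percent_correct(predict, text, context_len):
    """Percentage of letters in `text` guessed correctly by `predict`.

    `predict` is any callable mapping a context string (up to
    `context_len` preceding letters) to a single guessed letter.
    """
    hits = 0
    total = 0
    for i in range(1, len(text)):
        context = text[max(0, i - context_len):i]
        if predict(context) == text[i]:
            hits += 1
        total += 1
    return 100.0 * hits / total if total else 0.0


def flat_prior_percent(alphabet_size):
    """Expected percentage of correct guesses when guessing uniformly at random."""
    return 100.0 / alphabet_size


def toy_predict(context):
    """Toy stand-in predictor (NOT a VLMM): guess 'b' after 'a', otherwise 'a'."""
    return "b" if context.endswith("a") else "a"


# Made-up training-like and test-like strings, for illustration only.
train_score = percent_correct(toy_predict, "abababab", context_len=3)
test_score = percent_correct(toy_predict, "ababbaba", context_len=3)
print(train_score, test_score, train_score - test_score)
```

A small gap between the two scores, with both well above the flat-prior baseline for the alphabet in use, is the pattern that the paragraph above reads as an absence of over-learning.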