6. Conclusions
This paper has introduced a novel methodology for the selection of subjective sequential data for the training of ES. The methodology is based on the arrangement of homogeneous information acquired from a group of human experts. It has been proposed two different similarity measures between linearized sequential data, and a novel similarity measure that uses their combination using cluster information.
Several experiments have been achieved in order to illustrate the performance of the methodology. Three of the most representative ones have been included. The first example uses synthetic data to present the methodology. The methodology has been applied to a practical case of the ITS domain where an ES for driving risk prediction has been trained and evaluated through risk evaluations acquired from a group of traffic safety experts. An experiment has been focused on an urban scenario, and another experiment makes use of data collected from an interurban scenario.
The obtained results from these experiments have shown the relevance of selecting homogeneous information for the generation of a reliable ground truth. Also, it could be concluded that the ES trained with homogeneous evaluations performed better when predicting the driving risk. Moreover, these results show the relevance of the use of subjective sequential data when dealing with phenomena where a set of rules could not be easily acquired from human experts, such as risk assessment. In this case, the rules have been properly learned from a set of homogeneous evaluations arranged with the presented methodology obtaining outstanding results.