6. Conclusion and future
work In this paper, we have presented a process for designing sensor sets to capture energy events in buildings. The key contributions lie in the use of random forests to produce a measure of sensor value a priori, and the implementation of a bounded knapsack problem (BKP) solver that chooses an optimum sensor set given a set of costs and values. Through a field study in 4 UK homes, we have illustrated how random forests can be used to output a measure of predictive value using the Gini impurity measure, and how this measure e when combined with an appropriate cost measure, e.g. financial cost e can be used to generate sensor sets given designer constraints. Through this, we have also shown that more valuable but expensive sensors such as CO2 are often not included in the sets due to their high cost. Furthermore, we have shown that CO2 and light sensors are particularly predictable, with a mean predictable proportion for both of 0.4 bits from the other sensors used in our study of domestic buildings (temperature, humidity and sound level).