9. Conclusion and future work
In this paper, we determined that stock prices sparsely show similar patterns and all the variables do not have a significant impact on the price. For short-term prediction, we proposed a novel method based on the combination of dynamic time warping, stepwise regression, and artificial neural network model to find similar historical datasets for each stock item and predict daily stock price using optimal significant variables through feature selection and comparison of leverage. Moreover, we dealt with the overall process using a big data processing framework composed of Hadoop, R, and RHive. Finally, we demonstrated the prediction accuracy for three stock items using SAX and Jaro-Winkler distance. In future work, we will improve the reliability of the predicted stock price by relation analysis of same field for a longer period and enhance the execution time by changing our system or file structure to use minimum search queries.