5. Conclusion
In this paper, we proposed a Bagging ELM-based spammer detection framework for SNSs. Our proposed framework has three major contributions in this area. First, it identifies account- and object-specific features to facilitate spammer detection in SNSs. Second, it constructs a novel dataset of the two most popular SNSs, i.e., Twitter and Facebook. Finally, it introduces a Bagging ELM classifier and applies this classifier to the dataset that we constructed from Twitter and Facebook. Our experiments and comparison with other traditional classifiers show that our framework is able to achieve much better generalization performance than other existing frameworks. Our framework achieved average accuracy rate of 99.01 % for the Twitter dataset and 99.02 % for the Facebook dataset while requiring shorter training time of 1.17s and 1.10s, respectively. Note, however, that the performance result of the framework relies on the labeled dataset, which typically needs considerable labor cost and time. Furthermore, manually labeling the process to obtain the labeled dataset suffers from inaccurate result due to individual bias. To address the issue of labeled dataset, our framework can be enhanced by using semi-supervised learning with ELM. The semi-supervised ELM can support the use of easily acquired unlabeled dataset and provide good generalization performance at higher speed.