A feature selection model based on genetic rank aggregation for text sentiment classification

Onan, A. and KorukoGlu, S.

Tarih

2017

Yazar

Onan, A. and KorukoGlu, S.

Üst veri

Tüm öğe kaydını göster

Özet

Sentiment analysis is an important research direction of natural language processing, text mining and web mining which aims to extract subjective information in source materials. The main challenge encountered in machine learning method-based sentiment classification is the abundant amount of data available. This amount makes it difficult to train the learning algorithms in a feasible time and degrades the classification accuracy of the built model. Hence, feature selection becomes an essential task in developing robust and efficient classification models whilst reducing the training time. In text mining applications, individual filter-based feature selection methods have been widely utilized owing to their simplicity and relatively high performance. This paper presents an ensemble approach for feature selection, which aggregates the several individual feature lists obtained by the different feature selection methods so that a more robust and efficient feature subset can be obtained. In order to aggregate the individual feature lists, a genetic algorithm has been utilized. Experimental evaluations indicated that the proposed aggregation model is an efficient method and it outperforms individual filter-based feature selection methods on sentiment classification. © The Author(s) 2015.

Bağlantı

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85011610015&doi=10.1177%2f0165551515613226&partnerID=40&md5=1be8e2db0f6cc5d94fe50028b7881936
http://hdl.handle.net/20.500.12481/12249

Koleksiyonlar

Scopus [2994]