ANALYZING THE IMPACT OF FEATURE SELECTION USING INFORMATION GAIN FOR AIRLINES' CUSTOMER SATISFACTION

Farah Aqilah Bohani1*, Farah Syazwani Mohamed Rashid2, Yuzi Mahmud3, Sitti Rachmawati Yahya4

1,2,3School of Computing Sciences, College of Computing, Informatics, and Mathematics, Universiti Teknologi MARA, 40450 Shah Alam

4Department of Information System, Asia Cyber University, South Jakarta, Indonesia




ABSTRACT

 

Feature selection has become a focus of research in many fields that deal with machine learning and data mining because it can make classifiers cheaper to run, faster, and more accurate. This paper examines the impact of feature selection using a filter method, Information Gain. The impact is analyzed in terms of the accuracy of two classifiers, J48 and Naïve Bayes. The Airline Customer Satisfaction datasets were used to compare classifier performance with and without Information Gain. After applying Information Gain, J48 achieved accuracy improvements of 0.33% and 0.29% for 10-fold and 20-fold cross-validation, respectively, compared with Naïve Bayes. Most of the precision and F1-score values for J48 with Information Gain also improved under both evaluation methods compared with Naïve Bayes. In conclusion, of the two classifiers, J48 appears to be the more sensitive to feature selection and showed improvements compared with Naïve Bayes.

 


Keywords: Airline Customer Satisfaction, J48, Naïve Bayes, Feature Selection, Information Gain
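
For readers unfamiliar with the filter method named in the abstract, the following is a minimal sketch of how Information Gain can be computed and used to rank categorical features. It is an illustration only, independent of whatever tooling the authors used. The feature names (seat_comfort, gender), the class labels, and the four toy records are hypothetical and are not taken from the Airline Customer Satisfaction datasets.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their weighted entropy within each feature value."""
    groups = defaultdict(list)
    for value, label in zip(feature_values, labels):
        groups[value].append(label)
    total = len(labels)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(rows, feature_names, labels):
    """Return (feature, information gain) pairs sorted from highest to lowest gain."""
    scores = {
        name: information_gain([row[name] for row in rows], labels)
        for name in feature_names
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy records: seat_comfort separates the classes, gender does not.
ROWS = [
    {"seat_comfort": "high", "gender": "F"},
    {"seat_comfort": "high", "gender": "M"},
    {"seat_comfort": "low", "gender": "F"},
    {"seat_comfort": "low", "gender": "M"},
]
LABELS = ["satisfied", "satisfied", "dissatisfied", "dissatisfied"]

if __name__ == "__main__":
    for name, gain in rank_features(ROWS, ["seat_comfort", "gender"], LABELS):
        print(f"{name}: {gain:.3f}")
```

In a filter-style workflow, features whose gain falls below a chosen threshold (or outside the top k) would be dropped before training a classifier such as J48 or Naïve Bayes; the threshold itself is a design choice not specified in the abstract.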

 

Published On: 1 April 2024
