Exploring the Performance Characteristics of the Naïve Bayes Classifier in the Sentiment Analysis of an Airline’s Social Media Data
Volume 5, Issue 4, Page No 266–272, 2020
Adv. Sci. Technol. Eng. Syst. J. 5(4), 266–272 (2020)
DOI: 10.25046/aj050433
Keywords: Airline image branding, Naïve Bayes, Sentiment analysis
Airline operators receive a great deal of customer feedback, which is vital for both operational and strategic planning. Social media has become one of the most popular platforms for obtaining such feedback. However, analyzing, categorizing, and extracting useful insight from the huge quantity of data on social media is not a trivial task. This study investigates the capability of the Naïve Bayes classifier for analyzing sentiment about airline image branding. It further examines the impact of training data size on the accuracy of the classifier. As a case study, we collected data from online conversations about an incident in which an airline’s security operatives roughly handled a passenger. The incident reportedly resulted in a loss of about $1 billion of the company’s corporate value. Data were extracted from Twitter, preprocessed, and analyzed using the Naïve Bayes classifier. The findings showed 62.53% negative and 37.47% positive sentiment about the incident, with a classification accuracy of over 0.97. To assess the impact of training size on the accuracy of the classifier, the size of the training set was varied. A direct linear relationship between the training size and the classifier’s accuracy was observed. This implies that large training data sets have the potential to increase the classification accuracy of the classifier. However, it was also observed that a continuous increase in the training size could lead to overfitting. Hence, there is a need to develop mechanisms for determining the optimum training size for the best accuracy of the classifier. The negative perceptions of customers could have a damaging effect on a brand and ultimately lead to catastrophic losses for the organization.
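The procedure summarized in the abstract (train a Naïve Bayes sentiment classifier, then vary the training size and measure accuracy) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state which Naïve Bayes variant or preprocessing was used, so a multinomial model with add-one (Laplace) smoothing is assumed here. The example tweets and the names `tokenize`, `NaiveBayes`, `accuracy`, `train`, `held_out`, and `curve` are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase, split on whitespace, keep purely alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (an assumed variant)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)       # documents per class
        self.word_counts = defaultdict(Counter)   # word frequencies per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for w in tokenize(text):
                if w in self.vocab:                # skip unseen words
                    score += math.log((counts[w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, texts, labels):
    """Fraction of texts whose predicted label matches the true label."""
    hits = sum(model.predict(t) == y for t, y in zip(texts, labels))
    return hits / len(labels)


# Hypothetical toy data; the study itself used tweets about a real incident.
train = [
    ("terrible service and rude staff", "negative"),
    ("great flight and friendly crew", "positive"),
    ("awful treatment of the passenger", "negative"),
    ("smooth boarding and great seats", "positive"),
    ("rude staff dragged the passenger", "negative"),
    ("friendly crew and smooth landing", "positive"),
]
held_out = [
    ("rude and awful treatment", "negative"),
    ("great seats and friendly staff", "positive"),
]

# Retrain on growing prefixes of the training set and record accuracy.
curve = {}
for n in (2, 4, 6):
    model = NaiveBayes().fit(*zip(*train[:n]))
    curve[n] = accuracy(model, *zip(*held_out))
    print(f"training size {n}: accuracy {curve[n]:.2f}")
```

A data set this small cannot reproduce the reported trend or the overfitting effect. It only shows the shape of the experiment: fix a held-out set, grow the training set, and compare accuracies across sizes.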