Handling outliers and over fitting for lung cancer disease prediction through neural network clustering
Author(s): Christopher Francis Britto, Dr. Bidyut Kumar Das and Yogesh V Patil
Abstract: With the rapid increase of cancer disease-affected people worldwide, predicting the cancer disease patients and diagnosing the cancer disease becomes crucial. The cancer disease diagnosis models have greatly confronted the data scarcity and often deal with the outliers. This paper presents the Neural network Clustering-based Outlier Handling (NCOH) methodology to improve the performance of multi-class cancer detection by modeling the outlier detection and outlier handling processes. The proposed methodology enhances the data augmentation process and the outlier handling process to enforce the cancer disease diagnosis using the oversampling and the neural network clustering methods. The NCOH approach handles the outliers with the impact of the clustering and the Z-score normalization score. The experimental results illustrate that the proposed approach significantly outperforms the existing outlier detection method by 15% of higher recall while testing the performance of the multi-class cancer classification model.
Christopher Francis Britto, Dr. Bidyut Kumar Das, Yogesh V Patil. Handling outliers and over fitting for lung cancer disease prediction through neural network clustering. Int J Eng Comput Sci 2024;6(2):09-17. DOI: 10.33545/26633582.2024.v6.i2a.118