An Optimal Solution to the Overfitting and Underfitting Problem of Healthcare Machine Learning Models
Abstract
In the current technological era, artificial intelligence is becoming increasingly popular. Machine learning, as the branch of AI is taking charge in every field such as healthcare, the Stock market, Automation, Robotics, Image Processing, and so on. In the current scenario, machine learning and/or deep learning are becoming very popular in medical science for disease prediction. Much research is underway in the form of disease prediction models by machine learning. To ensure the performance and accuracy of the machine learning model, it is important to keep some basic things in mind during training. The machine learning model has several issues which must be rectified duration of the training of the model so that the learning model works efficiently such as model selection, parameter tuning, dataset splitting, cross-validation, bias-variance tradeoff, overfitting, underfitting, and so on. Under- and over-fitting are the two main issues that affect machine learning models. This research paper mainly focuses on minimizing and/or preventing the problem of overfitting and underfitting machine learning models.
Downloads
References
[2] O. A. M. López, A. M. López, and J. Crossa, (2022) “Multivariate Statistical Machine Learning Methods for Genomic Prediction”, https://doi.org/10.1007/978-3-030-89010-0, Springer Cham Publisher, Biomedical and Life Sciences book chapter - 4, Vol. , Issue, Pp. 109-122, November 2021
[3] A. K. Prajapati and U. K. Singh, “An empirical analysis of ML techniques and/or algorithms for disease diagnosis prediction from the perspective of cardiovascular disease (CVD),” International Journal of Computing Algorithm, vol. 11, no. 2, Pp. 6-16, Dec. 2022, doi: 10.20894/ijcoa.101.011.002.002.
[4] S. Zha F. Luisier, W. Andrews, N. Srivastava, and R. Salakhutdinov, “Exploiting Image-trained CNN Architectures for Unconstrained Video Classification”,
https://doi.org/10.48550/arXiv.1503.04144, Arxiv publishing, Cornell University, Vol. 3, Issue, Pp. 1-9, May 2015.
[5] S. Hershey, S. Chaudhuri, D. P. W. Ellis, J. F. Gemmeke, A. Jansen, R. C. Moore, M. Plakal, D. Platt, R. A. Saurous, B. Seybold, M. Slaney, R. J. Weiss, and K. Wilson “CNN Architectures for large-scale audio classification”, DOI: 10.1109/ICASSP.2017.7952132, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), June 2017.
[6] E. Sharma, G. Ye, W. Wei, R. Zhao, Y. Tian, J. Wu, L. He, E. Lin and Y. Gong “Adaptation of RNN transducer with text-to-speech technology for keyword spotting”, DOI: 10.1109/ICASSP40776.2020.9053191, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2020.
[7] I. Lee, and Y. J. Shin, “Machine learning for enterprises: Applications, algorithm selection, and challenges”, https://doi.org/10.1016/j.bushor.2019.10.005, Business Horizons, Elsevier Journal, Vol. 63, Issue 2, Pp. 1-14, April 2020.
[8] T. Yu, and H. Zhu, “Hyper-Parameter Optimization: A Review of Algorithms and Applications”,
https://doi.org/10.48550/arXiv.2003.05689, Arxiv publishing, Cornell University, Vol. 1, Issue, Pp. 1-56, March 2020.
[9] L. Yang, and A. Shami, “On hyperparameter optimization of machine learning algorithms: Theory and practice” https://doi.org/10.1016/j.neucom.2020.07.061, Neurocomputing, Elsevier Journal, Vol. 415, Issue, Pp. 1-22, November 2020.
[10] X. Du, H. Xu, and F. Zhu, “Understanding The Effect of Hyperparameter Optimization on Machine Learning Models for Structure Design Problems”, https://doi.org/10.1016/j.cad.2021.103013, Computer-Aided Design, Elsevier Journal, Vol. 135, Issue, Pp. 1-16, June 2021.
[11] Q. Wang, Y. Ma, K. Zhao, and Y. Tian, “A Comprehensive Survey of Loss Functions in Machine Learning”. https://doi.org/10.1007/s40745-020-00253-5” Annals of Data Science. Vol., Issue, Pp. 1-26, March 2020
[12] B. U. Bawankar and K. Chinnaiah, “Implementation of ensemble method on dna data using various cross validation techniques”, https://doi.org/10.17993/3ctecno.2022.v11n2e42.59-69, 3c Technology innovation glosses applied to SMEs, . Vol 11., Issue 2, Pp. 1-11, December 2022
[13] M. Rafało, “Cross validation methods: Analysis based on diagnostics of thyroid cancer metastasis”, https://doi.org/10.1016/j.icte.2021.05.001, ICT Express (ScienceDirect), Vol 8., Issue 2, Pp. 1-6, June 2022
[14] V. Singh, M. Pencina, A. J. Einstein, J. X. Liang, D. S. Berman & P. Slomka “Impact of train/test sample regimen on performance estimate stability of machine learning in cardiovascular imaging” https://doi.org/10.1038/s41598-021-93651-5, www.nature.com/scientificreports, Vol 11., Issue, Pp. 1-8, July 2021
[15] A. Rácz, D. Bajusz and K. Héberger, “Effect of Dataset Size and Train/Test Split Ratios in QSAR/QSPR Multiclass Classification”, https://doi.org/10.3390/ molecules26041111, Molecules [MDPI], Vol 26., Issue 4, Pp. 1-16, February 2021
[16] Anil Kumar Prajapati,Umesh Kumar Singh. Cardiovascular disease (CVD) prediction through Artificial Neural network in the perspective of Deep Learning, International Journal of Computing Algorithm, Vol. 11, Issue 2, 2022, pp. 1-7, DOI: 10.20894/IJDMTA.102.011.002.001, ISSN: 2278-2397
[17] L. Li, and M. Spratling “Understanding and combating robust overfitting via input loss landscape analysis and regularization” https://doi.org/10.1016/j.patcog.2022.109229, Pattern Recognition, Vol. 136, Issue, Pp. 1-11, April 2023
[18] G. K. Gupta, and D. K. Sharma, “A Review of Overfitting Solutions in Smart Depression Detection Models” 10.23919/INDIACom54597.2022.9763147, 9th International Conference on Computing for Sustainable Global Development (INDIACom), March 2022
[19] F. Heintz, M. Milano, and B. O’Sullivan “Trustworthy AI – Integrating Learning, Optimization and Reasoning”, https://doi.org/10.1007/978-3-030-73959-1, First International Workshop, TAILOR 2020 Virtual Event (Springer Cham), Pp. 31-42, September 2020.
[20] A. D. Gavrilov, A. Jordache, M. Vasdani, and J. Deng “Preventing Model Overfitting and Underfitting in Convolutional Neural Networks”, DOI: 10.4018/IJSSCI.2018100102, International Journal of Software Science and Computational Intelligence, Vol. 10, Issue 4, Pp. 1-10, December 2018
[21] T. Aotani, T. Kobayashi, and K. Sugimoto,” Meta-Optimization of Bias-Variance Trade-Off in Stochastic Model Learning” 10.1109/ACCESS.2021.3125000, IEEE Acess, Vol. 9, Issue 4, Pp. 1-10, November 2021.
[22] Y. Dar, V. Muthukumar, and R. G. Baraniuk “A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning”
https://doi.org/10.48550/arXiv.2109.02355, Arxiv publishing, Cornell University, Vol. 1, Issue, Pp. 1-48, September 2021.
[23] Y. Muhammad, M. Tahir, M. Hayat and K. Chong, “Early and accurate detection and diagnosis of heart disease using intelligent computational model”, https://doi.org/10.1038/s41598-020-76635-9, Published by Scientific Reports, 10, Issue- 4, PP. 1-18, November 2020.
[24] P. Ghosh, S. Azam, M Jonkman, S. Karim, F. M. J. M Shamrat, E. Ignatius, S. Sultana, A. R. Beeravolu, and A. F. De Boer, “Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques”, DOI: 10.1109/ACCESS.2021.3053759, IEEE ACCESS, Vol. 09, Issue-, PP. 1-23, February 2021.
[25] M. Ahmad, M. Alfayad, S. Aftab,M. A. Khan, A. Fatima, B. Shoaib, M. Sh. Daoud and N. S Elmitwally “Data and Machine Learning Fusion Architecture for Cardiovascular DiseasePrediction”, CMC-Computers, Materials & Continua, doi:10.32604/cmc.2021.019013, Vol. 69, Issue-02, PP. 1-15, April 2021.
Copyright (c) 2023 Journal of Systems Engineering and Information Technology (JOSEIT)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).