Comparative Analysis of Machine Learning Algorithms for Hepatitis C Virus (HCV) Prediction

Khaled M. Toffaha, Mecit Can Emre Simsekler, Mohammed Atif Omar

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Hepatitis C is a prevalent disease, with an estimated 3-4 million new cases annually. This poses a significant public health concern, necessitating effective identification and treatment strategies. This study compares the performance of multi and binary-class labels using the same dataset, considering various evaluation metrics and tool comparisons. Furthermore, the study seeks to utilize Machine learning (ML) algorithms to identify key features in predicting the Hepatitis C Virus (HCV) using an Egyptian patient dataset. The results reveal that the Support Vector Machine (SVM) achieved the highest accuracy rate of 28.45% for the multi-class label. At the same time, the random forest attained an accuracy of 75.05% for the binary class label. Notably, the binary class performance outperformed the multi-class label. Additionally, multi-feature selection methods improved convergence speed and yielded better accuracies and precisions. Normalization and data scaling techniques also played a vital role in improving the results. Moreover, the study employed Bayesian Networks (BNs) and the Shapley Additive Explanations (SHAP) method to gain insights into the predictions made by the ML model. These techniques provided valuable explanations for the model's decisions, enhancing interpretability and aiding in understanding the factors driving the predictions. Overall, this study contributes to HCV prediction by comparing performance between different label types and exploring feature selection methods. The findings underscore the importance of accurate prediction and highlight the potential of advanced techniques such as BNs and SHAP for improved interpretability in ML models.

    Original languageBritish English
    Title of host publication50th International Conference on Computers and Industrial Engineering, CIE 2023
    Subtitle of host publicationSustainable Digital Transformation
    EditorsYasser Dessouky, Abdulrahim Shamayleh
    Pages1348-1358
    Number of pages11
    ISBN (Electronic)9781713886952
    StatePublished - 2023
    Event50th International Conference on Computers and Industrial Engineering: Sustainable Digital Transformation, CIE 2023 - Sharjah, United Arab Emirates
    Duration: 30 Oct 20232 Nov 2023

    Publication series

    NameProceedings of International Conference on Computers and Industrial Engineering, CIE
    Volume3
    ISSN (Electronic)2164-8689

    Conference

    Conference50th International Conference on Computers and Industrial Engineering: Sustainable Digital Transformation, CIE 2023
    Country/TerritoryUnited Arab Emirates
    CitySharjah
    Period30/10/232/11/23

    Keywords

    • Bayesian Network
    • Hepatitis C
    • Machine Learning
    • Machine Learning Prediction

    Fingerprint

    Dive into the research topics of 'Comparative Analysis of Machine Learning Algorithms for Hepatitis C Virus (HCV) Prediction'. Together they form a unique fingerprint.

    Cite this