Abstract
Hepatitis C is a prevalent disease, with an estimated 3-4 million new cases annually. This poses a significant public health concern, necessitating effective identification and treatment strategies. This study compares the performance of multi and binary-class labels using the same dataset, considering various evaluation metrics and tool comparisons. Furthermore, the study seeks to utilize Machine learning (ML) algorithms to identify key features in predicting the Hepatitis C Virus (HCV) using an Egyptian patient dataset. The results reveal that the Support Vector Machine (SVM) achieved the highest accuracy rate of 28.45% for the multi-class label. At the same time, the random forest attained an accuracy of 75.05% for the binary class label. Notably, the binary class performance outperformed the multi-class label. Additionally, multi-feature selection methods improved convergence speed and yielded better accuracies and precisions. Normalization and data scaling techniques also played a vital role in improving the results. Moreover, the study employed Bayesian Networks (BNs) and the Shapley Additive Explanations (SHAP) method to gain insights into the predictions made by the ML model. These techniques provided valuable explanations for the model's decisions, enhancing interpretability and aiding in understanding the factors driving the predictions. Overall, this study contributes to HCV prediction by comparing performance between different label types and exploring feature selection methods. The findings underscore the importance of accurate prediction and highlight the potential of advanced techniques such as BNs and SHAP for improved interpretability in ML models.
| Original language | British English |
|---|---|
| Title of host publication | 50th International Conference on Computers and Industrial Engineering, CIE 2023 |
| Subtitle of host publication | Sustainable Digital Transformation |
| Editors | Yasser Dessouky, Abdulrahim Shamayleh |
| Pages | 1348-1358 |
| Number of pages | 11 |
| ISBN (Electronic) | 9781713886952 |
| State | Published - 2023 |
| Event | 50th International Conference on Computers and Industrial Engineering: Sustainable Digital Transformation, CIE 2023 - Sharjah, United Arab Emirates Duration: 30 Oct 2023 → 2 Nov 2023 |
Publication series
| Name | Proceedings of International Conference on Computers and Industrial Engineering, CIE |
|---|---|
| Volume | 3 |
| ISSN (Electronic) | 2164-8689 |
Conference
| Conference | 50th International Conference on Computers and Industrial Engineering: Sustainable Digital Transformation, CIE 2023 |
|---|---|
| Country/Territory | United Arab Emirates |
| City | Sharjah |
| Period | 30/10/23 → 2/11/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Bayesian Network
- Hepatitis C
- Machine Learning
- Machine Learning Prediction
Fingerprint
Dive into the research topics of 'Comparative Analysis of Machine Learning Algorithms for Hepatitis C Virus (HCV) Prediction'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver