Heart Failure Prediction using Machine Learning

IEEE BASE PAPER TITLE:

Predicting the Classification of Heart Failure Patients Using Optimized Machine Learning Algorithms

IEEE BASE PAPER ABSTRACT:

Heart failure is a critical condition with a high mortality rate, making accurate survival prediction essential for timely interventions. This study proposes an optimized machine learning approach using Gradient Boosting Machine (GBM) and Adaptive Inertia Weight Particle Swarm Optimization (AIW-PSO) to predict heart failure survival. The dataset, sourced from Kaggle, includes clinical features such as age, ejection fraction, and serum creatinine levels for 299 heart failure patients. To address the imbalance in survival outcomes, Synthetic Minority Over-sampling Technique (SMOTE) was employed to balance the dataset, followed by SelectKBest and Chi-square feature selection methods to retain the most significant predictors. The optimized hyperparameters for the GBM model were identified using the AIW-PSO algorithm, which effectively balanced exploration and exploitation by adaptively adjusting inertia weights. Model selection was further refined using information criteria, including Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), ensuring that the best-performing model was chosen based on both predictive accuracy and model complexity. The optimized GBM model achieved a test accuracy of 94%, demonstrating superior performance compared to traditional machine learning models. The study underscores the importance of hyperparameter tuning through metaheuristic algorithms and highlights the potential of AIW-PSO in enhancing model performance for clinical prediction tasks. These findings have significant implications for clinical decision-making, offering a reliable and interpretable tool for predicting patient outcomes in heart failure management.

PROJECT OUTPUT VIDEO:

ALGORITHM / MODEL USED:

Stacking Classifier, XGB Classifier.

OUR PROPOSED PROJECT ABSTRACT:

Heart failure is a critical cardiovascular condition that requires early detection and intervention to improve patient outcomes. Heart disease remains one of the leading causes of mortality worldwide, and early prediction of heart failure can significantly enhance the effectiveness of clinical interventions. This project, titled “Heart Failure Prediction using Machine Learning” aims to develop an intelligent system capable of predicting the likelihood of heart failure in patients based on clinical parameters. The system is developed using Python as the primary coding language, with HTML, CSS, and JavaScript employed for the front-end interface, and Flask serving as the web framework to integrate machine learning models with a user-friendly web application.

The predictive model was trained on a dataset comprising 5,000 patient records with 12 distinct clinical features, including age, anemia, creatinine phosphokinase levels, diabetes status, ejection fraction, high blood pressure, platelet count, serum creatinine, serum sodium, sex, smoking habits, and follow-up time, with death event as the target variable. These features capture essential cardiovascular and metabolic indicators that significantly influence heart failure prognosis.

Two advanced machine learning algorithms were implemented and evaluated: the Stacking Classifier and XGBoost (XGB) Classifier. Experimental results indicate that both models performed with exceptional accuracy. The Stacking Classifier achieved a training accuracy of 99% and a testing accuracy of 99%, demonstrating strong generalization capability. Similarly, the XGBoost Classifier attained a training accuracy of 99% and a testing accuracy of 99%, confirming the robustness of ensemble-based learning techniques for medical prediction tasks.

The proposed system enables healthcare professionals to upload patient datasets and instantly receive predictions on potential heart failure outcomes, providing valuable clinical insights. Overall, this project demonstrates how machine learning and web technologies can be effectively integrated to create an accurate, accessible, and efficient predictive tool for early heart failure diagnosis, thereby contributing to improved patient care and preventive healthcare analytics.

SYSTEM REQUIREMENTS:

HARDWARE REQUIREMENTS:

System : Pentium i3 Processor.
Hard Disk : 20 GB.
Monitor : 15’’ LED.
Input Devices : Keyboard, Mouse.
Ram : 8 GB.

SOFTWARE REQUIREMENTS:

Operating System : Windows 10 / 11.
Coding Language : Python 3.12.0.
Web Framework : Flask.
Frontend : HTML, CSS, JavaScript.

REFERENCE:

MARZIA AHMED, MOHD HERWAN SULAIMAN, MD MARUF HASSAN, AND TOUHID BHUIYAN, “Predicting the Classification of Heart Failure Patients Using Optimized Machine Learning Algorithms” IEEE ACCESS, VOLUME 13, 2025.

👉CLICK HERE TO BUY THIS PROJECT “Heart Failure Prediction using Machine Learning” SOURCE CODE👈

1. What is the main objective of this project?

The main objective of this project is to develop a machine learning-based system capable of predicting the likelihood of heart failure in patients using clinical data. By analyzing parameters such as age, ejection fraction, serum creatinine, blood pressure, and other physiological indicators, the system aims to assist healthcare professionals in early identification of patients at risk, enabling timely medical intervention.

2. Which technologies are used to develop this project?

The project is developed using Python as the main programming language for model training and prediction. The Flask framework is used for backend development to connect the machine learning models with the front end. The front-end interface is created using HTML, CSS, and JavaScript for dataset upload, visualization, and result display.

3. What type of dataset is used in this project?

The dataset used in this project is a clinical dataset containing 5,000 patient records. Each record includes medical and demographic attributes such as age, anaemia, creatinine phosphokinase, diabetes, ejection fraction, high blood pressure, platelets, serum creatinine, serum sodium, sex, smoking, time, and DEATH_EVENT. The dataset helps the system learn patterns associated with heart failure outcomes.

4. What machine learning algorithms are implemented?

The project uses two powerful machine learning models: Stacking Classifier and XGBoost (Extreme Gradient Boosting) Classifier. The Stacking Classifier combines the predictions of multiple base learners to improve classification accuracy through ensemble learning. The XGBoost Classifier is a gradient boosting algorithm optimized for speed and performance, which constructs an ensemble of decision trees to minimize classification errors.

5. What are the accuracy results achieved by the models?

Both the Stacking Classifier and the XGBoost Classifier achieved a training accuracy of 99% and a testing accuracy of 99%, indicating excellent predictive performance and strong generalization capability. These results confirm the reliability of the system for heart failure prediction.

6. How does the system work?

The system follows a systematic workflow. First, the user uploads a dataset in .csv format through the web interface. The Flask backend then validates the file and performs preprocessing tasks such as handling missing values, encoding categorical features, and normalizing data. The cleaned data is passed into the trained machine learning models, which predict the likelihood of heart failure for each patient record. The prediction results and performance metrics are then displayed on the front end, accompanied by visual graphs for better interpretation.

7. What are the key features of the system?

The system supports dataset upload, real-time prediction, performance visualization, and model comparison. It includes automated data preprocessing, integrates high-accuracy machine learning models, and provides interactive charts for analytical understanding. It is also web-deployed, allowing users to interact with it through any modern browser without requiring specialized software.

8. What is the role of Flask in this project?

Flask acts as the middleware connecting the backend machine learning models with the front-end user interface. It handles routing, request processing, and communication between the uploaded dataset and the trained models. Flask ensures that user inputs (such as uploaded data) are processed correctly and that model predictions are sent back to the web interface for visualization.

9. Why was XGBoost chosen as one of the models?

XGBoost was selected due to its high efficiency, scalability, and strong predictive performance. It uses gradient boosting with regularization, which helps reduce overfitting and improves generalization. XGBoost is particularly effective in handling structured datasets like clinical data and is widely used in healthcare prediction tasks for its speed and accuracy.

10. What is the significance of using the Stacking Classifier?

The Stacking Classifier combines multiple base models to improve the overall predictive capability. Instead of relying on a single algorithm, it uses a meta-model to integrate the strengths of various classifiers. This layered ensemble approach leads to higher stability and accuracy, making it ideal for complex medical datasets where relationships among variables can be non-linear and multifactorial.

11. How are data visualizations used in the system?

Data visualizations are used to make the analytical results more understandable and informative. The system displays multiple charts such as age distribution, gender ratio, comorbidity analysis, model accuracy comparison, and death event distribution. These charts help users visually interpret trends, correlations, and prediction outcomes, aiding in better medical insights and data-driven decision-making.

12. How does the system ensure data preprocessing and quality?

Before training or prediction, the system performs several preprocessing steps including handling missing values, encoding categorical variables, scaling numerical features, and splitting data into training and testing sets. This process ensures that the dataset is clean, consistent, and suitable for accurate machine learning predictions. Automated preprocessing also reduces the chances of human error and enhances model performance.

13. What metrics are used to evaluate model performance?

The models are evaluated using various metrics such as Accuracy, Precision, Recall, F1-Score, and Confusion Matrix. These metrics collectively provide a comprehensive understanding of the system’s predictive power and its ability to balance false positives and false negatives, which is crucial for healthcare predictions.

14. Is this system useful for real-time prediction?

Yes, the system supports real-time prediction. Once a data is uploaded through the Flask web interface, predictions are processed immediately, and results are displayed on the web page within seconds. This feature enables healthcare professionals to analyze patient data quickly and obtain risk predictions in real time.

15. What makes this system different from Existing System?

The existing traditional diagnostic methods rely heavily on manual analysis and physician experience, which can be time-consuming and subjective. In contrast, this system leverages machine learning algorithms to analyze large datasets, identify hidden patterns, and make objective predictions based on data. It also provides visual insights and web-based accessibility, making it more efficient, accurate, and user-friendly.

16. How does this project contribute to healthcare improvement?

The project contributes to healthcare by providing a data-driven decision-support tool that helps in early identification of heart failure risks. By automating predictions and simplifying data interpretation, it assists medical practitioners in making faster and more informed decisions, ultimately improving patient care and reducing the chances of late diagnosis.

Python IEEE Projects

Heart Failure Prediction using Machine Learning