Skip to content
Universities of Wisconsin
Call Now608-262-2011 Call 608-262-2011 Request Info Request Info Search the UW Extended Campus website Search
Wisconsin Online Collaboratives
  • About Us
    • About Us
    • Accreditation
    • Our Campus Partners
  • Degrees & Programs
  • Admissions & Aid
    • How to Apply
    • Admission Pathways
    • Important Dates
    • Tuition & Financial Aid
    • Transferring Credits
    • Contact an Enrollment Adviser
  • Online Learning
    • About Online Learning
    • Online Learning Formats
    • Capstone Projects
    • Success Coaching
    • Technology Requirements
  • Stories & News
Home Home / Capstone Projects / Predicting Risk of Heart Disease for Early Detection: A Machine-Learning Approach

Predicting Risk of Heart Disease for Early Detection: A Machine-Learning Approach

Program: Data Science Master's Degree
Location: Not Specified (remote)
Student: Larry Vue

Cardiovascular disease (heart disease) is still the leading cause of death worldwide and significantly impacts healthcare expenses. With a better understanding of data and newer developments in data science, the need for early detection can save lives and reduce healthcare costs [1][2]. This project developed and evaluated supervised learning models to predict heart disease using clinical variables. The data primarily come from the UCI Cleveland Heart Disease dataset and a larger Kaggle cardiovascular dataset. Data cleaning, exploratory analysis, and feature engineering were used to assess the predictive value of key risk factors. The data helped compare baseline and advanced machine learning classifiers, including logistic regression, decision trees, random forests, and gradient boosting (XGBoost), using stratified cross-validation. The focus, aligned with business metrics, is to prioritize recall to minimize false negatives. Class imbalance was addressed using class weights and thresholds, along with ROC/PR analysis and cost-sensitive decision-making. Results show that the interpretations of tree-based models align with clinically relevant relationships. The final model will be a calibrated logistic regression. This achieved strong ranking performance on the internal test set and an interpretable coefficient profile for clinicians. Error analysis revealed that false negatives are often in hard-to-see cases; lowering the threshold slightly reduced misses while maintaining acceptable precision. Overall, the project demonstrates a simple, interpretable model that can provide actionable risk. The model will show the importance of threshold choices for clinical workflows.  

Let's Get Started Together

Apply Apply Schedule an Advising Call Schedule an Advising Call Request Info Request Info

This field is for validation purposes and should be left unchanged.
Are you interested in pursuing the degree or taking one or two courses?(Required)
Can we text you?(Required)

By selecting yes, I agree to receive updates about online degrees, events, and application deadlines from the Universities of Wisconsin.

Msg frequency varies depending on the activity of your record. Message and data rates may apply. Text HELP for help. You can opt out by responding STOP at any time. View our Terms and Conditions and Privacy Policy for more details.

Wisconsin Online Collaboratives will not share your personal information. Privacy Policy

Wisconsin Online Collaboratives

A Collaboration of the
Universities of Wisconsin

University of Wisconsin System

Pages

  • Our Degrees & Programs
  • How to Apply
  • Online Learning Formats
  • Our Campus Partners

Enrollment Advising

608-800-6762
learn@uwex.wisconsin.edu

Contact

780 Regent Street
Suite 130
Madison, WI 53715

Technical Support

1-877-724-7883
https://uwex.wisconsin.edu/technical-support/

Connect

  • . $name .facebook
  • . $name .linkedin
  • . $name .instagram
  • . $name .youtube

Copyright © 2026 Board of Regents of the University of Wisconsin System. | Privacy Policy