Skip to content
Universities of Wisconsin
Call Now608-262-2011 Call 608-262-2011 Request Info Request Info Search the UW Extended Campus website Search
Wisconsin Online Collaboratives
  • About Us
    • About Us
    • Accreditation
    • Our Campus Partners
  • Degrees & Programs
  • Admissions & Aid
    • How to Apply
    • Admission Pathways
    • Important Dates
    • Tuition & Financial Aid
    • Transferring Credits
    • Contact an Enrollment Adviser
  • Online Learning
    • About Online Learning
    • Online Learning Formats
    • Capstone Projects
    • Success Coaching
    • Technology Requirements
  • Stories & News
Home Home / Capstone Projects / Using Machine Learning to Predict Home Runs for a National TV Audience

Using Machine Learning to Predict Home Runs for a National TV Audience

Program: Data Science Master's Degree
Location: Not Specified (remote)
Student: Jack Puncochar

The introduction of the Statcast tracking system increased the prevalence of analytics in baseball. Research emerged claiming that Statcast metrics, exit velocity and launch angle, are strong predictors of home run hitting after contact. However, there is no research exploring the pre-contact probability of hitting a home run. Understanding the probability that a hitter goes yard before the pitch can provide national audiences with valuable content. A home run probability model exists at one media company, but the model suffers from overfitting. This study dealt with extreme class-imbalance (< 5% of batted balls are home runs). Both logistic regression and Naïve Bayes performed poorly using precision and recall for model assessment. The poor performance was attributed to uncertainty in data that is only known pre-pitch and it was found that home run classification may not be meaningful when the desired output is a probability. Instead, log loss was used for model selection and logistic regression was selected to estimate predicted probabilities on new data. An R Shiny application made it possible to display results of the HR probability model on live pitches. The application needed bug fixes and optimized code prior to being sent to the client. Another limitation with the R Shiny app was a lack of automation in updating with Statcast’s real-time updates. The HR probability system was not sent to the client; however, the framework to efficiently process Statcast data and deploy an accurate HR probability model will help them emerge as leaders in the industry. 

Let's Get Started Together

Apply Apply Schedule an Advising Call Schedule an Advising Call Request Info Request Info

This field is for validation purposes and should be left unchanged.
Are you interested in pursuing the degree or taking one or two courses?(Required)
Can we text you?(Required)

By selecting yes, I agree to receive updates about online degrees, events, and application deadlines from the Universities of Wisconsin.

Msg frequency varies depending on the activity of your record. Message and data rates may apply. Text HELP for help. You can opt out by responding STOP at any time. View our Terms and Conditions and Privacy Policy for more details.

Wisconsin Online Collaboratives will not share your personal information. Privacy Policy

Wisconsin Online Collaboratives

A Collaboration of the
Universities of Wisconsin

University of Wisconsin System

Pages

  • Our Degrees & Programs
  • How to Apply
  • Online Learning Formats
  • Our Campus Partners

Enrollment Advising

608-800-6762
learn@uwex.wisconsin.edu

Contact

780 Regent Street
Suite 130
Madison, WI 53715

Technical Support

1-877-724-7883
https://uwex.wisconsin.edu/technical-support/

Connect

  • . $name .facebook
  • . $name .linkedin
  • . $name .instagram
  • . $name .youtube

Copyright © 2026 Board of Regents of the University of Wisconsin System. | Privacy Policy