Universities of Wisconsin
Wisconsin Online Collaboratives

Validatable Data Pipeline and Reporting for Regulated Industry Custom Development Projects

Program: Data Science Master's Degree
Location: Not Specified (remote)
Student: Patrick Cassidy

This project created a controlled software environment for potential use in FDA-regulated research, development, and manufacturing. Within that environment, Biopython was used to analyze protein sequence data from NCBI and identify potential target sequences for recombinant manufacturing in E. coli.

The environment was built with Docker and included Git, the AWS CLI, and Micromamba to manage the Python dependencies (Biopython, boto3, Jupyter, and pyMSAviz). Git/GitHub and AWS supplied the compliance features, namely user access control, version history, and controlled storage.
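As an illustration of how the contents of such an environment might be documented for later review, the following is a minimal sketch that records the interpreter and package versions found inside the container. It is not taken from the project: the function name `snapshot_environment` and the exact distribution names in `PACKAGES` are assumptions based on the dependency list above.

```python
import json
import platform
from importlib import metadata

# Assumed distribution names, based on the dependencies named in the text.
# Adjust them to match the actual environment file.
PACKAGES = ["biopython", "boto3", "jupyter", "pymsaviz"]


def snapshot_environment(packages=PACKAGES):
    """Return a dict describing the interpreter and installed package versions."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # package is not installed here
    return {
        "python": platform.python_version(),
        "platform": platform.platform(),
        "packages": versions,
    }


if __name__ == "__main__":
    # Print a stable, sorted JSON record that could be committed to Git.
    print(json.dumps(snapshot_environment(), indent=2, sort_keys=True))
```

Committing a record like this alongside the analysis would tie each result to a known set of versions in the Git history.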

The container was deployed on a local machine for protein sequence analysis of insulin. Sequence data was retrieved from NCBI with the BLAST tool and exported as FASTA files, then cleaned and filtered for alignment in MEGA. The aligned sequences were annotated, analyzed, and compared for desired characteristics to find the 10 most promising targets for future development. To verify that the results were reproducible, the analysis was replicated on a second machine.
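The cleaning and filtering step is not described in detail, so the following is only a sketch of what such a step could look like. It uses a small pure-Python FASTA reader as a stand-in for Biopython so that it runs without extra packages. The filter criteria (length bounds, standard residues only, no duplicate sequences) and the names `parse_fasta` and `clean_records` are hypothetical, not the project's actual rules.

```python
from typing import Iterable, Iterator, List, Tuple

# The 20 standard amino acid one-letter codes.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)


def clean_records(
    records: Iterable[Tuple[str, str]], min_len: int, max_len: int
) -> List[Tuple[str, str]]:
    """Keep the first copy of each sequence that passes the assumed filters."""
    seen = set()
    kept = []
    for header, seq in records:
        seq = seq.rstrip("*")  # drop a trailing stop symbol, if present
        if not (min_len <= len(seq) <= max_len):
            continue  # outside the assumed length window
        if set(seq) - STANDARD_RESIDUES:
            continue  # contains ambiguous or non-standard residues
        if seq in seen:
            continue  # exact duplicate of an earlier record
        seen.add(seq)
        kept.append((header, seq))
    return kept
```

For example, `clean_records(parse_fasta(open("hits.fasta")), 50, 150)` would return the surviving records in file order; the file name and the bounds here are placeholders.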

The demonstration of compliance features and reproducibility of results shows this project could be a foundation for a validatable data pipeline in FDA-regulated biopharmaceutical production. 
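How the replicated results were compared is not stated. One possible approach, sketched here under that assumption, is to hash every output file on each machine and compare the resulting manifests. The function names and the idea of a checksum manifest are illustrative and not part of the project description.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(directory):
    """Map each file's path (relative to directory) to its SHA-256 digest."""
    root = Path(directory)
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def compare_manifests(first, second):
    """Return sorted paths that are missing on one side or whose digests differ."""
    names = set(first) | set(second)
    return sorted(n for n in names if first.get(n) != second.get(n))
```

An empty list from `compare_manifests` would indicate that both machines produced byte-identical outputs. Outputs that embed timestamps or machine-specific paths would need to be normalized before a comparison like this is meaningful.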


Copyright © 2026 Board of Regents of the University of Wisconsin System.