Serialization for Cybersecurity Logs: An IP Focused Approach

Program: Data Science Master's Degree
Host Company: Forescout
Location: Dallas, Texas (remote)
Student: Austin Phillips

This project was an attempt to take something cybersecurity companies have in abundance, device and application logs, and turn them into a format that could be used in machine learning
applications. As of now, methods do not exist to efficiently and accurately encode things like IP addresses and host names for machine learning usage. While not handling hostnames, a new method of encoding IPs was introduced, tested, and compared using a random forest classifier and a real-world dataset.