Michael G. Sullivan

Data Analytics Projects and Related Professional Content

Hi, I’m Michael. Thanks for stopping by. This site is dedicated to my work in data analytics.

Here are my most recent blog posts:

Predicting Adult Income: An End-to-End Machine Learning Pipeline Project

This project uses the Adult Income dataset to explore how demographic and work-related factors influence earnings. In the feature engineering step, a custom transformer creates a new “workload level” variable that captures the relationship between education and weekly work hours. Multiple models are tested in a scikit-learn pipeline, with Logistic Regression emerging as the most efficient and…

Predicting Presence of Liver Cancer Using Logistic Regression

This project walks through building a logistic regression model on synthetic clinical data to predict whether a patient has or will develop liver cancer. It includes a detailed exploratory data analysis (EDA) section, many visualizations, and step-by-step explanations of the process and the interpretation of the results.