More Projects

UFC Fight Predictions

Multiple ML models trained on 5,000+ UFC fights (1993–2019) to predict outcomes, then used to forecast the 5 main fights in UFC 259 with SKLearn, XGBoost, and Keras.

Anomaly Detection with ML

Applies k-means, isolation forests, SVM, and LSTMs to NAB benchmark datasets, then tests supervised fraud detection techniques on a credit card dataset.

CNN & Transfer Learning

Two CNNs for mask detection — one trained from scratch, one leveraging InceptionResNetV2 transfer learning — demonstrating pre-trained model efficiency.

Regression Inference — Life Expectancy

OLS regression with p-value-driven feature selection on world health data to identify the strongest statistical predictors of life expectancy.

Visualizing Wages in the US

Interactive Plotly visuals from Federal Reserve data exploring the effects of the dot-com boom, 2008 crisis, and Covid-19 on US wages and employment.

San Francisco Housing Analysis

Plotly visuals on Bay Area housing trends using Federal Reserve indices, with a Facebook Prophet forecast of future housing costs.

Covid-19 Choropleth Maps

Plotly choropleth maps visualizing Covid-19 cases and deaths across US states and global countries over time since the start of the pandemic.

Multiple Models — Titanic Dataset

Eight ML algorithms (Naive Bayes through XGBoost and Neural Networks) benchmarked on the Titanic survival dataset using k-fold cross-validation.

Covid-19 Bar Chart Race

Animated bar chart race of US state-level Covid-19 case counts from January to September 2020, built with Pandas and Matplotlib.

Traveling Salesman

Random search algorithm solving the Traveling Salesman Problem with a Tkinter map visualization showing the optimal city visit order.

UK Used Car Dashboard

Plotly/Dash dashboard on 44,000+ UK used cars scraped from Autotrader with BeautifulSoup, deployed on Heroku for interactive price exploration.

Work Projects

Customer Lifetime Value Models
Loyalty Churn Model
Comments Ingestion NLP Pipeline
Omnichannel Propensity Model
Large Language Model Prototype
Snowflake Cortex LLM Dashboards
EComm Customer Segmentation
Retail Store Clustering Analysis
Brand Health Survey Analysis
Email NLP Analysis
Attribution Model
Customer Acquisition Model
MLDevOps Dashboard
Customer Survival Analysis
Cox Proportional Hazard Model
Snowpark CLTV POC
Customer Service Dashboard
Early Access Product Selector

Contact Me

I am eager to help on any Data Science or Data Visualization projects I can. Please feel free to contact me via email or by phone. I am currently living in Framingham, Massachusetts but am willing to help no matter where you stay. Please search for me on Linkedin, GitHub, or even Kaggle and upvote my notebooks if you find them helpful or useful. Thank you!

Phone

+1 (617) 564-6001

Address

Framingham
Massachusetts, USA

LinkedIn

daniel-b-simpson

Instagram

danielbsimpson