Health Supply-Chain Demand Forecasting
12-month horizon on real PEPFAR shipment records — Rwanda focus.
End-to-end data-science and machine-learning projects on real public data — predictive modeling, statistical inference, forecasting, optimization, and experimentation across healthcare, energy, finance, retail, and beyond.
12-month horizon on real PEPFAR shipment records — Rwanda focus.
State-space + Fourier exog vs SARIMA on 145k hours of PJME load.
Poisson / Gamma / Tweedie GLMs on freMTPL2 with GBM challenger.
SARIMA + GBM with MinT-OLS to make item × store × week forecasts coherent.
MDP + Q-learning vs capped LP baseline on real KMPDC + SHA data.
Daily lake level on real Zambezi reservoir data; turbine discharge as exog.
Daily irradiance with weather covariates from NASA POWER API.
Tenure-as-time, churn-as-event; KM, Cox PH, Weibull AFT, log-rank stratification.
Daily volume forecast on top route + GBM price predictor across all routes.
Bedroom + property-type + neighborhood features on 9,607 Lagos sale listings.
Predict farm sales from lat/lon + farm + climate features across multiple African countries.
XGBoost vs RF vs LogReg with calibration plot and retention-queue ranking.
Welch t-tests + ANOVA + OLS adjustment + Bayesian posterior on a real 3-arm trial.
Data Kaggle CLI · NASA POWER · pandas · SQL · EDA matplotlib · seaborn · seasonal_decompose · Modeling statsmodels · scikit-learn · XGBoost · lifelines · Validation rolling-origin backtest · cross-validation · log-rank · ANOVA · calibration · Deployment FastAPI · Streamlit · pickled artifacts · scheduled retrain.
PhD in Mathematics (Topology) from the University of Cape Town. Career split between rigorous applied mathematics and hands-on data science, with 10+ years of experience across healthcare, finance, energy, insurance, retail, and government environments. Co-author of The Shape of Data (No Starch Press, 2024) — a graduate-level textbook on geometry-based machine learning. h-index 12 across 18+ peer-reviewed papers. Bilingual EN/FR.