Portfolio

Senior data science and AI/ML engineering work across healthcare AI, pricing, recommender systems, and experimentation.

A focused view of work that best represents my profile: production-minded machine learning, applied research, data products, and business-facing analytics with measurable impact.

Featured case studies

High-impact applied work

Healthcare AI / Decision Support

Causal Foundry: claims analytics and health financing models

Built healthcare analytics and decision-support systems using claims and provider data to support financing reform, provider performance, anomaly detection, and operational monitoring.

  • Worked with datasets covering more than 9M insured individuals and 4,500+ healthcare facilities.
  • Built large-scale pipelines processing 100M+ healthcare claims records using Python and SQL.
  • Developed monitoring systems with anomaly detection and automated indicators across 1,000+ providers.
  • Reconciled 5,000+ drug SKUs across seven agencies using text similarity and embedding-based matching.
Healthcare AI Claims analytics Provider performance Anomaly detection

Pricing / Commercial ML

ZF Group: pricing leakage detection and optimization

Developed anomaly detection, elasticity, uplift, and optimization models for commercial pricing decisions across a large industrial product portfolio.

  • Protected approximately EUR11M annually by identifying pricing inconsistencies and channel leakage.
  • Built pricing and uplift models across a EUR500M+ portfolio, contributing to a 13% gross-profit improvement.
  • Deployed PySpark and Databricks pipelines with engineering partners for commercial analytics workflows.
Pricing XGBoost PySpark Databricks

Digital Health / Experimentation

Mobile health engagement and demand forecasting

Built forecasting, survival, and experimentation workflows for pharmacy supply chains, healthcare workers, and mHealth applications.

  • Forecasted demand for pharmacy networks using DeepAR and time-series modeling, reducing stockouts by 18%.
  • Improved engagement outcomes with survival modeling, churn prediction, contextual bandits, and rule-based optimization.
  • Designed A/B testing, multi-armed bandit, and reinforcement-learning approaches for adaptive interventions.
Forecasting Survival analysis A/B testing Bandits

AI and recommender systems

Engineering projects

Recommendation systems diagram

Multimodal LLMs / Recommenders

Multi-Modal LLM-based Product Recommender System

Built a recommendation system that combines product reviews, metadata, images, temporal ordering, and multimodal feature fusion to predict the next items a user may purchase or review.

  • Built a multimodal recommender over approximately 3M Amazon interactions using review text and image embeddings.
  • Fine-tuned a GPT-2 style recommendation model with textual, visual, and temporal features.
  • Implemented train/validation/test processing that respects time ordering to reduce leakage.
  • Reported NDCG@5 of 0.22 and P@5 of 0.29, with evaluation harnesses, ablations, and documentation.
LLMs Computer vision Ranking metrics PyTorch
Decision optimization visual

Experimentation / Reinforcement learning

Deep Contextual Bandits

Adapted deep contextual bandit ideas for reusable experimentation workflows, focusing on Bayesian neural networks, Thompson sampling, and decision-making under uncertainty.

  • Grounded in the ICLR 2018 Deep Bayesian Bandits benchmark.
  • Built toward a package that can run contextual bandit algorithms on arbitrary datasets.
  • Connects directly to product experimentation and adaptive intervention design.
Contextual bandits Thompson sampling TensorFlow Experimentation
Retail analytics dashboard

Customer analytics / ML

Business ML: retention, attribution, and segmentation

Converted online retail transactions into a business analytics workflow covering revenue, retention, customer growth, attribution, journey analysis, and RFM-based segmentation.

Cohort analysis Attribution RFM Segmentation

Publications

Research and papers

ISIR-eCom 2025

Enhancing Product Recommendations with Multi-Modal LLMs

Research on multimodal product recommendation using text and image representations for next-item prediction.

2023

Power Samade distribution: properties and application to real lifetime data

Nigerian Journal of Science and Environment paper on distributional modeling and lifetime data analysis.

2018

Homework vs. In Class-Exercise: Means of Assessment, Waste of Time or Punishment?

International Journal of Scientific and Engineering Research.

Writing

Selected articles

2019

Identifying networks in customer reviews

Network analysis applied to customer review relationships and behavioral insight discovery.

Selected earlier work

Analytics breadth

Is the movie industry dying?

Explored film revenue, budget, genre, rating, audience, cast, and director effects to advise production strategy.

The determinants of happiness

Applied statistical modeling to study drivers of subjective wellbeing and socioeconomic outcomes.

Is comparison really the thief of joy?

Empirical analysis using South African data to study comparison, life satisfaction, and economic context.

Data visualization practice

Built a reference collection of Python visualization patterns inspired by The Economist, data-to-viz, and storytelling-with-data practices.