Experience & education.

Certified AI Ethicist and Data Scientist with 4 years' experience in the NHS ecosystem. Specialised in large-scale analytics, machine learning engineering and healthcare workforce modelling, with a focus on building production-ready pipelines and LLM-powered applications in Python, PySpark and Databricks.

Python · PySpark · Databricks · R · SQL · LLMs · Tableau · Git · Agile / Jira

Experience

  • Apr 2024 — Present

    Data Scientist

    NHS England · London, UK

    • Lead Developer on the community pharmacy modelling for the 10-Year Workforce Plan, implementing scenarios and assumptions in the core PySpark model with NHSE and DHSC.
    • Led the transition from Excel models to automated Databricks pipelines in Python.
    • Refactoring Python codebases to improve computational performance and analytical capability across cross-functional teams.
    • Delivering Databricks, Python and Git training to analysts, promoting coding best practice across data analysis teams.
  • Jun 2022 — Mar 2024

    Junior Data Scientist

    Health Education England · London, UK

    • Maintained a Python-based geospatial analysis and produced Tableau dashboards advising medical Deans on redistributing specialty training posts to address health inequalities.
    • Supported R modelling to predict complex birth deliveries for maternity service planning, using source data from RCOG statisticians.
    • Collaborated with Data Engineering and Product teams, using Jira and Agile practices to manage project timelines.
  • Apr 2022 — Jun 2022

    Data Manager

    UCL Great Ormond Street Institute of Child Health · London, UK

    • Worked with bioinformaticians and statisticians on the GSK Sotrovimab COVID-19 study, cleaning and transforming lab data.
    • Designed a MySQL server for secure data transfers from study entry points.

Selected Projects

  • Strategic Planning

    10-Year Workforce Plan modelling

    NHS England · DHSC

    • Refreshing the NHS Long-Term Workforce Plan modelling (published Jun 2023) with a multidisciplinary group, incorporating National Audit Office feedback.
  • AI & ML

    LLM & machine learning prototypes

    NHS · Hackathons & conferences

    • Pharmacy First agent: LLM-powered analysis of NHS pharmacy operations, open-sourced and presented at PyCon UK, NHS RPySOC and HACA 2025.
    • Reducing missed NHS appointments: ML solution built for a No. 10 Downing Street data science hackathon, targeting waiting-list costs.
    • NHS Career Coach: proof-of-concept chatbot built on Microsoft AI Studio that placed 2nd in a hackathon, showing how LLMs can support candidates.
    • OPEL prediction: ML models built on the NHS Federated Data Platform during a Palantir & KPMG hackathon.
  • Research

    Textbook contribution

    Foundations of Programming, Statistics, and ML for Business Analytics

    • Developed R and Python conversion code for the English edition (Apr 2023); co-authored the Korean translation (Feb 2025).

Education

  • 2020 — 2021

    MSc Data Science — Distinction

    City St George's, University of London

    • Ranked 1st for a system submitted to the Semantic Answer Type Prediction Task; presented at the International Semantic Web Conference (ISWC), October 2021.
  • 2018 — 2020

    PgCert Data Analytics

    University of Sheffield

    • Modules: Computer Security and Forensics, Machine Learning and Adaptive Intelligence, Statistical Data Science in R.
  • 2007 — 2011

    BSc Electronic Engineering

    Kyungpook National University · Daegu, South Korea

    • Modules: Electronic Engineering Lab (MATLAB), Calculus, Numerical Analysis.

Community & Volunteering

  • 2025 —

    LangChain Ambassador

    London

    • Hosting technical meetups and organising a hackathon in central London.
  • Ongoing

    UK–Korea Global Health Forum working group

    London School of Hygiene & Tropical Medicine

    • Organising a conference with senior researchers at LSHTM.
  • Ongoing

    Professional Mentor

    University of Greenwich · City St George's

    • Supporting early-career interview preparation.