Bayesian Hierarchical Modeling and Clustering for Malignant Cancer Diagnosis

Authors

  • Sijia Zhu Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore 21218, USA. https://orcid.org/0009-0006-3694-6473 Author https://orcid.org/0009-0006-3694-6473
  • Jonathan Ma Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore 21218, USA. https://orcid.org/0009-0005-1185-3281 Author
  • Zhe Liu School of Computer Sciences, Universiti Sains Malaysia, Penag 11800, Malaysia. https://orcid.org/0000-0002-8580-9655 Author

DOI:

https://doi.org/10.59543/w22sya71

Keywords:

Late-stage cancer; Bayesian hierarchical logistic regression; No-U-Turn sampler; Patient heterogeneity; Cancer risk cluster

Abstract

Late-stage cancer diagnosis remains a major barrier to improving survival rates, yet the relative contributions of tumor, patient, and region-level factors have not been well quantified. In this study, we develop a 3-layer Bayesian hierarchical logistic regression model to investigate late-stage cancer diagnosis. The model includes fixed effects for tumor characteristics and random effects for patients and regions. Model parameters are estimated using the No-U-Turn Sampler, and posterior samples are evaluated with effective sample sizes and convergence diagnostics. From intra-class correlation estimates, we find that patient-level variation has a substantially stronger influence on late-stage diagnosis than region-level variation. Lastly, we utilize a Gaussian Mixture Model to cluster posterior patient-level random effects, identifying nine distinct clusters characterized by age, sex, and tumor features. Our findings suggest that individualized, patient-focused strategies may be more effective than geographically targeted approaches for promoting earlier cancer detection.

Downloads

Published

2026-02-07

How to Cite

Zhu, S., Ma, J., & Liu, Z. (2026). Bayesian Hierarchical Modeling and Clustering for Malignant Cancer Diagnosis. Argumentation Based Systems Journal, 2, 96-110. https://doi.org/10.59543/w22sya71

Issue

Section

Articles