SCANS (Michigan)

Self-Consistency Analysis of Surveillance

Basic Model Attributes
Cancer site	Prostate
Host institution	University of Michigan
Purpose	This is a model of prostate cancer initiation, incidence, disease progression and presentation at diagnosis, survival, and mortality. The model provides a quantitative functional link between the history of prostate cancer from birth to death and screening and treatment as control interventions. It is used to understand, predict, and optimize the impact of dynamic screening and treatment. Its purpose is to unravel the myriad causes and relationships that underlie recent population trends and the results of screening and clinical trials, and to provide relevant statistical inference and predictions using confidence intervals and hypothesis tests.
Contact	Alex Tsodikov (tsodikov@umich.edu)
Profile	cisnet_prostate_umich_profile.pdf

The Self-Consistency Analysis of Surveillance (SCANS) model of prostate cancer natural history is a statistical model built by analytic mathematics rather than computer simulation algorithms. It consists of a set of integrated submodels that fold into a joint model of cancer incidence, disease presentation (stage and grade) at diagnosis, disease progression in screen-detected persons incorporating early detection and treatment benefit, and survival and cancer mortality. The system of submodels is designed so they can be fit sequentially to cancer incidence, survival, and mortality registry data from the US (Surveillance, Epidemiology, and End Results [SEER] program) and Europe (European Union registry [EUREG]), and data from the screening trials (Prostate, Lung, Colorectal, and Ovarian [PLCO] cancer screening trial in the US and European Randomized Study of Screening for Prostate Cancer [ERSPC] in Europe). The model consists of the following components.

To model incidence, we use a two-stage model for the screening schedule (a point process). The first stage is defined by the hazard of the first prostate-specific antigen (PSA) test for a man at a given age and calendar time. A second testing intensity is defined for men who have already had their first PSA test. Both intensities of PSA testing are specified using data from a retrospective analysis of PSA testing¹. Cancer diagnosis is defined as the result of two competing risks, clinical diagnosis (CDx) and diagnosis due to screening (SDx), whichever comes first. The two risks are dependent through a common natural history of the disease, and each is zero until the onset of a detectable tumor. Estimation is based on parametric maximum likelihood. Contributions of population data to the likelihood represent an average over the unobserved screening schedule and natural history processes². Once the stochastic process mixed model is fit, lead time, overdiagnosis, age at tumor onset, and other characteristics of the patient and the population are predicted using Bayesian conditional probabilities.
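As a rough illustration of the competing-risks idea above, the sketch below assumes constant (exponential) hazards for clinical and screen diagnosis after tumor onset, independent given onset. That is a simplifying assumption made only for this example: SCANS itself uses age- and calendar-time-dependent PSA testing intensities and is fit by maximum likelihood. All function names and rates are hypothetical.

```python
import math


def prob_screen_detected(rate_cdx: float, rate_sdx: float) -> float:
    """P(SDx occurs before CDx) for two independent exponential risks after onset."""
    return rate_sdx / (rate_cdx + rate_sdx)


def prob_undiagnosed(t: float, rate_cdx: float, rate_sdx: float) -> float:
    """P(no diagnosis of either type within t years of tumor onset)."""
    return math.exp(-(rate_cdx + rate_sdx) * t)


def mean_lead_time(rate_cdx: float) -> float:
    """Expected lead time for a screen-detected case.

    With a constant clinical hazard, the remaining time to CDx after the
    moment of SDx is again exponential (memoryless), so its mean is 1/rate_cdx.
    """
    return 1.0 / rate_cdx


if __name__ == "__main__":
    # Hypothetical per-year hazards after onset (not estimates from the model).
    rate_cdx, rate_sdx = 0.1, 0.3
    print("P(screen-detected):", prob_screen_detected(rate_cdx, rate_sdx))
    print("P(undiagnosed at 5 years):", prob_undiagnosed(5.0, rate_cdx, rate_sdx))
    print("Mean lead time (years):", mean_lead_time(rate_cdx))
```

Both risks are zero before onset in this sketch simply because time is measured from onset; the dependence through a shared natural history would enter once the onset age itself is treated as random.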

To model disease presentation (stage, grade) at diagnosis, we represent disease stage and grade as a categorical mark attached to the incident cancer. We use a mixed multinomial logit model to specify the distribution of stage and grade at diagnosis, where the mixing variables represent the key unobserved features of the disease natural history prior to diagnosis (e.g., age at onset), predicted as conditional distributions given the information observed on the patient (e.g., age and year of diagnosis). The model is estimated by maximum likelihood. We developed a special method of artificial mixtures and the quasi-EM algorithm to deal with the curse of dimensionality in complex models³. Applications to the multinomial model and the stage- and grade-specific incidence model are given in a series of papers⁴⁻⁶. Conditional on age, year, stage, grade, and other patient characteristics at diagnosis (e.g., race), we generate model-based predictions of the unobserved characteristics of the latent natural history of the disease prior to onset and, counterfactually, after diagnosis (e.g., point of CDx and lead time in the absence of treatment).
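The mixing step can be sketched as follows: category probabilities are computed by a multinomial logit conditional on a latent variable, then averaged over that variable's conditional distribution. This is a minimal illustration only. The single latent covariate, its discrete support, and all coefficients are hypothetical, and the artificial-mixture and quasi-EM estimation machinery is not shown.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax: converts logit scores to probabilities."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mixed_multinomial_probs(
    intercepts: Sequence[float],
    slopes: Sequence[float],
    latent_values: Sequence[float],
    latent_weights: Sequence[float],
) -> List[float]:
    """Average multinomial-logit probabilities over a discrete latent variable.

    intercepts[j] + slopes[j] * u is the logit score of category j given the
    latent value u; latent_weights is the distribution of u and must sum to 1.
    """
    if abs(sum(latent_weights) - 1.0) > 1e-9:
        raise ValueError("latent_weights must sum to 1")
    mixed = [0.0] * len(intercepts)
    for u, w in zip(latent_values, latent_weights):
        conditional = softmax([a + b * u for a, b in zip(intercepts, slopes)])
        for j, p in enumerate(conditional):
            mixed[j] += w * p
    return mixed


if __name__ == "__main__":
    # Three hypothetical stage/grade categories; the latent variable is a
    # centered, scaled stand-in for something like age at onset.
    probs = mixed_multinomial_probs(
        intercepts=[0.0, -1.0, -2.0],
        slopes=[0.0, 0.4, 0.8],
        latent_values=[-1.0, 0.0, 1.0],
        latent_weights=[0.25, 0.5, 0.25],
    )
    print(probs)
```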

To model disease progression after screen diagnosis, let Z(ξ) be the cancer progression process, with time ξ (i.e., age of the tumor) measured from the point of cancer onset in the subject. Given the two potential competing risks of clinical (CDx) and screening (SDx) diagnosis, we can define the corresponding potential values of the cancer development process, Z(ξ_SDx) and Z(ξ_CDx), measured on the same subject. The competing character of the two detection mechanisms makes them partially unobserved. Because ξ_SDx is undefined for an unscreened subject, we cannot treat ξ_SDx as missing data in a likelihood-based approach. Let the indicator I_SDx be 1 for screening and 0 for clinical diagnosis. Let the vector V = (a, z) be the disease presentation at diagnosis, combining age a and stage/grade z at the point of diagnosis. The disease progression model defines the probability of disease progression during the lead time in the absence of treatment, represented by the transition model [V₀ | V₁]. For a screen-detected subject, let f_V(V₀ | V₁, x) be the joint pdf of the disease presentation at counterfactual CDx (with characteristics V₀), conditional on the observed presentation V₁ at SDx and the birth cohort x. The transition probabilities between the two points of diagnosis (SDx, CDx) are modeled as functions of the lead time ξ_L, p_b(z₀ | z₁, ξ_L), and summarized as a progression probability matrix (PPM). Under the null hypothesis of no screening benefit, the baseline PPM probabilities p_b are not affected by treatment applied at the point of SDx. Two model predictions for cancer incidence are considered: (1) λ_I(a, z | ¬S) under no screening and (2) λ_I(a, z | IS) under ignored screening (no screening benefit), in which the patient is left undiagnosed until symptoms appear. The first scenario does not involve the PPM, while the second does. Fitting the PPM so that the two counterfactual incidence predictions are as close as possible (we use a Poisson-type distance measure) serves as the estimating procedure. When the model was fit to SEER data, we found that only about 5% of patients would progress in stage/grade during the lead time in the absence of treatment.
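The two ingredients of this estimating procedure can be sketched in a few lines: pushing a stage/grade-specific incidence vector through a PPM, and measuring how far the result is from a reference prediction. The sketch makes several assumptions not stated in the source: the PPM is a single fixed matrix rather than a function of lead time, the Poisson deviance stands in for the unspecified "Poisson-type distance measure", and all numbers are made up.

```python
import math
from typing import List, Sequence


def apply_ppm(ppm: Sequence[Sequence[float]], rates_at_sdx: Sequence[float]) -> List[float]:
    """Redistribute incidence by presentation at SDx to presentation at CDx.

    ppm[z1][z0] is P(presentation z0 at counterfactual CDx | z1 at SDx),
    so every row must sum to 1.
    """
    shifted = [0.0] * len(ppm[0])
    for z1, row in enumerate(ppm):
        if abs(sum(row) - 1.0) > 1e-9:
            raise ValueError(f"PPM row {z1} does not sum to 1")
        for z0, p in enumerate(row):
            shifted[z0] += rates_at_sdx[z1] * p
    return shifted


def poisson_deviance(reference: Sequence[float], candidate: Sequence[float]) -> float:
    """Poisson deviance between two positive rate vectors (0 when they agree)."""
    total = 0.0
    for r, c in zip(reference, candidate):
        if r > 0:
            total += r * math.log(r / c) - (r - c)
        else:
            total += c  # limit of the term above as r -> 0
    return 2.0 * total


if __name__ == "__main__":
    # Hypothetical two-category example: 5% of early cases progress to late.
    ppm = [[0.95, 0.05],
           [0.00, 1.00]]
    ignored_screening = apply_ppm(ppm, [100.0, 20.0])
    no_screening = [94.0, 26.0]  # made-up reference prediction
    print(ignored_screening, poisson_deviance(no_screening, ignored_screening))
```

In an actual fit, the PPM entries would be the free parameters and the distance would be minimized over them.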

To model screening benefit, survival, and mortality, we use a cumulative logit (i.e., proportional odds) regression model to describe reduced rates of progression in stage/grade during the lead time as a result of treatment at screen diagnosis. A two-stage logistic-multinomial model is used to model treatment assignments at diagnosis. Another treatment effect is introduced in the Cox model describing post-lead-time survival, conditional on stage and grade at CDx. The model is fit by maximum likelihood to SEER survival and mortality data and is mixed over the partially unobserved lead time, stage, and grade.
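For readers unfamiliar with the cumulative logit form, the sketch below turns ordered cutpoints and a linear predictor into category probabilities. It assumes the common parameterization P(Y ≤ j) = logistic(c_j − η), under which a negative treatment coefficient lowers η and moves probability toward less progression. The cutpoints and the treatment effect are hypothetical values, not estimates from the model.

```python
import math
from typing import List, Sequence


def logistic(x: float) -> float:
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def cumulative_logit_probs(cutpoints: Sequence[float], linear_predictor: float) -> List[float]:
    """Category probabilities under a proportional odds model.

    Uses P(Y <= j) = logistic(cutpoints[j] - linear_predictor), with strictly
    increasing cutpoints; returns len(cutpoints) + 1 probabilities.
    """
    if any(b <= a for a, b in zip(cutpoints, cutpoints[1:])):
        raise ValueError("cutpoints must be strictly increasing")
    cumulative = [logistic(c - linear_predictor) for c in cutpoints] + [1.0]
    probs = []
    previous = 0.0
    for value in cumulative:
        probs.append(value - previous)
        previous = value
    return probs


if __name__ == "__main__":
    # Ordered outcome: 0 = no progression, 1 = some, 2 = marked progression.
    cutpoints = [1.0, 2.5]   # hypothetical
    treatment_effect = -0.7  # hypothetical log odds shift for treated men
    print("untreated:", cumulative_logit_probs(cutpoints, 0.0))
    print("treated:  ", cumulative_logit_probs(cutpoints, treatment_effect))
```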

References

1. Mariotto AB, Etzioni R, Krapcho M, Feuer EJ. Reconstructing PSA testing patterns between black and white men in the US from Medicare claims and the National Health Interview Survey. Cancer. 2007;109(9):1877-1886.
2. Tsodikov A, Szabo A, Wegelin J. A population model of prostate cancer incidence. Statistics in Medicine. 2006;25(16):2846-2866.
3. Tsodikov A, Liu L, Tseng C. Likelihood Transformations and Artificial Mixtures. Institute of Mathematical Statistics Festschrift in honor of A. Yakovlev: IMS Collections; 2014.
4. Chefo S, Tsodikov A. Stage-specific cancer incidence: An artificially mixed multinomial logit model. Statistics in Medicine. 2009;28(15):2054-2076.
5. Tsodikov A, Chefo S. Generalized Self-Consistency: Multinomial logit model and Poisson likelihood. Journal of Statistical Planning and Inference. 2008;138(8):2380-2397.
6. Wang S, Tsodikov A. A Self-consistency Approach to Multinomial Logit Model with Random Effects. Journal of Statistical Planning and Inference. 2010;140(7):1939-1947.

Detailed Package Attributes
Attribute Category	Attribute
Approach
Primary Purpose	Screening evaluation, Policy evaluation, Population trends
Features
Intervention	Prevention, Screening, Treatment
Natural History	Metastases, Recurrence, Tumor Growth
Construction
Approach	Analytic
Methods	Longitudinal, Likelihood optimization, Stochastic process, State Transition, Time to Event
Unit of Analysis	Tumor, Person, Population
Data Source
Census
Cancer Registry	SEER
Linked
Clinical Trial	ERSPC, PLCO
Survey	NHIS
Meta Analysis
Assumptions
Benefit Factors
Screening	Stage Shift, Temporal Trends (Age), Lead Time
Treatment	Temporal Trends (Calendar year, Age, Birth cohort), Modality
Vaccination
Inputs	Incidence, Grade Distribution, Stage Distribution
Screening	Attendance, Dissemination, Effect
Diagnosis
Precancer
Treatment	Dissemination, Efficacy, Effect
Precancer
Survival	Observed, Relative
Mortality	Other cause, Tumor Attributes
Risk Factor
Vaccination
Outputs	Incidence
Disease	Stage Distribution, Grade Distribution, Other Conditions
Prevalence
Treatment	Effect
Precancer
Screening	Effect, Tumor Attributes
Risk Factor	Natural History
Outcomes	Survival, Life years, Cause-specific Mortality, All-cause Mortality
Screening	True Positives, Overdiagnoses, History
Treatment
Implementation
Development
Tested Platforms	Windows
Language	Delphi, R

2022

Wu W, Taylor JMG, Brouwer AF, Luo L, Kang J, Jiang H, He K, Scalable proximal methods for cause-specific hazard modeling with time-varying coefficients, Lifetime Data Anal, April 1, 2022

2019

Suresh K, Taylor JMG, Tsodikov A, A Gaussian copula approach for dynamic prediction of survival with a longitudinal biomarker, Biostatistics, Dec. 10, 2019
Beesley LJ, Morgan TM, Spratt DE, Singhal U, Feng FY, Furgal AC, Jackson WC, Daignault S, Taylor JMG, Individual and Population Comparisons of Surgery and Radiotherapy Outcomes in Prostate Cancer Using Bayesian Multistate Models, JAMA Netw Open, Feb. 1, 2019

2018

Tsodikov A, Gulati R, Etzioni R, Reconciling the Effects of Screening on Prostate Cancer Mortality in the ERSPC and PLCO Trials, Ann Intern Med, April 17, 2018
Etzioni R, Gulati R, When Clinical Trials Disagree, J Urol, March 1, 2018

2017

Suresh K, Taylor JMG, Spratt DE, Daignault S, Tsodikov A, Comparison of joint modeling and landmarking for dynamic prediction under an illness-death model, Biom J, Nov. 1, 2017
Tsodikov A, Gulati R, Heijnsdijk EAM, Pinsky PF, Moss SM, Qiu S, de Carvalho TM, Hugosson J, Berg CD, Auvinen A, Andriole GL, et al., Reconciling the Effects of Screening on Prostate Cancer Mortality in the ERSPC and PLCO Trials, Ann Intern Med, Oct. 3, 2017
Tran Q, Kidwell KM, Tsodikov A, A joint model of cancer incidence, metastasis, and mortality, Lifetime Data Anal, Sept. 4, 2017
Tsodikov A, Gulati R, de Carvalho TM, Heijnsdijk EAM, Hunter-Merrill RA, Mariotto AB, de Koning HJ, Etzioni R, Is prostate cancer different in black men? Answers from 3 natural history models, Cancer, June 15, 2017
Rice JD, Tsodikov A, Semiparametric time-to-event modeling in the presence of a latent progression event, Biometrics, June 1, 2017

2015

Ha J, Tsodikov A, Semiparametric estimation in the proportional hazard model accounting for a misclassified cause of failure, Biometrics, Dec. 1, 2015

2014

Gulati R, Tsodikov A, Etzioni R, Hunter-Merrill RA, Gore JL, Mariotto AB, Cooperberg MR, Expected population impacts of discontinued prostate-specific antigen screening, Cancer, Nov. 15, 2014
Hu C, Tsodikov A, Joint modeling approach for semicompeting risks data with missing nonterminal event status, Lifetime Data Anal, Oct. 1, 2014
Hu C, Tsodikov A, Semiparametric regression analysis for time-to-event marked endpoints in cancer studies, Biostatistics, July 1, 2014
Salinas CA, Tsodikov A, Ishak-Howard M, Cooney KA, Prostate cancer in young men: an important clinical entity, Nat Rev Urol, June 1, 2014

2012

Etzioni R, Gulati R, Tsodikov A, Wever EM, Penson DF, Heijnsdijk EA, Katcher J, Draisma G, Feuer EJ, de Koning HJ, Mariotto AB, et al., The prostate cancer conundrum revisited: treatment changes and prostate cancer mortality declines, Cancer, Dec. 1, 2012
Gulati R, Tsodikov A, Wever EM, Mariotto AB, Heijnsdijk EA, Katcher J, de Koning HJ, Etzioni R, The impact of PLCO control arm contamination on perceived PSA screening efficacy, Cancer Causes Control, June 1, 2012
Ha J, Tsodikov A, Isotonic estimation of survival under a misattribution of cause of death, Lifetime Data Anal, Jan. 1, 2012

2011

Gulati R, Wever EM, Tsodikov A, Penson DF, Inoue LY, Katcher J, Lee SY, Heijnsdijk EA, Draisma G, de Koning HJ, Etzioni R, et al., What if I don't treat my PSA-detected prostate cancer? Answers from three natural history models, Cancer Epidemiol Biomarkers Prev, May 1, 2011

2009

Chefo S, Tsodikov A, Stage-specific cancer incidence: an artificially mixed multinomial logit model, Stat Med, July 10, 2009
Draisma G, Etzioni R, Tsodikov A, Mariotto A, Wever E, Gulati R, Feuer E, de Koning H, Lead time and overdiagnosis in prostate-specific antigen screening: importance of methods and context, J Natl Cancer Inst, March 18, 2009

2008

Etzioni R, Tsodikov A, Mariotto A, Szabo A, Falcon S, Wegelin J, DiTommaso D, Karnofski K, Gulati R, Penson DF, Feuer E, et al., Quantifying the role of PSA screening in the US prostate cancer mortality decline, Cancer Causes Control, March 1, 2008
Tsodikov A, Chefo S, Generalized Self-Consistency: Multinomial logit model and Poisson likelihood, J Stat Plan Inference, Jan. 1, 2008

2006

Tsodikov A, Szabo A, Wegelin J, A population model of prostate cancer incidence, Stat Med, Aug. 30, 2006