Transparency in the Use of Propensity Score Methods (Text Version) Slide presentation from the AHRQ 2008 conference showcasing Agency research and projects. Slide Presentation from the AHRQ 2008 Annual ConferenceOn September 9, 2008, John D. Seeger, made this presentation at the 2008 Annual Conference. Select to access the PowerPoint® presentation (610 KB; Plugin Software Help).Slide 1Transparency in the Use of Propensity Score MethodsJohn D. Seeger, PharmD, DrPHChief Scientist, i3 Drug SafetyAdjunct Assistant Professor, Harvard School of Public HealthSeptember 9, 2008With thanks to: Alec Walker, Tobias Kurth, Jeanne Loughlin, Mona Eng, and Alex ColeSlide 2Propensity Score Analysis—When?Strategies to control for confounding in non-experimental pharmacoepidemiology: Measured confounders** Design** RestrictionMatchingAnalysis** StandardizationStratificationMultivariate regressionUnmeasured confounders* Unmeasured but measurable in a validation study: Two-stage samplingExternal adjustmentUnmeasurable: Design Crossover designsActive comparison group (restriction)Analysis: Instrumental variablesSensitivity analysisNote: *Thanks to S. Schneeweiss**Amenable to Propensity TechniquesSlide 3MotivationAssume matching when comparing 2 treatments: For every drug user with given characteristicsFind a comparator with identical characteristicsExample: Male, age 45, smoker, with hypertension (HTN)... Matching fails: Age (10 categories) xSex (2 categories) xPrior diagnoses (5 @ 2 categories each)Prior drug therapy (5 @ 2 categories each)Preceding cost of care (5 categories)Leads to 102,400 potential matching groups.Slide 4Propensity Score Collapses Exposure PredictorsThe slide shows a line graph which presents the "Hypothetical Distribution of Propensity Scores." The vertical axis, number of people, goes from 0 to 1400 and the horizontal axis, propensity score, goes from 0 to 1. The results show "Initiators" beginning at 0, reaching a maximum of 400 people at 0.65, and ending at less than 100 people at 1. The results show "Non-Initiators" beginning at 300, reaching a maximum of 1199 people at 0.35, and ending at 0.Single value: Probability: subject will receive therapy vs comparatorRemoves confounding by components of the score Patient characteristics that favor one therapy over anotherPermits: RestrictionMatchingStratificationModelingWeightingSlide 5Should Propensity Scores Always be Used?The slide shows a line graph which presents "Logistic Regression."Note: Figure 1. Median percentage of bias with the logistic regression, by strength of the exposure and number of events per confounder. In the logistic regression, the bias declines as the number of events per confounder increases. Values greater than zero indicate an overestimation of the effect of the exposure on the outcome. Negative values indicate an underestimation of the effect of the exposure on the outcome.Note: Cepeda S, et al. Am J Epidemiol 2003;158:280-7.More than 8 events per covariate leads to unbiased estimatesSo propensity score favored when:Many more persons exposed to drug of interest than study outcomes. Common exposureRare outcomeAllows for richer model (more predictors) of exposure than outcome. Alternative hypothesesSlide 6Estimate Propensity ScorePredict treatment from baseline covariates within databaseInclusion of predictors: A priori (what characteristics are used to prescribe?)Empiric (what differentiates initiators?)Generic (what patterns of healthcare predict initiation?)Coefficients of propensity score: Interpretable and InformativeSlide 7Propensity Score RestrictionThe slide shows a bell curve graph presenting the results for "Exposed subjects" and "Unexposed subjects."Note: *In this example subjects with low propensity scores are never exposed while subjects with high propensity scores are always exposed.Note: Sturmer T, et al. J Clin Epidemiol 2006;59:437-47.Slide 8Propensity Score RestrictionPotential for serious adverse events from error (name confusion): Amaryl (glimepiride an oral hypoglycemic)Reminyl (galantamine for Alzheimer's disease)36,816 people with AD diagnosis (14,626 Reminyl dispensings): 236 Amaryl recipients24 Amaryl recipients in the lowest decile of the propensity score13 with a single dispensing of Amaryl or no diabetes diagnoses2 with no diabetes-related claims across entire claim historyMedical record review suggested no errorPropensity score restriction may be used as a screening method to identify unusual patterns of healthcare for closer scrutiny: Possible medication dispensing errorsOthersConfirmation requires additional data, which could be obtained through medical record review.Slide 9Propensity Score Distribution and StrataThe line graph presents the propensity scores on the x axis and percent on the y axis. There are two lines representing "Zopiclone" and "Temazepam."C-statistic equals 0.739.Slide 10Effect of Temazepam Relative to ZopicloneThe table presents statistical data of "Relative Risk" and "Upper and Lower 95th Percentile." The data shown is for Quintiles 1-5 and totals adjusted for propensity score and continuous propensity score.Transparent analysis: Within-stratum balanceStratum-specific effect estimates as well as pooled estimateExplicit evaluation of potential for effect measure modificationSlide 11Matching on the Propensity ScoreMatching can be performed by: Standard automated case-control matching programs where the matching range is specifiedNearest available match based on the propensity scoreGreedy matching techniques (http://www2.sas.com.proceedings/sugi26/p214-26.pdf)Slide 12Distribution of Propensity ScoreThe line graph presents propensity score on the x axis and number of persons on the y axis. The categories are "Statin Initiators" and "Non-Initiators." The results show a consistent trend for both categories, with a low propensity score for higher numbers of persons and exponentially higher propensity score as the number of persons decrease.Slide 13Propensity Score Distribution (After Matching)The line graph presents propensity score on the x axis and number of persons on the y axis. The categories are "Statin Initiators" and "Non-Initiators." The data for both categories shows a sharp increase in number of persons in the lower propensity scores, between 0 and 0.05. Beyond propensity scores of 0.05, the number of persons decreases as the propensity scores increase, approaching 0 beyond a propensity score of 0.7.Slide 14Characteristics Before MatchingThe table presents the results for "Initiators: N=4144," "Non-Initiators: N=4144," and "P-value" for various "Variables" such as Lipid-Related Labs, Different Prescription Drugs, Low-density lipoprotein (LDL) Level, Angina, Smoking, Hypertension, etc.Slide 15Balance Achieved by MatchingThe table presents the results for "Initiators: N=2901," "Non-Initiators: N=2901," and "P-value" for various "Variables" such as Lipid-Related Labs, Different Prescription Drugs, LDL Level, Angina, Smoking, Hypertension, etc.Note: Matched at 0.01 Propensity ScoreSlide 16Analysis by 2X2 TableThe slide presents a "Table of mi by statin."Slide 17MI Outcome (After Matching)The line graph presents months of follow up on the x axis and cumulative incidence on the y axis. The two data sets are "Statin Initiators" and "Statin Non-Initiators." Both data lines begin at approximately, 0, 0. Both show a somewhat linear increase in cumulative incidence as months of follow-up increase. The trend is stronger in the "Statin Non-Initiators" data.Note: HR=0.69 (0.52-0.93)Note: 31% (7%-48%) Risk ReductionSlide 18Regression Adjustment with Propensity ScoresRegression adjustment: Note: An image of the equation is portrayed.All study participants are usedStill a two-step approach (exposure and outcome)More power compared to including all covariates into the model, since degrees of freedom are gainedHowever, assumes the underlying association between the score and the outcome is modeled appropriatelySlide 19WeightingThe slide presents three separate line graphs showing the distribution of propensity score on the x axis relative to the number of people on the y axis. The lower two charts are entitled "IPTW" and "SMR" and both show nearly identical distributions of data for "Initiators" and "Non-Initiators". The "IPTW" chart shows a bell curve distribution of with a peak in number of people, approximately 1300, at a propensity score of approximately 0.45. The "SMR" chart shows a bell curve distribution of with a peak in number of people, approximately 400, at a propensity score of approximately 0.65.Slide 20Baseline CharacteristicsTable 1. Variables included in the propensity scores, LABA ans ICS cohortsThe table presents statistical data of "LABA cohort" and "ICS cohort." The data is shown for various age categories, genders, geographic regions of health plan, Asthma-related drug dispensings (0-2 days), Asthma-related drug dispensings (3-365), and Asthma-related physician visits.CategoriesLABA Cohort(18,596 patients)ICS Cohort(30,520 patients)n%n%Age category, years10-193,75720.206,54321.4420-291,5598.382,2257.2930-393,23717.414,96516.2740-494,34123.346,63121.7350-644,55024.477,62624.9965+1,1526.192,5308.29GenderMale7,25639.0212,80441.95Female11,34060.9817,71658.05Geographic region of health planNortheast2,17611.703,71412.17South/Southeast7,27339.1110,89335.69Midwest6,50234.9612,06539.53West2,61614.073,79012.42Asthma-related drug dispensing (0-2 days)Short-acting beta-agonists4,86626.1710,34433.89Oral steroids1,99810.741,2173.99Injectible steroids3221.731800.59Leukotriene modifiers2,15111.572,4728.10Mast cell stabilizers550.302110.69Xanthines1760.953531.16Omalizumab10.0110.00Asthma-related drug dispensing (3-365)Short-acting beta-agonists13,28771.4521,75371.27Oral steroids7,10438.209,23130.25Injectible steroids2,27812.253,0159.88Leukotriene modifiers4,95826.666,58021.56Mast cell stabilizers3912.107882.58Xanthines6793.651,1173.66Omalizumab20.0120.01Asthma-related physician visits07,44540.0416,67654.6414,97326.747,17823.522+6,17833.226,66621.84Slide 21Cohort ResultsIncidence rates (per 1000) and hazard ratiosThe table presents statistical data of "Events," "Person Years," "Incidence Rate," and "Hazard Ratio" for the following categories:All-cause MortalityAsthma-related emergency room visitsAsthma-related hospitalizationsIntubationsCohortPerson-IncidenceHazard RatioEventsYearsRate95% CIAdjusted*95% CI All-cause Mortality ICS Cohort17135,8864.774.18-5.411.00RefLABA CohortLABA Cohort9325,8013.603.01-4.280.690.50-0.95Formoterol49244.331.48-9.880.780.28-2.14Salmeterol378,4484.383.27-5.760.840.56-1.26Salmeterol/Fluticasone5216,4073.172.48-3.990.610.42-0.88 Asthma-related emergency room visits ICS Cohort72335,33920.4619.24-21.741.00RefLABA CohortLABA Cohort71025,00228.4026.69-30.191.241.09-1.42Formoterol3089233.6324.31-45.341.390.96-2.02Salmeterol2478,15130.3027.24-33.611.331.13-1.57Salmeterol/Fluticasone43315,93727.1725.08-29.381.191.03-1.38 Asthma-related hospitalizations ICS Cohort15435,8184.303.75-4.911.00RefLABA CohortLABA Cohort25425,5789.938.93-11.011.761.36-2.27Formoterol1091310.955.95-18.501.880.97-3.64Salmeterol958,35411.379.53-13.472.011.48-2.72Salmeterol/Fluticasone14916,2909.157.95-10.471.621.23-2.14 Intubations ICS Cohort8635,8522.401.99-2.871.00RefLABA CohortLABA Cohort7425,7622.872.35-3.481.030.69-1.52Formoterol49224.341.48-9.901.460.52-4.10Salmeterol338,4303.912.87-5.231.400.88-2.25Salmeterol/Fluticasone3716,3882.261.68-2.970.810.51-1.27Slide 22Are Divergent Results Possible?The table presents statistical data for "No.," "OR*," and "95% Cl*" for the following categories:Crude modelMultivariable modelMatched on propensity scoreRegression adjusted with propensity scorePropensity score, continuousMultivariablePropensity score, decilesMultivariableWeighted modelsIPTW*SMR* weightedNote: Kurth T, et al. Am J Epidemiol 2006;163:262-70. No.OR*95% CI*Crude model6,2693.352.28, 4.91Multivariable model†6,2691.931.22, 3.06Matched on propensity score4061.170.68, 2.00Regression adjusted with propensity scorePropensity score, continuous6,2691.530.95, 2.48Multivariable†6,2691.851.13, 3.03Propensity score, deciles6,2691.761.13, 2.72Multivariable†6,2691.961.20, 3.20Regression adjusted with propensity scoreIPTW*6,26910.772.47, 47.04SMR* weighted6,2691.110.67, 1.84Slide 23What About Unmeasured Confounding?Obesity, Smoking, ExercisePharmacoepidemiology and Drug Safety (2008)Published online in Wiley InterScience (www.interscience.wiley.com)DOI: 10.1002/pds.1554Original ReportSupplementary data collection with case-cohort analysis to address potential confounding in a cohort study of thromboembolism in oral contraceptive initiators matched on claims-based propensity scores.P. Mona Eng ScD1*, John D. Seeger PharmD, DrPH1,2, Jeanne Loughlin MS1*, C. Robin Clifford MS1*, Sherry Mentor BA1* and Alexander M. Walker MD, DrPH1,2 1Ingenix, i3 Drug Safety, Waltham, MA, USA 2Harvard School of Public Health, Boston, MA, USASlide 24Accounting for Variables had Little EffectTable 3. Relative risks of thromboembolism comparing ethinyl estradiol/drospirenone (EE/DRSP) and other oral contraceptive (OC) initiators from case-cohort and propensity score matched cohort analyses, Ingenix Research Data Mart 2001-2004, United States. Relative risk95% CI*Case cohort analysisRisk ratio Univariate analysisEE/DRSP0.920.50, 1.69Other OC1 Multivariate analysisEE/DRSP0.900.49, 1.68Other OC1 Propensity score matchedRate ratio Cohort analysisEE/DRSP0.920.50, 1.63Other OC1 Slide 25ConclusionPropensity score can be useful for addressing confounding (by indication).Allows for rich model of exposure to be developed.Advantageous when number of people with a study outcome is small relative to number of exposed persons and number of potential confounders is large. Drug effects (particularly adverse ones)Consider transparency: When selecting propensity scoreWhen building propensity scoreWhen using propensity scoreSlide 26Thank YouJohn.Seeger@i3DrugSafety.com Current as of February 2009 Internet Citation: Transparency in the Use of Propensity Score Methods (Text Version). February 2009. Agency for Healthcare Research and Quality, Rockville, MD. http://www.ahrq.gov/news/events/conference/2008/Seeger.html