Selecting Evidence for Comparative Effectiveness Reviews: When to use Slide Presentation from the AHRQ 2009 Annual ConferencSlide presentation from the AHRQ 2009 conference. On September 16, 2009, Meera Viswanathan made this presentation at the 2009 Annual Conference. Select to access the PowerPoint® presentation (1 MB) (Plugin Software Help).Slide 1 Selecting Evidence for Comparative Effectiveness Reviews: When to use Observational StudiesDan Jonas, MD, MPHMeera Viswanathan, PhDKaren Crotty, PhD, MPHRTI-UNC Evidence-based Practice Center Slide 2 SourcesAHRQ Methods Guide, Chapters 4 and 8, http://www.effectivehealthcare.ahrq.gov/repFiles/2007_10DraftMethodsGuide.pdfDraft manuscript, Norris et al., Observational Studies in Systematic Reviews of Comparative Effectiveness.Chou R, Aronson N, Atkins D, et al. Assessing harms when comparing medical interventions: AHRQ and the Effective Health Care Program. J Clin Epidemiol 2008 Sep 25. Slide 3 OverviewWhy should reviewers consider including observational studies (OS) in comparative effectiveness reviews (CERs)?When should OS be included in CERs?What are the differences in considering inclusion of OS for benefits as opposed to OS of harms? Slide 4 Current PerspectiveCERs should consider including observational studies *this should be the default strategy*Reviewers should explicitly state the rationale for including or excluding OS Slide 5 Comparative Effectiveness Reviews (CERs)Systematic reviews that compare the relative benefits and harms among a range of available treatments or interventions for a given condition Slide 6 CER Process OverviewPrepare topic: Refine key questionsDevelop analytic frameworksSearch for and select stufies: Indentify eligibilitySearch for relecant studiesSelect evidence for inclusionAbstract data: Estract evidence from studiesConstruct evidence tablesAnalyze and synthesize data: Assess quality of studiesAssess applicability of studiesApply qualitative methodsApply qualitative methods (meta-anyaysesRate the strength of a body of evidencePresent findings Slide 7 Hierarchy of EvidenceLowest risk of biasSystematic ReviewsRCTsControlled Clinical Trials and Observational StudiesUncontrolled Observational StudiesCase reports and case seriesExpert OpinionsApplicability? Slide 8 Danger of Over-reliance on RCTsMay be unnecessary, inappropriate, inadequate, or impracticalMay be too short in durationMay report intermediate outcomes rather than main health outcomes of interestOften not available for vulnerable populationsGenerally report efficacy rather than effectivenessAHRQ Evidence-based Practice Centers include wide variety of study designs (not only RCTs) Slide 9 Observational Studies (OS)Definition: Studies where the investigators did not assign the exposure/intervention I.e. non-experimental studiesControlled clinical trials are quasi-experimental studies, not OSWe present considerations for including OS to assess benefits and to assess harms separately Slide 10 OS to Assess BenefitsOften insufficient evidence from trials to answer all KQs in CERs (think PICOTS) Population: may not be available for sub-populations and vulnerable populationsInterventions: may not be able to assign high-risk interventions randomlyOutcomes: may report intermediate outcomes rather than main health outcomes of interestTiming: may be too short in durationSetting: may not represent typical practice Slide 11Group ExerciseWhat should reviewers consider when deciding whether or not to include observational studies in CERs? Slide 12 OS to Assess BenefitsReviewers should consider 2 questions:1. Are there gaps in trial evidence for the review questions under consideration?2. Will observational studies provide valid and useful information to address key questions? Slide 13 Flowchart of OS to Assess Benefits1.Systematic review question (including PICOTS)2. Always consider: Controlled Trials3. Are there gaps in trial evidence?4. Yes - Consider OS4. No - Confine review to Controlled TrialsWill OS provide valid and useful information?5. Refocus the review question on gaps6. Assess whether OS address the review question7. Assess the suitability of OS: Natural history of the disease or exposure Potential biases Slide 14 Group Exercise: Include OS?CER of PCI vs. CABG for coronary disease identified 23 RCTs. Experts (TEP) raised concerns that the studies enrolled patients with a relatively narrow spectrum of disease relative to those having the procedures in current practiceReview of antioxidant supplementation to prevent heart disease found numerous large clinical trials, including over 20,000 elevated-risk subjects in the Heart Protection Study. No beneficial effects were seen in CV outcomes, including mortality. Findings were consistent across trials with varying populations, sizes, etc. Slide 15 Group Exercise: include OS?CER of PCI vs. CABG----Need to look for OS OS from 10 large cardiovascular registries were identifiedThese confirmed that the use of the procedures in the community included patients with wider variation in diseaseFor patients similar to those enrolled in trials, mortality results in the registries were similar to trials (no difference between interventions)Relative benefits of the procedures varied markedly with extent of disease, raising caution about extending trial conclusions to patients with greater or lesser disease than those in trial populationsReview of antioxidant supplementation to prevent heart disease----Trial data are sufficient Slide 16 Gaps in Trial Evidence: PICOTSTrial data may be insufficient for a number of reasons PICOTSPopulations included (missing certain groups)Interventions includedOutcomes reported (only intermediate)DurationAll trials may be efficacy studies Slide 17 Are Trial Data Sufficient?PICOTS and BeyondRisk of bias (internal validity) Degree to which the findings may be attributed to factors other than the intervention under reviewConsistency Extent to which effect size and direction vary within and across studiesInconsistency may be due to heterogeneity across PICOTSDirectness Degree to which outcomes that are important to users of the CER (patients, clinicians, or policymakers) are encompassed by trial dataHealth outcomes generally most important Slide 18 Are Trial Data Sufficient? PICOTS and BeyondPrecision Includes sample size, number of studies, and heterogeneity within or across studiesReporting bias Extent to which trial authors appear to have reported all outcomes examinedApplicability Extent to which the trial data are likely to be applicable to populations, interventions, and settings of interest to the userThe review questions should reflect the PICOTS characteristics of interest Slide 19 When to Identify Gaps in Trial EvidenceIdentification of gaps in trial evidence available to answer review questions can occur at a number of points in the review When first scoping the reviewConsultation with Technical Expert PanelInitial review of titles and abstractsAfter detailed review of trial data Slide 20 CER Process OverviewPrepare topic: Refine key questionsDevelop analytic frameworksSearch for and select stufies: Indentify eligibilitySearch for relecant studiesSelect evidence for inclusionAbstract data: Estract evidence from studiesConstruct evidence tablesAnalyze and synthesize data: Assess quality of studiesAssess applicability of studiesApply qualitative methodsApply qualitative methods (meta-anyaysesRate the strength of a body of evidencePresent findings Slide 21 Gaps in Trial EvidenceOperationally, may perform initial searches broadly, to identify both OS and trials, or may do searches sequentially and search for OS after reviewing trials in detail to identify gaps in evidence Slide 22 2. Will observational studies provide valid and useful information to address key questions?Reviewers should:Refocus the study question on gaps in trial evidence Specify the PICOTS characteristics for gaps in trial evidenceAssess whether available OS may address the review questions (applicable to PICOTS?)Assess suitability of OS to answer the review questions Slide 23 Valid and Useful InformationAssess suitability of OS to answer the review questionsAfter gaps have been identified in trial literature and that OS potentially fill those gaps Consider the clinical context and natural history of the condition under studyAssess how potential biases may influence the results of OS Slide 24 Clinical contextFluctuating or intermittent conditions are more difficult to assess with OS Especially if there is no comparison groupOS may be more useful for conditions with steady progression or decline Slide 25 Group ExerciseHere are two very different conditions: Acute low back painAmyotrophic lateral sclerosis (ALS)How might the differences in these conditions impact whether OS would provide useful information? Slide 26 Group ExerciseMain considerations here are the natural history of the condition under studyPeople with acute low back pain often recover spontaneously A cohort study of treatments for acute low back pain can't establish, with any degree of certainty, whether the treatments affected patient outcomesALS has a course of steady decline An uncontrolled cohort study of treatments for ALS may well be able to demonstrate meaningful effects Slide 27 Potential biasesSelection bias (and confounding by indication)Performance biasDetection biasAttrition bias Slide 28 Group ExerciseSuppose you're conducting a CER of medications for rheumatoid arthritis (RA)You find several retrospective analyses of administrative databases comparing outcomes of RA patients taking etanercept vs. methotrexateSuppose that etanercept is restricted in many of the health systems to patients with more severe RA who have failed other treatmentsShould you include these OS?What considerations will influence your decision? Slide 29 Group ExerciseConfounding by indication A type of selection biasWhen different diagnoses, severity of illness, or comorbid conditions are important reasons for physicians to assign different treatmentsCommon problem in pharmacoepidemiology studies comparing beneficial effects of interventionsGenerally would not include this information due to a high risk of bias (poor internal validity), unless studies had a good way to adjust for severity of disease Slide 30 HarmsAssessing harms can be difficult Trials often focus on benefits, with little effort to balance assessment of benefits and harmsOS are almost always necessary to assess harms adequatelyThere are tradeoffs between increasing comprehensiveness of reviewing all possible harms data and decreasing quality (increasing risk of bias) for harms data Slide 31 Trials to Assess HarmsRandomized controlled trials = gold standard for evaluating efficacyBut, relying solely on RCTs to evaluate harms in CERs is problematic Most lack prespecified hypotheses for harms as they are designed to evaluate benefitsAssessment of harms is often a secondary considerationQuality and quantity of reporting of harms is frequently inadequateFew have sufficient sample sizes or duration to adequately assess uncommon or long-term harms Slide 32 Trials to Assess HarmsMost RCTs are "efficacy" trials They assess benefits and harms in ideal, homogenous populations and settingsPatients who are more susceptible to harms are often under-representedFew RCTs directly compare alternative treatment strategiesPublication bias and selective outcome reporting biasRCTs may not be available Slide 33 Trials to Assess HarmsNevertheless, head-to-head RCTs provide the most direct evidence on comparative harmsIn addition, placebo-controlled RCTs can provide important informationIn general, CERs should routinely include both head-to-head and placebo-controlled trials for assessment of harms In lieu of placebo-controlled RCTs, CERs may incorporate findings of well-conducted systematic reviews if they evaluated the specific harms of interest Slide 34 Unpublished Supplemental Trials DataConsider including results of completed or terminated unpublished RCTs and unpublished results from published trials FDA website, www.ClinicalTrials.gov , etc.Must contemplate ability to fully assess risk of biasWhen significant # of published trials fails to report an important AE, CER authors should report this gap in the evidence and consider efforts to obtain unpublished data Slide 35 OS to Assess HarmsOS are almost always necessary to assess harms adequatelyException is when there are sufficient data from RCTs to reliably estimate harmsMay provide best or only data for assessing harms in minority or vulnerable populations who are under-represented in trialsTypes of OS included in a CER will vary; different types of OS might be included or rendered irrelevant by availability of data from stronger study types Slide 36 Hypothesis Testing vs. Hypothesis GeneratingImportant consideration in determining which OS to include Case reports are hypothesis generatingCohort and case-control studies are well suited for testing hypotheses of whether one intervention is associated with a greater risk for an adverse event than another and for quantifying the risk**Chou et al, JCE 2008 Slide 37 Hierarchy of EvidenceLowest risk of biasSystematic ReviewsRCTsControlled Clinical Trials and Observational StudiesUncontrolled Observational StudiesCase reports and case seriesExpert Opinions Slide 38 OS to Assess HarmsCohort and case-control studies CERs should routinely search for and include, except when RCT data are sufficient and validOS based on patient registriesOS based on analyses of large databasesCase reports and post-marketing surveillance New medicationsOther OS Slide 39 OS to Assess HarmsCriteria to select OS for inclusion There are often many more OS than trials; evaluating a large number of OS can be impractical when conducting a CERSeveral criteria commonly uses in CERs to screen OS for inclusion (empirical data lacking)Minimum duration of follow-upMinimum sample sizeDefined threshold for risk of biasStudy design (cohort and case-control)Specific population of interest Slide 40Key Take-home PointsOften insufficient evidence from trials to answer all Key Questions in CERsCERs should consider including OS *default strategy*Should explicitly state the rationale for including or excluding OSFor OS to assess benefits, reviewers should consider 2 questions: Are there gaps in trial evidence for the review questions under consideration?Will observational studies provide valid and useful information to address key questions?For harms, should routinely search for and include cohort and case-control studies Current as of December 2009 Internet Citation: Selecting Evidence for Comparative Effectiveness Reviews: When to use : Slide Presentation from the AHRQ 2009 Annual Conferenc. December 2009. Agency for Healthcare Research and Quality, Rockville, MD. http://www.ahrq.gov/news/events/conference/2009/viswanathan2/index.html