Grading Evidence and Recommendations: Starting with GRADE Basics Vs. Utilizing the Full Framework Slide presentation from the AHRQ 2010 conference. On September 28, 2010, Yngve Falck-Ytter made this presentation at the 2010 Annual Conference. Select to access the PowerPoint® presentation (760 KB). Free PowerPoint® Viewer (Plugin Software Help).Slide 1Grading Evidence and Recommendations: Starting with GRADE Basics vs. Utilizing the Full FrameworkAHRQ Annual Meeting 2010:“Better Care, Better Health: Delivering on Quality for All Americans"September 28, 2010Yngve Falck-Ytter, M.D.Associate Professor of MedicineCase Western Reserve University, Cleveland, OhioHolger Schünemann, M.D., Ph.D.Chair, Department of Clinical Epidemiology & BiostatisticsMichael Gent Chair in Healthcare ResearchMcMaster University, Hamilton, CanadaSlide 2DisclosuresIn the past 5 years, Dr. Falck-Ytter received no personal payments for services from industry. His research group received research grants from Three Rivers, Valeant and Roche that were deposited into non-profit research accounts. He is a member of the GRADE working group which has received funding from various governmental entities in the US and Europe, such as the AHRQ. Some of the GRADE work he has done is supported in part by grant # 1 R13 HS016880-01 from the Agency for Healthcare Research and Quality (AHRQ).Slide 3ContentPart 1A 7 minute version of GRADE.Part 2Rapid interactive exchange contrasting GRADE basic vs. the full GRADE approach. Advantages of a structured approach.Asking good clinical questions.Systematic review vs. ad hoc approaches.Grading the quality of evidence.How to determine the strength of recommendations.Slide 4Question to the audienceDecisions in your medical practice are based on:Training, experience and knowledge of respected colleagues.Patient preferences.Convincing evidence (non experimental) from case reports, case series, disease mechanism.RCTs, systematic reviews of RCTs and meta-analyses.All of the above.Slide 5Evidence-based clinical decisionsVenn diagram showing the intersections of Clinical circumstances, Patient values and preferences, and Research evidence combining to bring Expertise.Haynes et al. 2002Slide 6A real world example...P: In patients with acute hepatitis C ...I: Should anti-viral treatment be used ...C: Compared to no treatment ...O: To achieve viral clearance?EvidenceRecommendationOrganizationBClass IAASLD (2009)II-1“Should be initiated...”VA (2006);1+ASIGN (2006)-/-“Most authorities...”AGA (2006)-/-B “It works...”AWMF(2004)Slide 7Question to the audienceBy now......you are thoroughly confused...you send her to a doctor because treatment is recommended...you send her to a doctor but she can expect that, according to guidelines, she will not be treated...you look at the evidence yourself because past experience tells you that guidelines don't helpSlide 8GRADE is outcome-centricAn image of the old system transitioning into the new GRADE system is shown. Slide 9Guideline developmentThe top half of the slide is diagram entitled "Systematic review". It show the steps of PICO:Formulate question (PICO)Select outcomesRate importanceOutcomes across studiesCreate evidence profile with GRADEpro (Summary of findings & estimate of effect for each outcome)Rate quality of evidence for each outcome (High, Moderate, Low, Very low)Rate overall quality of evidence across outcomes based on lowest quality of critical outcomes.Rate after Step 6 (Rate quality of evidence for each outcome) is set of steps entitled "RCT start high, obs. data start low".Grade downRisk of biasInconsistencyIndirectnessImprecisionPublication biasGrade upLarge effectDose responseConfoundersGuideline developmentFormulate recommendations:For or against (direction)Strong or weak (strength)By considering:Quality of evidenceBalance benefits/harmsValues and preferencesRevise if necessary by considering:Resource use (cost)“We recommend using...”“We suggest using...”“We recommend against using...”“We suggest against using...”Slide 10Question to the audienceWhich question follows a well structured clinical PICO format:What is the evidence that food allergens cause eosinophilic esophagitis?Is it known what the evidence is that aspirin can prevent progression of dysplasia to cancer in Barrett's esophagus?In patients undergoing hip replacement, does warfarin compared to aspirin reduce venous thromboembolism, pulmonary embolism and mortality?Slide 11That's an excellent questionTranslating informal clinical questions into specific PICO questions = central to GRADE.Even if an organization has limited resources, taking care of this step actually saves resources: Helps limiting your scope.Specifies the search strategy more clearly.Guides data extraction.Helps with formulating recommendations.Slide 12Taking it to the next levelInformal QuestionPICO QuestionMethod PopulationIntervention(s)Comparator(s)Outcome(s) Whether to use thrombo-prophylaxis for VTE prophylaxis (drugs)Patients under-going THRAny drug (ASA, LDUH, LMWH, fonda-parinux, direct thrombin inhibitors)No anti-coagulationAsymptomatic DVT (surrogate for symptomatic VTE); symptomatic DVT; non-fatal PE; fatal PE; bleeding (operative site vs. non-operative site); readmission; re-operation; total mortalityRCT, obs. studiesSlide 13Importance of outcomesDeciding on the importance of outcomes on decision making:Less important123Important456Critically important789P: In patients after hip replacement...I: Should warfarin rather than...C: Aspirin be given...O: To reduce symptomatic venous thromboembolism and mortality?Slide 14Question to the audienceDeciding on the importance of outcomes on decision making:Less important123Important456Critically important789Please rate outcome: Dying from pulmonary embolism(1, 2, 3): Less important for decision making(4, 5, 6): Important for decision making(7, 8, 9): Critically important for decision makingSlide 15Question to the audienceDeciding on the importance of outcomes on decision making:Less important123Important456Critically important789Asymptomatic deep vein thrombosis in the calf (e.g., as seen on mandatory venography at end of study)(1, 2, 3): Less important for decision making(4, 5, 6): Important for decision making(7, 8, 9): Critically important for decision makingSlide 16Deciding on the importance of outcomes on decision making:Less important123Important456Critically important789Stomach ulcer bleeding requiring endoscopy(1, 2, 3): Less important for decision making(4, 5, 6): Important for decision making(7, 8, 9): Critically important for decision makingSlide 17Deciding on the importance of outcomes on decision making:Less important123Important456Critically important789Regular blood work and dose adjustments(1, 2, 3): Less important for decision making(4, 5, 6): Important for decision making(7, 8, 9): Critically important for decision makingSlide 18Rating the importance of outcomesTrain the content expert to understand that outcomes that are critical for decision making are identified.Rating is done before, during and after the evidence review.The rating may change in light of new information.Slide 19This slide is duplicate of Slide 9 above.Slide 20Taking it to the next levelEarly involvement of consumers in the guideline development process.Selecting systematic reviews that are known to make an effort to include consumer views (e.g., Cochrane etc.).Can be used to identify research gaps.Slide 21Evidence review stageImage of a flowchart which starts at "What format of evidence do you use?"Slide 22Question to the audienceSelect the best answer: You can find high quality systematic reviews for "free" here:AHRQThe Cochrane LibraryCanadian Agency for Drugs and Technologies in Health (CADTH)National Institute for Clinical Excellence (NICE), UKAll of the aboveSlide 23Taking it to the next levelWhat to look for when selecting evidence review centers.Commissioning systematic reviews: Making sure the center understands GRADE requirements. What SR methodology they use.What databases they can search.What software they use.How they document their work.Slide 24Question to the audienceGRADE rating evidence: The quality of evidence may need downgrading if:The outcome is reduction of elevated pressure in the eye (IOP) instead of loss of vision.There are large losses to follow-up.Some trials showing benefits, others reporting harms.The confidence interval is wide and there are few events.All of the above.Slide 25Quality of evidence: beyond risk of biasDefinition: The extent to which our confidence in an estimate of the treatment effect is adequate to support a particular recommendationMethodological limitationsRisk of bias:Allocation concealmentBlindingIntention-to-treatFollow-upStopped earlyInconsistency of resultsUnderneath this category is an Odds Ratio graph.Indirectness of evidenceSources of indirectness:Indirect comparisonsPatientsInterventionsComparatorsOutcomesImprecision of resultsUnderneath this category is an Odds Ratio graph.Publication biasUnderneath this category is an Odds Ratio graph.Slide 26 Quality assessment criteriaStudy designQuality of evidenceLower if ...Higher if...Randomized trialsHighStudy limitations (design and execution)What can raise the quality of evidence? ModerateInconsistencyObservational studiesLowIndirectness Very lowImprecision Publication biasSlide 27Question to the audienceA systematic review of observational studies showed a relationship between front sleeping position (versus back position) and sudden infant death syndrome (SIDS): OR 2.93 (1.15, 7.47). Rate the quality of evidence for the outcome SIDS:HighModerateLowVery lowSlide 28Question to the audienceYou review all colonoscopies for average risk screening in your health system and document a percentage of patient who developed a perforation after the procedure (evidence of free air on imaging). No comparison group without colonoscopy available. Rate the quality of evidence for the outcome perforation:HighModerateLowVery lowSlide 29Question to the audienceSeveral RCTs have shown the effectiveness of natalizumab to induce remission in Crohn's disease. Study/post-marketing data showed 31 cases of potentially lethal progressive multifocal leukoencephalopathy (PML, JC virus related). Rate the quality of evidence for PML:HighModerateLowVery lowSlide 30Quality assessment criteriaStudy designQuality of evidenceLower if ...Higher if...Randomized trialsHighStudy limitations (design and execution)Large effect (e.g., RR 0.5)Very large effect (e.g., RR 0.2) ModerateInconsistencyEvidence of dose-response gradientObservational studiesLowIndirectnessAll plausible confounding would reduce a demonstrated effect Very lowImprecision Publication bias Slide 31"Categories" of quality (1)High: Further research is very unlikely to change our confidence in the estimate of effectModerate: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimateLow: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimateVery low: Any estimate of effect is very uncertainSlide 32Conceptualizing quality (2)High: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate: We are moderately confident in the estimate of effect: The true effect is likely to be close to the estimate of effect, but possibility to be substantially different.Low: Our confidence in the effect is limited: The true effect may be substantially different from the estimate of the effect.Very low: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect.Slide 33Taking it to the next levelAdvantages of systematically assessing quality of evidenceDowngrading and upgrading "on-the-fly" can introduce errorsStudy / yearTreatmentAllocation concealmentBlindingNo outcome (%)AnalysisCommentsREMOBILIZE 2009dabigatran 220 mg QDdabigatran 150 mg QDenoxaparin 30 mg BIDYes (IVRS) (blocks of 6)Patients: YCaregivers: YData coll: PYAdjudic: YData analysts: ?269/862 (31.2%)232/877 (26.5%)239/876 (27.3%)ITT: noLow dose ASA and stocking allowed, but not pneumatic devicesSlide 34GRADE evidence profileAn image of a GRADE evidence profile table is shown for the question "Should antibiotics vs. no antibiotics be used for children with otitis media?".Slide 35Question to the audiencePICO: Should children with otitis media be treated with antibiotics?Rate the overall quality of evidence for this clinical question by evaluating all critical outcomes (use the evidence profile):HighModerateLowVery lowSlide 36Image of the steps involved in the GRADE process.Slide 37Question to the audiencePICO: Should children with otitis media be treated with antibiotics?Rate the overall strength or recommendations:"We recommend early antibiotics in children with acute otitis media.""We suggest early antibiotics.""We suggest against using antibiotics initially.""We recommend against using antibiotics initially."Slide 38Strength of recommendation"The strength of a recommendation reflects the extent to which we can, across the range of patients for whom the recommendations are intended, be confident that desirable effects of a management strategy outweigh undesirable effects."Slide 394 determinants of the strength of recommendationFactors that can weaken the strength of a recommendationExplanationLower quality evidenceThe higher the quality of evidence, the more likely is a strong recommendation.Uncertainty about the balance of benefits versus harms and burdensThe larger the difference between the desirable and undesirable consequences, the more likely a strong recommendation warranted. The smaller the net benefit and the lower certainty for that benefit, the more likely is a weak recommendation warranted.Uncertainty or differences in patients' valuesThe greater the variability in values and preferences, or uncertainty in values and preferences, the more likely weak recommendation warranted.Uncertainty about whether the net benefits are worth the costsThe higher the costs of an intervention—that is, the more resources consumed—the less likely is a strong recommendation warranted.Slide 40Implications of a strong recommendationPatients: Most people in this situation would want the recommended course of action and only a small proportion would not.Clinicians: Most patients should receive the recommended course of action.Policy makers: The recommendation can be adapted as a policy in most situations.Slide 41Implications of a weak recommendationPatients: The majority of people in this situation would want the recommended course of action, but many would not.Clinicians: Be prepared to help patients to make a decision that is consistent with their own values/decision aids and shared decision making.Policy makers: There is a need for substantial debate and involvement of stakeholders.Slide 42Taking it to the next levelExplicit separation of quality of evidence from making recommendations.Correctly balancing the benefits against the undesirable effects.Special challenges: resource use.Increasing transparency in the process of making recommendations.Slide 43Question to the audienceShould patients with chronic hepatitis C be treated with interferon/ribavirin combination? There is high quality evidence for benefits and high quality evidence for harms.Rate the overall strength or recommendations:"We recommend treatment of chronic hepatitis C.""We suggest treatment...""We suggest against treating patients...""We recommend against treating patients..."Slide 44Patient values & preferencesIn the absence of evidence, guideline panels have to function as surrogates to estimate values and preferences (V&P).Consumer involvement can help.Attaching V&P statements to guideline recommendations increases transparency.Slide 45Taking it to the next levelSystematically searching the literature for studies of values and preferences.Systematic reviews of V&P.Querying the guideline panel to rate health utilities of outcomes using case scenarios.Slide 46Question to the audiencePlease select the most appropriate answer. The reason you attended this session:Just interested in the topic.Have been involved in narrative evidence reviews, but have not used any formal grading system.Have used a grading system but not GRADE.Using or considered using GRADE.Slide 47Question to the audiencePlease select the most appropriate answer. Selecting a system to rate the quality of evidence and strength of recommendations, such as GRADE:Appears too expensive to implement.Appears valuable, but still requires substantial upfront expense.Appears to have some upfront cost but long-term savings.I use GRADE—it has been paying off for me.Slide 48Basic dimensionsGuideline work aligns along 3 basic dimensions:High quality vs. low qualityFast vs. slowExpensive vs. cheapSlide 49Ideal vs. practical ad hoc GRADE approachesStageElementsAdvantageCommentIdealSystematic reviewGRADE eTablesQual. of evidenceStrength of rec.Follows highest standardsMethodolog. most rigorousEasily maintainableFully transparent processAccess to methodologistAccess to evidence centersInitially more resource intensive, long-term savingsIntermediaryAd hoc reviewGRADE eTablesQual. of evidenceStrength of rec.Still retaining major advantages of the “ideal approach”Risk of bias higherAccess methodologist rec.Only minimal addl. costInitiationAd hoc reviewGRADE eTablesQual. of evidenceStrength of rec.Option to fully “upgrade” to an “ideal approach”Foundation of a methodologically sound systemRisk of bias higherAccess methodologist prnNo additional costSlide 50Sources of fundingFunders may have an agenda.Industry—tricky.FoundationsPublic—AHRQ, criteria EHC program fit (3: available, relevance for public payer, priority condition).Importance (7: e.g., public interest etc.).No duplication.Feasibility.Impact (6: e.g., addresses inequity).Slide 51Taking it to the next levelLong term planning.Create a high quality guideline product.Attract high quality guideline panel. Unconflicted methodologist (editor).Content expert (deputy editor).Content expert authors.Health economists.Slide 52Taking it to the next levelGRADE evidence profiles: Condensed and standardized summary of evidence.Are increasingly already created as part of a systematic review (e.g., Cochrane reviews).Flexible presentation (e.g., as summary of findings tables).Initial investment.Long-term value.GRADEpro software (tie-in with RevMan).Avoids duplication of efforts across the globe.Slide 53VisionGlobalize the evidence, localize recommendations.Focus on questions that are important to patients and clinicians.Undertake collaborative evidence reviews.Use a common metric to assess the quality of evidence and strength of recommendations.Examined collaborative models for funding.Slide 54GRADE uptakeA page with 20+ government health agency and publishing company logos is shown.Slide 55ConclusionGaining acceptance as international standard because GRADE adds value:Criteria for evidence assessment across a range of questions and outcomes.Sensible, systematic, fostering transparency.Balance between simplicity and methodological rigor. Current as of December 2010 Internet Citation: Grading Evidence and Recommendations: Starting with GRADE Basics Vs. Utilizing the Full Framework. December 2010. Agency for Healthcare Research and Quality, Rockville, MD. http://www.ahrq.gov/news/events/conference/2010/falckytter/index.html