119
Relative Effectiveness of Osteoporosis Treatments to Reduce Hip Fractures in Patients with Prostate Cancer on Continuous Androgen Deprivation Therapy: Systematic Review, Network Meta-Analysis and Cost-Effectiveness Analysis by Yeesha Poon A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Pharmaceutical Sciences University of Toronto © Copyright by Yeesha Poon, 2018

Relative Effectiveness of Osteoporosis Treatments to Reduce ......in patients with non-metastatic prostate cancer on ADT. Outcomes included percentage change Outcomes included percentage

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

  • Relative Effectiveness of Osteoporosis Treatments to Reduce Hip Fractures in Patients with Prostate Cancer on Continuous Androgen

    Deprivation Therapy:

    Systematic Review, Network Meta-Analysis and Cost-Effectiveness Analysis

    by

    Yeesha Poon

    A thesis submitted in conformity with the requirements

    for the degree of Doctor of Philosophy

    Department of Pharmaceutical Sciences

    University of Toronto © Copyright by Yeesha Poon, 2018

  • ii

    Relative Effectiveness of Osteoporosis Treatments to Reduce Hip Fractures in Patients with

    Prostate Cancer on Continuous Androgen Deprivation Therapy:

    Systematic Review, Network Meta-Analysis and Cost-Effectiveness Analysis

    Yeesha Poon

    Doctor of Philosophy

    Department of Pharmaceutical Sciences

    University of Toronto

    2018

    Abstract

    Background: Androgen deprivation therapy (ADT) is widely used in men with advanced prostate

    cancer, and can lead to loss of bone mineral density (BMD) and fractures.

    Osteoporosis treatments are effective in improving BMD, and reducing risk of hip fractures.

    Given the potential benefits, risks, and widely varying costs of osteoporosis treatments in our

    population, we assessed the effects and cost-effectiveness of treatments.

    Methods: A systematic review and a network meta-analysis were conducted using randomized

    controlled trials (RCT) that evaluated bisphosphonates, denosumab, toremifene, and raloxifene

    in patients with non-metastatic prostate cancer on ADT. Outcomes included percentage change

    in BMD from placebo at different bone sites and incidence rates of any fractures.

    A cost-utility model was developed using a state transition model simulating the progression of

    prostate cancer, the incidence of hip fractures and an adverse event from osteoporosis treatments.

    The risk of fracture was conditional on BMD changes, which were modeled as the means of

    determining the effect of treatment on health and cost outcomes. The outcomes were predicted

    hip fracture incidence, quality-adjusted life years (QALYs), expected costs, and incremental

    cost-effectiveness ratios.

  • iii

    Results: Thirteen RCTs were included for analysis. The largest BMD improvement compared to

    placebo at 12-month at femoral neck site was risedronate 6.77% (95% CrI:-6.87-20.27%). Two

    studies reported fractures; toremifene and denosumab studies reported improved incidence of

    new vertebral fracture outcome vs placebo (2.5% vs 4.9%;p

  • iv

    Acknowledgments

    Completing this thesis took tremendous amount of time, effort and perseverance. I would not

    have been able to achieve such accomplishment without acknowledging the following people.

    Dr. Murray Krahn was instrumental in guiding me through these years. His support, trust,

    encouragement and patience during some turbulence time throughout this process were

    characteristics of a true mentor.

    I also would like to thank Dr. Shabbir Alibhai, who responded to my relentless questions on

    clinical practice and Dr. Petros Pechlivanoglou on network meta-analysis. Knowing their times

    were precious, I always felt their willingness to help. I also appreciated the support and advice

    received from Dr. David Naimark, Dr. Jeffrey Hoch and Dr. Manny Papadimitropoulos. All of

    them have pushed me to go that extra mile and get the most out of the learning process.

    Last, but not least, I thank my husband, Paul, and our children, Caitlin, Brandon and Lucas, who

    put up with my numerous nights and weekends on completing this thesis. Without their support

    and understanding, none of this would have been possible.

  • v

    Table of Contents

    Acknowledgments ..................................................................................................................... iv

    Table of Contents ........................................................................................................................v

    List of Tables ............................................................................................................................ ix

    List of Figures .............................................................................................................................x

    List of Appendices .................................................................................................................... xi

    1. Introduction .............................................................................................................................1

    1.1 Background – Prostate Cancer.......................................................................................1

    1.2 Treatments for Prostate Cancer .....................................................................................2

    1.3 Hip Fractures ................................................................................................................3

    1.3.1 Gender Differences in Risk of Hip Fractures ..........................................................3

    1.3.2 ADT as a Risk Factor for Hip Fractures .................................................................3

    1.3.3 BMD and Age as Risk Factors for Hip Fractures ....................................................4

    1.3.4 Screening for Fracture Risks ..................................................................................5

    1.4 Treatments for ADT-Induced Osteoporosis ...................................................................6

    1.5 Rationale for Analyses ..................................................................................................7

    2. Literature Review ....................................................................................................................9

    2.1 Cost-effectiveness Analysis of Screening for Osteoporosis in Men with Prostate

    Cancer .....................................................................................................................................9

    2.2 Current Gaps ............................................................................................................... 10

  • vi

    3. Methods ............................................................................................................................... 12

    3.1 Systematic Review - Background .............................................................................. 12

    3.2 Steps of the Systematic Review ................................................................................. 12

    3.2.1 Research Question .............................................................................................. 12

    3.2.2 Data Sources and Searches .................................................................................. 12

    3.2.3 Study Selection ................................................................................................... 13

    3.2.4 Quality of Study and Risk of Bias Appraisal ....................................................... 16

    3.3 Network Meta-Analysis .............................................................................................. 17

    3.3.1 Network Meta-Analysis - Background ................................................................. 17

    3.3.2 Objective of the NMA .......................................................................................... 18

    3.3.3 Analysis ............................................................................................................... 18

    3.3.4 Evidence Presentation .......................................................................................... 20

    3.4 Cost-Effectiveness Analysis ........................................................................................ 20

    3.4.1 Cost-Utility Analysis – Background ..................................................................... 20

    3.4.2 Economic Assumptions ........................................................................................ 21

    3.4.3 Population ............................................................................................................ 21

    3.4.4 Treatment Strategies ............................................................................................ 22

    3.4.5 Model Structure ................................................................................................... 22

    3.4.6 Model Parameters ................................................................................................ 23

    3.4.7 Adverse Effects .................................................................................................... 25

  • vii

    3.4.8 Mortality .............................................................................................................. 25

    3.4.9 Quality of Life ..................................................................................................... 25

    3.4.10 Costs .................................................................................................................... 26

    3.5 Analysis and Outcomes ............................................................................................... 26

    3.5.1 Expected Value of Perfect Information................................................................ 27

    4. Results .................................................................................................................................. 30

    4.1 Network Meta-Analysis ............................................................................................. 30

    4.1.1 Data Synthesis and Analysis ................................................................................ 30

    4.1.2 BMD Percentage Change Compared to Placebo .................................................. 31

    4.1.3 BMD Percentage Change Between Active Treatments ........................................ 32

    4.1.4 Fracture Risk ....................................................................................................... 32

    4.2 Cost-Effectiveness Analysis ........................................................................................ 33

    4.2.1 Model Validation ................................................................................................. 33

    4.2.2 Base Case Analysis .............................................................................................. 34

    4.2.3 Uncertainties ........................................................................................................ 36

    4.3 Sensitivity Analyses ................................................................................................ 37

    4.3.1 Deterministic Sensitivity Analyses ..................................................................... 37

    4.4 Expected Value of Perfect Information ...................................................................... 38

    5. Discussion and Conclusions .................................................................................................. 40

    5.1 Network Meta-Analysis .......................................................................................... 40

  • viii

    5.1.1 Strengths and Limitations..................................................................................... 42

    5.2 Cost-Effectiveness Analysis ..................................................................................... 43

    5.2.1 Strengths and Limitations..................................................................................... 45

    5.2.2 Implications for Practice ...................................................................................... 46

    5.2.3 Implications for Research ..................................................................................... 46

    6. Tables.................................................................................................................................... 48

    7. Figures .................................................................................................................................. 57

    8. References............................................................................................................................. 69

    9. Appendices ........................................................................................................................... 81

    9.1 Appendix 1: Search Strategies ..................................................................................... 81

    9.2 Appendix 2: Characteristics of Included Studies .......................................................... 83

    9.3 Appendix 3: BMD Percentage Change from Baseline ................................................. 85

    9.4 Appendix 4: Deterministic Sensitivity Analyses ......................................................... 87

    9.5 Appendix 5: Maximum Net Benefit and Average Net Benefit per Strategy ................. 88

  • ix

    List of Tables

    Table 1: Cost-effectiveness Analysis - Model Input Parameters

    Table 2: Utility Input Parameters

    Table 3: Cost Input Parameters

    Table 4: Result of Difference in Mean BMD Change between Active Treatments at Total Hip,

    Lumbar Spine, and Femoral Neck Sites

    Table 5: Probabilistic Analyses Base Case Results

    Table 6: Probabilistic Analyses Base Case – Net Monetary Benefits Results

    Table 7: Incidences of Hip Fracture

    Table 8: Deterministic Sensitivity Analyses Results

  • x

    List of Figures

    Figure 1: Cost-effective Analysis Model Schematic

    Figure 2: PRISMA Flow Diagram

    Figure 3: Network Diagram of Treatments – Total Hip, Lumbar Spine and Femoral Neck Sites

    Figure 4: Visual Inspection of Convergence for BMD Change Compared to Placebo at the

    Femoral Neck Site

    Figure 5: Results (BMD Percentage Change compared to Other Treatments) Total Hip

    Figure 6: Results (BMD Percentage Change compared to Other Treatments) Lumbar Spine

    Figure 7: Results (BMD Percentage Change compared to Other Treatments) Femoral Neck

    Figure 8: Cost-Effectiveness Frontier

    Figure 9: Cost-Effectiveness Acceptability Curve – PSA

    Figure 10: Expected Value of Perfect Information Graph

    Figure 11: Cumulative Distribution on Frequency of Incremental Net Health Benefit for

    Probabilistic Analysis Results

  • xi

    List of Appendices

    Appendix 1: Search Strategies

    Appendix 2: Characteristics of Included Studies

    Appendix 3: BMD Percentage Change from Baseline

    Appendix 4: Deterministic Sensitivity Analyses

    Appendix 5: Maximum Net Benefit and Average Net Benefit per Strategy

  • 1

    1. Introduction

    1.1 Background – Prostate Cancer

    Prostate cancer is the most prevalent cancer in men in most developed countries.[1] Despite its

    high prevalence, the prognosis for most diagnosed men is good, with 96% of treated patients

    surviving for ≥5 years.[1] Prostate cancer is mainly a disease of older men. It is diagnosed most

    frequently in men between 60 and 69 years of age.[1]

    The etiology of prostate cancer starts in the early stages of cancerous transformation; in which

    small clumps of cancer cells remain confined to the otherwise normal prostate gland. This is

    known as localized disease. Over time, the cancer cells multiply and spread to the prostate and

    beyond the prostate capsule to form a locally advanced tumour.[2] Localized tumours may

    become metastasized, become more aggressive, and begin to invade nearby structures and spread

    through the pelvic lymph nodes and, via the bloodstream, to distant parts of the body such as

    lungs, bladder, liver, and adrenal glands. Metastatic prostate cancer most commonly affects the

    bones.[2]

    While localized prostate cancer is often asymptomatic, many patients with locally advanced

    disease will suffer from urinary outflow obstruction, hematuria, urinary tract infections and

    irritation during bladder voiding.[3] In metastatic disease, patients may suffer from bone pain,

    weakness of the lower extremities or paralysis if the spinal cord is compressed by bony

    metastases.[4] While these symptoms may be rare, they can have a significant impact on quality

    of life.

  • 2

    However, if prostate cancer is detected early, treatment can be effective. The mortality rate for

    prostate cancer has decreased by 3.3% per year since 2001. This is likely due to a combination of

    earlier detection using prostate-specific antigen (PSA) tests to screen for prostate cancer and

    improved treatment options.[1]

    1.2 Treatments for Prostate Cancer

    Androgen deprivation therapy (ADT) removes (or suppresses) testosterone from the body to

    control prostate cancer. The suppression of testosterone by medication is known as medical

    castration.[5] ADT is prescribed as primary, adjuvant, or neoadjuvant treatment [6] in men with

    advanced prostate cancer or with biochemical recurrence.[7] It may prolong survival by

    decreasing tumor size and activity by suppressing androgens.

    ADT including long-acting synthetic luteinizing hormone releasing hormone (LHRH) agonists

    were introduced in the 1980s and have improved the treatment of prostate cancer by allowing

    effective reduction of testosterone levels without the psychological consequences of orchiectomy

    or surgical castration.[8]

    Due to the negative psychological effect of surgical castration,[9] LHRH agonists have become

    the “gold standard” in many countries. In one study, 78% of patients chose medical castration

    over orchiectomy.[10] LHRH agonists are the most commonly prescribed treatment for men with

    advanced prostate cancer.[7]

  • 3

    1.3 Hip Fractures

    1.3.1 Gender Differences in Risk of Hip Fractures

    The prevalence of osteoporosis and the risk of fracture are both higher in women than in men.

    This is partially due to differences in bone mineral density (BMD), bone size, and bone strength

    between men and women. Even though women fracture more often, men tend to have worse

    outcomes after fractures.[11]

    Gender differences in the epidemiology of hip fracture were extensively reported. The following

    results were found in meta-analyses:[12, 13]

    • men had higher excess annual mortality after hip fracture than in women;

    • men were at an average of 4 years younger than women at the time of fracture;

    • men are sicker than age-matched controls and women with hip fractures;

    • men had substantially higher mortality after hip fracture (i.e., the 1-year mortality for

    men ranged from 9.4% to 37.1%, compared to 8.2% to 12.4% in women. At 2 years, mortality in

    men was reported as high as 42%, compared to 23% for women.)

    1.3.2 ADT as a Risk Factor for Hip Fractures

    Androgen suppression can mediate increased rates of bone resorption and impair new bone

    formation.[14] This can lead to the loss of BMD and subsequent fractures.[15]

    A number of studies have shown that ADT is associated with significant bone loss and increased

    osteoporotic fracture risk [16-20] and such risk increases over time with continued use. The

  • 4

    prevalence of osteoporosis was found to be higher at 49% after 4 years of ADT use and 81%

    after 10 or more years compared to 35% for hormone-naive men with prostate cancer after 4

    years.[21]

    Two Canadian population-based studies in men with prostate cancer found that ADT use is

    associated with increased fragility fracture risk compared to men without ADT (HR 1.65).[16,

    22] Another study also found within 5 years of ADT initiation, 19.4% of subjects had a clinical

    fracture compared to 12.6% of controls (p < 0.001). Moreover, statistically higher rates of

    fracture were noted at every site examined, including spine, hip, radius, and skull.[18]

    A systematic review that assessed long-term side effects of ADT in patients with non-metastatic

    prostate cancer showed that its use was associated with substantial decline in BMD in the first

    year, and slower decline in subsequent years, with increased fracture rates within 5 years.[23, 24]

    1.3.3 BMD and Age as Risk Factors for Hip Fractures

    Low BMD has been recognized as a major risk factor for fractures. Although BMD

    measurements can be taken at different sites (i.e., total hip, lumbar spine and femoral neck),

    BMD measurement at the femoral neck site is a well-accepted predictor of hip fractures.[25-27]

    In fact, BMD measurements, as a measure of risk for hip fracture, have been considered to be

    comparable with the measurements of blood pressure to determine risk for cardiovascular

    disease.[26] The National Osteoporosis Foundation guidelines recommend initiation of drug

    therapy based on the T-Score.[28], which is calculated from the BMD value. However,

    prediction of hip fracture can be enhanced by employing other independent risk factors, in

    particular, age, which is one of the most important risk factors.[26] As patients age, the risk of

    hip fractures increases. This association has been shown to be reproducible in several

  • 5

    populations. Actual hip fracture rates were compared between Canada, US and Germany, all of

    which showed a consistent steep increase in incidence of hip fracture with advancing age in both

    men and women.[29]

    1.3.4 Screening for Fracture Risks

    Clinical practice guidelines on BMD testing in patients with prostate cancer on ADT are

    lacking.[30] In Canada, the British Columbia Cancer Agency (BCCA) recommends baseline

    BMD if ADT is used for >6 months [31] and Alberta Health Services [32] recommends using the

    WHO Fracture Risk Assessment Tool (FRAX®) with BMD. The US Endocrine Society [33],

    Ontario Prostate Cancer Guidelines [34] and the National Comprehensive Cancer Network [35]

    suggest measuring BMD in men 50-69 with risk factors (e.g., ADT). Such testing identifies

    persons with osteoporosis who could benefit from specific therapy.

    The most established way of measuring BMD is the use of dual energy x-ray absorptiometry

    (DXA) to diagnose for osteoporosis and for fracture risk assessment in men.[36]

    The BMD value from white women is a standard reference used to calculate T-score, which is

    the number of standard deviation (SD) above or below the mean young adult peak bone density

    as the reference group.[37]

    1. Normal is a T-score +2.5 to –1.0

    2. Low bone mass is a T-score of –1.1 to –2.4 inclusively

    3. Osteoporosis is a T-score equal to or less than -2.5.

    4. Severe osteoporosis is a T- score equal to or less than -2.5 and a fragility fracture.

  • 6

    The 2010 Canadian Practice Guidelines recommend that all individuals with a T-score of the

    spine or hip ≤-2.5 should be considered as having at least moderate 10-year risk of osteoporotic

    fractures.[38]

    The BCCA’s Genito-urinary Tumour Group recommends if BMD, history or plain films identify

    'osteoporosis' (defined here as a T-score

  • 7

    shown to modestly increase BMD at the hip and lumbar spine sites and was shown to reduce the

    incidence of new vertebral fractures, but no statistical difference was found for hip fractures.[17,

    43] Denosumab was also found to reduce pathological fractures in men with metastatic prostate

    cancer.[44]

    Toremifene, a selective estrogen receptor modulator (SERM), has been studied to reduce fracture

    risks in men receiving ADT. It was found that the 2-year incidence of new vertebral fractures

    had a significant relative risk reduction of 50%. Also, the incidence in all fractures was 10.1%

    (47 patients) with placebo and 6.3% with toremifene, which is a significant relative risk

    reduction of 38% (95% CI 2.2 to 60.2, p=0.036).[45] Toremifene is not currently available in

    Canada.

    All the treatments are effective in preventing bone loss and increasing bone mass and can help

    reduce the risks of fragility fractures such as hip fractures.[43, 46, 47] However, osteoporosis

    medications have known adverse effects which include osteonecrosis of the jaw (ONJ) with

    bisphosphonate [48] or denosumab use,[49] and venous thromboembolism with long-term use of

    SERMs, such as raloxifene.[50] Therefore, the use of osteoporosis treatments require balancing

    benefit and risk. Osteoporosis treatments also vary widely in cost, with newer medications being

    substantially more costly than older, off-patent medications.

    1.5 Rationale for Analyses

    The consequences of fracture are grave. Patients with hip fracture have increased morbidity,

    mortality and cost. Hip fractures cause the most morbidity [51] and are associated with a higher

    risk of death (HR 4.13) in the first year after a hip fracture compared to those without

    fractures.[52] Loss of function and independence may be overwhelming, with 40% of affected

  • 8

    persons unable to walk independently, and 60% requiring assistance a year later.[51] The

    economic burden of fractures in Canadian men was estimated at $570 million in the year 2007 to

    2008.[53] Hospitalizations following hip fractures were most costly at approximately $20,163

    per hospitalization. Over 70% of all fractures occurred in patients older than 70 years with the

    highest number of hospitalizations observed in the 81 to 90-year old age group.[53]

    There are published reviews on effectiveness of bisphosphonates [54, 55] in improving BMD in

    men on ADT. There is, however, no review that includes newer agents (i.e., denosumab) even

    though it has been studied in this setting. Furthermore, there is no head-to-head randomized

    controlled trial (RCT) comparing two or more active treatments.

    The economic attractiveness of reducing the risk of having a hip fracture using different

    treatments with varying effectiveness and risks in men with locally advanced prostate cancer

    treated with ADT is unknown.

    Therefore, from a clinical and policy perspective, there is a potential benefit in treating prostate

    cancer patients initiating on ADT who are at risks for osteoporosis; hence reducing risks of

    fractures. Although screening of osteoporosis may be recommended in this population, it would

    be of interest to treating physicians and policy makers to determine what treatment would be

    most effective and cost-effectiveness in reducing hip fractures in prostate cancer patients on

    continuous ADT.

  • 9

    2. Literature Review

    Although studies were found on cost-effectiveness analyses of osteoporosis treatment in

    reducing risks of fractures, they were mainly studied in women.[56, 57] At the time of the

    systematic review, in men with prostate cancer, there was no systematic review that specifically

    evaluated the effectiveness of all available osteoporosis treatments in improving BMD or

    fracture risks. Recently, the Cancer Care Ontario working group published a systematic review

    and a meta-analysis and found that bisphosphonates were found to be effective in increasing

    BMD, but no benefit has been shown in preventing fractures among patients with non-metastatic

    prostate cancer. Denosumab was shown to improve BMD and reduce the incidence of new

    vertebral fractures in men with non-metastatic prostate cancer.[58]

    Lastly, there is no cost-effectiveness analysis that was published between no treatment,

    bisphosphonates, denosumab and raloxifene.

    2.1 Cost-effectiveness Analysis of Screening for Osteoporosis in Men with

    Prostate Cancer

    There was one cost-effectiveness analysis that examined the prostate cancer population, but it

    was related to screening patients. It evaluated the different screening strategies to prevent

    fractures in men who were receiving ADT in the US.[59] A Markov state-transition model

    simulated the progression of prostate cancer and the incidence of hip fractures was developed

    with a hypothetical cohort of men aged 70 years with locally advanced or high-risk localized

    prostate cancer starting a 2-year course of ADT after radiation therapy.

  • 10

    There were three arms to the model; 1) No BMD test or alendronate therapy, 2) a BMD test

    followed by selective alendronate therapy for patients with osteoporosis, or 3) universal

    alendronate therapy without a BMD test. It was found that in patients starting adjuvant ADT for

    locally advanced or high-risk localized prostate cancer, a BMD test followed by selective

    alendronate for those with osteoporosis is a cost-effective use of resources.

    The incremental cost-effectiveness ratio (ICER) for the strategy of a BMD test and selective

    alendronate therapy for patients with osteoporosis and universal alendronate therapy without a

    BMD test were $66,800 per QALY gained and $178,700 per QALY gained, respectively.

    Therefore, the authors concluded that, among patients who begin adjuvant ADT for locally

    advanced or high-risk localized prostate cancer, BMD testing followed by selective alendronate

    for those with osteoporosis is cost-effective; in addition, for patients at higher risk for hip

    fractures (i.e., older patients and those with histories of fracture or low BMD before ADT),

    routine use of alendronate without BMD testing is justifiable.

    2.2 Current Gaps

    The costs of managing hip fractures is quite significant to the healthcare system and reducing

    risks in patients experiencing a hip fracture is important from a quantity and quality of life

    perspective. A cost-effectiveness analysis on the use of bisphosphonates, denosumab or

    raloxifene on the reduction of hip fracture in men treated with ADT with potential adverse event

    from the osteoporosis treatment would be important from the perspective of the Ministry of

    Health or treating physicians.

    Furthermore, there is no relative effectiveness review on the different osteoporosis treatments in

    this population on BMD improvement or reduction in fracture risks. The objective of the project

  • 11

    was to first, identify and synthesize evidence on the effectiveness of bisphosphonates,

    denosumab and raloxifene in reducing the risks of hip fractures and/or having BMD

    improvement in patients who were treated with ADT using LHRH agonists for at least 6 months.

    A network meta-analysis (NMA) was then conducted, as there is no randomized controlled trial

    that evaluated all active treatments in our population. Lastly, a cost-effectiveness analysis was

    performed in the same population to inform the Ministry of Health on potential decisions on

    reimbursement of these treatments.

  • 12

    3. Methods

    3.1 Systematic Review - Background

    A systematic review was completed with the goal of reducing bias by identifying, appraising,

    and synthesizing all relevant studies.[60] It provided a detailed and comprehensive plan and

    search strategy a priori to identify and synthesize evidence on the efficacy of the use of

    bisphosphonates, denosumab and raloxifene in reducing risks of fragility (i.e., hip) fractures in

    patients who were treated with ADT using LHRH agonists continuously for at least 6 months of

    use. The data gathered was used to conduct a network meta-analysis (NMA).

    3.2 Steps of the Systematic Review

    3.2.1 Research Question

    The research question that guided the study was: What is the most effective fragility fracture

    prevention strategy (i.e., IV zoledronic acid, denosumab, oral bisphosphonates or toremifene) in

    patients 65 years of age or older with locally advanced prostate cancer (stage T3/4 M0) who

    were on adjuvant ADT for at least six months and are progressing through prostate cancer states

    until death?

    3.2.2 Data Sources and Searches

    We developed comprehensive a priori search strategies in conjunction with an information

    specialist. We searched MEDLINE (OVID, 1946-September 2015) and EMBASE (1970-

    September 2015) for all RCTs assessing interventions of interest in patients with prostate cancer

  • 13

    (please refer to Appendix 1 for search strategies). No language restriction was applied. Two

    reviewers (YP & MEH) independently screened citations from literature search and extracted

    data. Conflicts between reviewers were resolved through consensus.

    The Preferred Reporting Items for Systematic reviews and Meta-analyses (PRISMA) guided the

    analysis [61].

    3.2.3 Study Selection

    3.2.3.1 Screening criteria

    A study was considered to be not relevant if it met one of the following criteria:

    • letter, editorial, review, or lay press article

    • Non-human studies

    • endpoints which did not include evaluation of fragility fractures, or BMD values

    Inclusion criteria comprised of RCTs in patients with prostate cancer 18 years of age who

    received ADT continuously for ≥6 months. Patients must be randomized to an active treatment

    or placebo and have measured BMD at baseline and at end of study.

    We excluded RCTs with endpoints which did not include evaluation of fragility fractures, or

    BMD evaluation. We also excluded RCTs of patients with metastatic disease or of non-human

    studies. Letters, editorials, reviews or other secondary research studies, cross over studies, and

    lay press articles were also excluded.

    All RCTs that met the eligibility criteria were included in this analysis.

  • 14

    Active treatments included bisphosphonates (of any strength or route of administration) such as

    alendronate, clodronate, etidronate, pamidronate, risedronate or zoledronic acid (ZA), SERMs

    such as raloxifene and toremifene, and denosumab.

    Screening of articles was based on the titles, abstracts, and keywords of each study. The

    screening criteria was applied as broadly as possible to ensure that only irrelevant studies were

    excluded. The full-text reports of all potentially relevant articles and of articles that were

    designated as “unclear” were retrieved for review.

    Screening of articles was based on the titles, abstracts, and keywords of each study. The

    screening criteria was applied as broadly as possible to ensure that only irrelevant studies were

    excluded. The full-text reports of all potentially relevant articles and of articles that were

    designated as “unclear” were retrieved for review.

    3.2.3.2 Data Extraction

    The data extraction form was tested on three included studies and modified as required. Data

    extraction began when agreement was noted between the reviewers. Then the reviewers

    independently extracted all of the data using the standardized data extraction form. Data were

    stored and managed in Microsoft Excel tables. Discrepancies were resolved by discussion

    amongst the reviewers.

    From the included RCTs, data extractions were based on:

    Study characteristics: year of conduct, sample size, study duration, intervention and comparator,

    respective treatment dose and length of treatment

  • 15

    Patient characteristics: number of patients, mean age and standard deviation, BMD at baseline,

    osteoporosis status (T-score), history and type of baseline fracture (if available)

    Outcome results: binary outcomes (e.g., fracture or no fracture), or the number of fractures in

    each treatment arm was extracted, if available. For continuous data (e.g., lumbar spine, femoral

    neck and total hip BMD value), the percentage or absolute change from baseline in all

    intervention groups were extracted. If the BMD change from baseline was not provided, then the

    value at end of follow-up and the baseline value were used to calculate the change. Standard

    errors (SEs) were used to calculate confidence intervals; the 95% confidence interval is equal to

    1.96×SE on either side of the mean.

    Possible modifiers included the dosages of each intervention, allowed cotreatment (combination

    therapies), the length of follow up (1, 5, and 10 years+), duration of ADT treatment, age of

    patients, and previous and types of fractures. Therefore, the year of study, interventions, dose,

    concomitant therapy (vitamin D and calcium), sample size, duration of ADT use, baseline BMD,

    duration of study and patients’ age were abstracted.

    The primary outcome was percentage change in BMD compared to baseline or placebo. When

    standard error or standard deviation of BMD was not reported, attempts were made to contact the

    authors for missing data, and then established methods were used to impute values [62].

    Otherwise, data were estimated from graphs, when possible, using WebPlotDigitizer.[63] The

    percentage change in BMD was extracted at 12 months to provide consistency in data analysis.

    The secondary outcome was fracture rates.

  • 16

    3.2.4 Quality of Study and Risk of Bias Appraisal

    Bias is a systematic error that can be introduced into randomized controlled trials. Bias can occur

    at any phase of the study, including study design, data collection, or process of data analysis.

    Bias, therefore, can be reduced by rigorously following proper study design process and

    implementation. As some degree of bias is nearly always present in a published study, assessing

    bias can help determine how they can influence a study's conclusions.

    The validity of individual trials was evaluated using the Risk of Bias instrument, endorsed by the

    Cochrane Collaboration.[64, 65] This instrument was used to evaluate the following key

    domains: 1) randomization generation; 2) allocation concealment; 3) blinding of participants,

    personnel and outcome assessors; 4) incomplete outcome data (withdrawal); 5) selective

    outcome reporting; and 6) other sources of bias such as potential for industry bias. The risk of

    bias instrument was used to assign summary assessments of within study bias; low risk of bias

    (low risk of bias for all key domains), unclear risk of bias (unclear risk of bias for one or more

    key domains), or high risk of bias (high risk of bias for one or more key domains).

    The quality of RCTs was assessed using the Jadad scale.[66] Three criteria: randomization,

    blinding and accounting of all patients, were used for assessment. Randomization was evaluated

    based on how randomization was generated (e.g., computer, random number list, coin toss or

    well-shuffled envelopes); blinding was determined whether it was done appropriately (e.g.,

    masking of tablets), and if all patients were accounted for in the study (e.g., discontinued,

    dropped out). A high quality study would have a maximum score of five, with maximum of 2

    points each for randomization and blinding, and 1 point for accountability for all patients.

  • 17

    3.3 Network Meta-Analysis

    3.3.1 Network Meta-Analysis - Background

    To compare outcomes between two interventions for which there is no direct evidence, a

    network meta-analysis (NMA) was utilized. A NMA, also known as a multiple-treatment meta-

    analysis or mixed-treatment comparison analysis, compares multiple treatments simultaneously

    by combining direct and indirect evidence. It is an extension of meta-analysis and is a valuable

    statistical tool for decision makers who need to determine the relative effectiveness across a set

    of alternatives, rather than from just two treatments. The challenge of a meta-analysis is the lack

    of evidence between two active treatments. NMA overcomes this challenge by applying an

    evidence network that involves more than two RCTs and more than two interventions. It

    incorporates indirect comparisons when there is no direct comparison between the interventions

    of interest. By combining both direct and indirect evidence, the objective is to improve the

    precision of estimates.[67]

    The three underlying assumptions of NMA are similarity, consistency and homogeneity. In

    assessing similarity, it was important to determine whether differences among studies may affect

    the comparisons of treatments or make some comparisons inappropriate (i.e., patient populations,

    dosages etc.) Similarity assumption means that studies should only be combined if they are

    considered to be clinically and methodologically similar. For the consistency assumption, the

    results of indirect and direct comparisons should be in general agreement. Finally, even though

    trials may differ on study and patient characteristics, they can be homogeneous if these

    characteristics are not modifiers of the relative treatment effect of different treatments.

  • 18

    Any kind of variability among studies in a systematic review may be termed heterogeneity.

    There can be three different types of heterogeneity: 1) clinical heterogeneity (variability in the

    subjects, interventions and outcomes studied); 2) methodological heterogeneity (variability in

    study design, blinding, risk of bias); and 3) statistical heterogeneity (variability in the

    intervention effects, and is a consequence of clinical or methodological diversity, or both, among

    the studies).

    3.3.2 Objective of the NMA

    We conducted a Bayesian NMA [68] of RCTs to assess relative efficacy of bisphosphonates,

    denosumab, raloxifene or toremifene in improving BMD as a surrogate endpoint of reducing

    risk of hip fractures in patients treated with ADT for ≥6 months of continuous use. The main

    outcome was assumed to be the percentage change in BMD versus placebo.

    3.3.3 Analysis

    The outcomes were conducted using a Bayesian random effects (RE) model. A RE model was

    chosen because we assumed that each study within the analysis has its own true effect given that

    the study characteristics may differ across studies.[69] Relative treatment effects assumed to

    follow a normal distribution with mean 0 and variance 105. For between-trial standard deviation,

    a uniform prior distribution was used with ranges from 0 to 5. Posterior means and 95% credible

    intervals (CrI) for the relative effect of treatments on changes to BMD were estimated. A fixed

    effects (FE) binomial model was used for fracture risk using WinBUGS (version 1.4.3).[70]

    Models were estimated using Markov Chain Monte Carlo (MCMC) simulations, using three

    MCMC chains of 150,000 iterations. Each of the three chains assumed different initial values to

  • 19

    assess sensitivity of the model parameters on the initial values assumed. Non-informative priors

    were used throughout the NMA analysis. The included treatments were ranked by using the

    surface under the cumulative ranking curve (SUCRA) to determine the treatment with the

    highest probability of being most effective in improving BMD. SUCRA ranges between 0 and 1;

    where 1 represents 100% of the time treatment is always ranked first versus 0% that the

    treatment is always ranked last.

    3.3.3.1 Evaluation of Heterogeneity and Similarity

    In order to evaluate clinical and methodological heterogeneity, a tabular summary of studies and

    patients’ characteristics for each pairwise comparison was completed. Conceptually,

    heterogeneity can be assessed via the degree to which differences or similarities in these

    characteristics vary and affect the outcomes (i.e., possible effect modifiers) [71].

    3.3.3.2 Evaluation of Consistency

    Before embarking on further analysis, the consistency assumption had to be met. This was to

    evaluate whether the outcomes between direct and indirect comparisons were in concordance.

    The structure diagram that was developed from the analysis helped to determine the number of

    loops that existed.

    In order to test consistency for single loop, the Bucher method could be used.[67] Inconsistency

    was evaluated by comparing results of direct estimate with indirect estimate. For example, if

    direct evidence existed between denosumab and zoledronic acid, the direct estimate of

    denosumab vs zoledronic acid could be compared to the indirect estimate between denosumab vs

    placebo and zoledronic acid vs placebo.

  • 20

    3.3.3.3 Evaluation of Convergence

    Model convergence was assessed visually. When iteration graphs showed a random scatter

    around a stable mean value, we inferred that convergence was achieved.

    3.3.4 Evidence Presentation

    The findings of the NMA were summarized with a network diagram to show the connections

    between the different comparators; and the thickness of the lines represented the number of

    studies between the comparators. Thicker connecting lines indicated higher numbers of studies

    used for comparison. A table summarized the results of the NMA between the different

    comparators.

    3.4 Cost-Effectiveness Analysis

    3.4.1 Cost-Utility Analysis – Background

    Under a resource constrained healthcare system, economic evaluations help decision makers

    evaluate funding choices. A health economics analysis assesses the expected costs of possible

    treatments and resources consumed; i.e., the expected outcomes from each treatment.

    A cost-utility analysis (CUA) is recommended by economics guidelines [72] as the primary form

    of economic analysis as it estimates the costs and the outcomes of competing treatments

    measured as quality-adjusted life-years (QALYs). Unlike cost-effectiveness analysis (CEA),

    which documents costs per life saved, CUA captures costs per quality of life. Therefore, a CUA

    accounts for both life years gained and the utility or impact of treatments on the quality of those

    life years. Utility is a preference measurement, which suggests how much a population would

  • 21

    prefer to be in one health state (e.g., having locally advanced prostate cancer versus metastatic

    prostate cancer). A utility of zero is the preference assigned to death and one applies to perfect

    health. Therefore, metastatic prostate cancer health state would likely have a lower utility value

    than locally advanced prostate cancer health state.

    The final output of a CUA is an incremental cost-utility ratio (ICUR), which evaluates the

    incremental costs and incremental utility of a treatment compared to a control or standard of

    care. In our analysis, the standard of care is no treatment. There is usually an ICUR value above

    which payers will decide not to fund a treatment. This is called the willingness-to-pay (WTP)

    threshold. Payers would consider a treatment to be cost-effective when the ICUR is lower than

    their pre-defined WTP. Ideally, for payers, the utility of a treatment would be higher and it

    would be less costly than the standard of care.

    3.4.2 Economic Assumptions

    We conducted a cost-utility analysis from a third-party payer perspective. Drug costs and costs

    related to prostate cancer and hip fracture were expressed in 2017 Canadian dollars adjusted for

    inflation using the Bank of Canada’s Consumer Price Index (CPI). Health outcomes and costs

    were discounted at 1.5% per year as per current Canadian guidelines.[73]

    3.4.3 Population

    We simulated a cohort of men with a mean age of 65, who were diagnosed with locally advanced

    prostate cancer (stage T3/4 M0) and who were on continuous ADT. Simulated men entered the

    model with no prior fragility fractures. While men were progressing through the prostate cancer

    health states, as defined under the Model Structure section, they could also sustain a hip fracture.

  • 22

    3.4.4 Treatment Strategies

    Treatments for ADT-induced bone loss included bisphosphonates, a human monoclonal antibody

    (denosumab) and a SERM, e.g., raloxifene. All these treatments are effective in preventing bone

    loss and increasing bone mass through different mechanisms of action. They can reduce risk of

    hip fractures by improving BMD at the femoral neck site in men on ADT [43] and some have

    shown a reduction in fractures in Phase III trials.[45, 47] The specific treatments evaluated in

    this analysis were oral bisphosphonates such as alendronate and risedronate, denosumab,

    zoledronic acid, raloxifene, and no treatment.

    3.4.5 Model Structure

    We created a state transition microsimulation model using TreeAge Pro 2017 [Figure 1].[74]

    The model consisted of prostate cancer health states, hip fracture states, and adverse events due

    to osteoporosis treatments. Patients progressed sequentially through a series of health states

    related to prostate cancer:[75] locally advanced disease (T3/4, M0), biochemical failure (three

    consecutive PSA increases after the nadir has been reached), metastatic castrate-sensitive cancer

    (new metastatic disease that is responsive to hormonal therapy), metastatic castrate resistant

    cancer (asymptomatic/minimally symptomatic cancer that has not been treated with

    chemotherapy) and death [Figure 1]. Probabilistic analysis was used as a base case.

    Deterministic sensitivity analyses were also performed. Variables tested included costs of

    treatments, probability of adverse event, starting ages at 55 or 75, BMD percentage change,

    utility of having hip fracture, and using BMD percentage change values from total hip site

    instead of femoral neck site.

  • 23

    3.4.6 Model Parameters

    Epidemiologic data inputs are described in Table 1.

    3.4.6.1 Prostate Cancer Natural History

    Prostate cancer progression was based on the natural history of the disease from locally advanced

    prostate cancer, to biochemical failure cancer, to metastatic castrate sensitive cancer, to castrate

    resistant cancer health states, and ultimately to death.[76-79] The progression rates were

    estimated from the following sources: published studies, systematic review and other health

    economics studies involving patients who closely resembled our population of patients with

    prostate cancer on ADT, and who progressed from one health state to another.[76-79]

    3.4.6.2 Incidence of Hip Fracture

    Patients could sustain a hip fracture in any of the health states thereby accruing fracture-specific

    costs and decrements in quality-of-life.

    Patients were simulated with an average starting BMD value at the femoral neck site of 0.794

    g/cm2 based on an average starting age of 65 years.[80] Patients could experience a new hip

    fracture on a monthly rate based on the 10-year probability of having a hip fracture which were

    dependent on the T-score and age.[27]

    3.4.6.3 Effectiveness of Osteoporosis Treatments

    Both BMD at the femoral neck site and patient age have been shown to be strong predictors of

    hip fracture in prospective studies.[26, 81]

  • 24

    The treatment effects on reduction of BMD were derived from our network meta-analysis

    comparing the relative efficacy of osteoporosis treatments on reducing risks of fragility

    fractures.[46] BMD loss was modeled to be a function of age and the number of years on

    ADT,[82] while osteoporosis treatments mitigated BMD loss. BMD values were updated

    monthly based on age, years on ADT and the effect of treatment.[46, 82]

    3.4.6.4 T-Score calculation

    Monthly BMD value at femoral neck site for each treatment was calculated using a two-stage

    approach with posterior samples from the Bayesian simulations [67] of a NMA.[46]

    The percentage of BMD change for each treatment was generated from the output of the

    Convergence Diagnostics and Output Analysis (CODA) for each iteration from the NMA.[46]

    Each CODA represented the percentage of BMD change values for different treatments and they

    were incorporated into the model for each simulated patient. The updated monthly BMD value

    at the femoral neck site was calculated by accounting for the BMD loss from ADT and the

    percentage improvement in BMD from each treatment as derived from the CODA outputs.[67]

    Each patient’s T-score was estimated from the difference between the updated monthly BMD

    value and the BMD reference (BMDref) divided by the BMD reference standard deviation

    (BMDref_SD).[80] The ideal BMD reference was derived from a healthy 30-year old adult

    female at the femoral neck site from the National Health and Nutrition Education Survey III

    (NHANES III) reference database for white women.[80] The BMD value from white women is

    the reference standard for both men and women to calculate the T-score .[37] A T-score of 0

    means the BMD is equal to the norm for a healthy, young woman. The more negative the

    calculated T-score, the higher the risk of fracture.[83]

  • 25

    3.4.7 Adverse Effects

    Patients could suffer from ONJ while on bisphosphonates or denosumab [84, 85], and venous

    thromboembolism with raloxifene [86]. These adverse events were selected because they could

    have the most impact on patients’ quality of life and the costs to the healthcare system. The costs

    and disutility of an adverse event were assumed to apply for a maximum of one year. Once

    patients suffered an adverse event, the osteoporosis treatment was discontinued.

    3.4.8 Mortality

    Prostate cancer mortality rates were based on the rate of death associated with their respective

    prostate cancer states.[75] The mortality rate from hip fractures was based on the hazard ratio

    associated with sustaining a hip fracture [87] with the death rate of the normal Canadian male

    population serving as the reference.[88]

    Death from prostate cancer or hip fracture was possible at any time and in any health state. All

    men made monthly transitions between the health states until they died or reached 100 years of

    age. The inputs for the model are described below and are found in Table 1.

    3.4.9 Quality of Life

    Our preference was to use utilities estimated from direct methods such as standard gamble or

    time trade off, but due to inconsistencies in measuring utilities, we used utilities values that were

    available in the published literature that used standard gamble, EQ5D or values from the Cost-

    Effectiveness Analysis Registry [89] for each prostate cancer health state.[90-92] For incident

    and subsequent fracture utilities, we extracted information from systematic reviews on utility

    values associated with osteoporotic fractures.[93, 94]

  • 26

    The disutility of experiencing ONJ with bisphosphonates or denosumab and deep vein

    thrombosis from raloxifene were derived from studies using time trade off method [95] and

    standard gamble [96], respectively. ONJ was defined as stage 3, which meant exposed or

    necrotic bone with pain and infection in the jaw and one or more of: pathologic fracture, extra

    oral fistula, or osteolysis.[95] The utility inputs for the model are reported in Table 2.

    3.4.10 Costs

    Drug costs included acquisition costs (i.e., 2018 Canadian Public Drug Programs Formulary [97]

    price plus 8% wholesaler mark-up) and the dispensing fee ($6.11). Prostate cancer health state

    costs were taken from an Ontario population-based costing study that considered factors such as

    physician visits, potential homecare and diagnostic procedures.[6]

    The hip fracture costs in the first and subsequent years were derived from Canadian data.[98, 99]

    Included costs were hospitalization, homecare and physician services. We assumed that

    treatment costs for prostate cancer were independent of hip fracture status. The costs input for

    the model is provided in Table 3.

    3.5 Analysis and Outcomes

    Probabilistic analysis was used as a base case and included uncertainty in treatment effects,

    patient characteristics and costs. Five hundred samples of 25,000 hypothetical patients were

    performed, in which each input parameter value was drawn from a sampling distribution which

    resulted in empirical output distributions of incremental cost and quality-adjusted life

    expectancy.

  • 27

    Deterministic sensitivity analyses were also performed. Variables tested included costs of

    treatments, probability of adverse events, starting age at 55 or 75 years, BMD percentage change,

    and utility of having a hip fracture. We used BMD percentage change values from total hip site

    instead of femoral neck site.

    We measured both costs and health outcomes over a lifetime horizon, which provided

    cumulative sample estimates of number of hip fractures, costs, life years (LYs) and QALYs. The

    model’s main output was the incremental cost-effectiveness ratio or incremental cost-utility ratio.

    Incremental net monetary benefit [100] at various willingness-to-pay thresholds was also

    calculated to determine the extra net benefit of the different treatments compared to placebo. The

    highest incremental net benefit was considered the most cost-effective.

    The calculation was based on this formula:

    Net benefit (NB) = (QALYs)*Willingness-to-pay - Costs

    3.5.1 Expected Value of Perfect Information

    Expected Value of Perfect Information (EVPI) was calculated to measure the highest costs of

    making the wrong decision based on all parameter uncertainties. EVPI helps decision makers

    answer the question on the maximum costs and value associated with funding additional to help

    eliminate uncertainty in the decision.[101] The calculation for EPVI per person was dependent

    on different willingness-to-pay thresholds, and was calculated as shown in Equation 1:

  • 28

    Equation 1: EVPI = average of the maximum net benefit for all samples with perfect

    information (maximum net benefit) – maximum of the average net monetary benefit for each

    treatment strategy with imperfect or present information (expected net benefit).

    A simplified example is provided below:

    ITERATION Net Benefit (based on willingness-to-pay of $50,000/QALY gained)

    Maximum

    Net

    Benefit

    Placebo Zoledronic

    Acid Denosumab Risedronate Alendronate Raloxifene

    1 $197,790 $180,530 $190,069 $198,573 $198,583 $196,576 $198,583

    2 $227,796 $211,546 $220,314 $229,571 $229,694 $227,113 $229,694

    …500 $434,818 $416,842 $426,620 $434,835 $435,261 $433,254 $435,261

    Expected Net

    Benefit $304,389 $286,877 $296,512 $305,057 $305,182 $302,993 $305,215

    EVPI = $305,215 - $305,182 = $33 per patient

    Population EVPI was determined by using the per patient EVPI, the annual incident of hip

    fractures in our population (I), the lifetime horizon of the treatment (t), and the discount rate (r)

    (Equation 2).

    Equation 2: Population EVPI = EVPI It/(1+r)t t=1,2,3…

    It was calculated by multiplying the an incident rate of 110.4 per 100,000 males [1] by the annual

    prevalence of men with prostate cancer, which was estimated to be 21,300.[1] A discount rate of

    1.5% was used.

    The population EVPI was calculated based on three willingness-to-pay thresholds of $50,000,

    $100,000 and $125,000 per QALY gained. The results provided information to decision makers

    on the maximum value that should be placed on research to gain certainty in the funding

    decisions.

  • 29

    Cumulative distribution plots were created to illustrate the uncertainty in the decision. The

    calculations were based on the results of the probabilistic analyses of 500 samples of 25,000

    iterations. Each iteration provided an incremental net health benefit based on the difference in

    costs, QALY and willingness-to-pay thresholds for treatment options that were on the cost-

    effectiveness frontier. By displaying the incremental net health benefit for each iteration, it

    showed an overall percentage of times that a wrong decision could have been made. A wrong

    decision would be if the incremental net health benefit was negative.

  • 30

    4. Results

    4.1 Network Meta-Analysis

    Studies were stratified by BMD test locations: total hip (TH), lumbar spine (LS), femoral neck

    (FN) sites. A total of 13 RCTs [102-114] were used for analysis. Eleven studies [102-104, 106,

    107, 109-114] evaluated outcomes at the TH site (1618 patients in treatment arm; 1649 patients

    in placebo arm). Thirteen studies [102-114] evaluated outcomes at the LS site (1671 patients in

    treatment arm; 1699 in placebo arm), of which 11 studies [102-106, 109-114] evaluated

    outcomes at the FN site (1527 patients in treatment arm; 1540 patients in placebo arm). The

    studies evaluated six (at TH and FN sites) and seven (at LS site) treatments versus placebo.

    The PRISMA Flow Diagram is presented in Figure 2. The mutual comparator in the analysis was

    placebo to allow us to compare treatments across the trials in the network. The network diagram

    is shown in Figure 3.

    4.1.1 Data Synthesis and Analysis

    The mean age range of patients tested at TH, LS, and FN sites for intervention and placebo

    groups ranged from 65 to 76 years. Nine of eleven TH [102-104, 106, 107, 109-111, 113] and

    FN studies [102-106, 109-111, 113] and 11 of 13 LS [102-111, 113] studies had patients on ADT

    12 months. Two studies that contributed data to all three site analyses had an unknown ADT

    duration [112, 114].

    At baseline, patients in 5 studies had unknown osteoporosis status [103, 104, 109, 112, 114], 4

    studies included patients with normal bone status [102, 106, 110, 113] and 4 included both

  • 31

    normal and osteopenic patients [105, 107, 108, 110]. All patients were recommended to take

    calcium and vitamin D daily, except in one study [103] where intake was unknown. Dosages of

    osteoporosis treatments were consistent with their recommended dosages with the exception of

    ZA, where four studies used 4mg IV every 3 months [104, 107, 110, 112], one study used 4mg

    every 6 months [108] and one used 4mg at 12 months [110]. The characteristics of included

    studies are found in Appendix 2.

    Although the age, prostate cancer stage and co-treatment with calcium and vitamin D were

    similar between the studies, there was some heterogeneity between studies based on duration of

    ADT use and different dosing and frequency of zoledronic acid.

    Formal testing [115] was not applied to test the consistency assumption since there was no direct

    evidence between active treatments to test consistency with indirect evidence. A visual

    inspection of iteration plots showed convergence. Figures 4 present visual plots for the BMD

    change that compared between treatments and placebo at the femoral neck site.

    Based on the Cochrane risk of bias assessment tool, the majority of the trials were at medium

    risk of bias, which included those studies that were randomized and double blinded, but did not

    discuss or report missing data. Most included studies, except one [106], had a Jadad score of at

    least 3.

    4.1.2 BMD Percentage Change Compared to Placebo

    Overall, patients on active treatment had BMD improvements from baseline (Appendix 3).

    Patients on placebo had worsening of their BMD. One exception was an alendronate study,

    which reported that patients on placebo had a BMD improvement of 1.18% vs 0.23% (p=0.631)

  • 32

    for the alendronate arm. Another outlier was a toremifene study, in which BMD declined on

    treatment (-0.1% vs -1.44%; p

  • 33

    p=0.004 at 24 months). Other studies reported fractures as adverse events where data were not

    systematically gathered. The number of events was small, and statistical significance could not

    be determined. One alendronate study [106] reported 1 fragility fracture in each arm vs placebo

    and another [109] reported 3 patients had fractures on placebo and 1 patient while on alendronate

    (p = 0.44). Israeli [107] reported two traumatic fractures with ZA and 1 with placebo, and

    another reported [108] 1 bone fracture in each ZA and placebo arm.

    Hence, a NMA was performed with only two studies on vertebral fracture risks, denosumab (679

    patients) and toremifene (477 patients) [102, 103] as each treatment was compared to placebo at

    24 months. Denosumab was ranked higher than toremifene based on SUCRA of 89.4% of having

    lower risk of vertebral fractures.

    4.1 Cost-Effectiveness Analysis

    4.2.1 Model Validation

    The incidence rate of hip fracture in the placebo arm was 360/100,000 person-years or 0.0036

    per person year. Patients in the simulation generally experienced their first hip fracture between

    age 72 to 77. Therefore, the modeled rate lied within the upper and lower bounds of an age-

    specific hip fracture rate of 270 to 400 per 100,000 person-years during this age range in actual

    patients with prostate cancer [116] The estimated mean overall undiscounted survival within the

    placebo strategy of the model was 12.5 years and was within the observed 12 to 19 year life-

    expectancy of a 65 year-old Canadian man with prostate cancer.[117]

  • 34

    4.2.2 Base Case Analysis

    4.2.2.1 Hip Fracture

    When 25,000 iterations were simulated, analysis showed that patients in the placebo strategy

    sustained the most number of hip fractures, followed by those treated with denosumab and

    raloxifene. Patients in the zoledronic acid strategy sustained the least number of hip fractures.

    However, the differences in the incidence rates were very small, placebo at 0.0036 per person-

    year, denosumab at 0.0030 per person-year, and raloxifene at 0.0029 per person-year. Zoledronic

    acid had the lowest incidence rate at 0.0018 per person-year. Both alendronate and risedronate

    were very similar at around 0.0021 per person-year.[Tables 1 to 3] The relative risk of hip

    fracture for denosumab versus placebo was calculated to be approximately 0.80.

    4.2.2.2 Osteonecrosis of the Jaw and Quality Adjusted Life Years

    The risks of experiencing osteonecrosis of the jaw (ONJ) were highest with zoledronic acid,

    followed by denosumab, and lowest with oral bisphosphonates. Hence, patients experienced

    further decrements of quality of life and increased costs with the adverse event. The combined

    decrements in patients experiencing ONJ and higher number of hip fractures contributed to its

    lower quality adjusted life years (QALY) compared to oral bisphosphonates.

    Expected QALY of the different treatments ranged from 8.8820 (placebo) to 8.9237 (zoledronic

    acid). For all treatment strategies, the undiscounted life years were similar at approximately 12.5

    years.

  • 35

    4.2.2.3 Costs

    Although zoledronic acid was the most effective strategy, it also had the highest lifetime costs

    ($159,310) due to the highest drug cost per month. The factors which contributed to denosumab

    having the second highest lifetime costs were: drug cost and ONJ treatment costs. Placebo had

    the lowest lifetime costs ($139,712) due to lack of drug costs and lack of costs associated with

    adverse events.

    Zoledronic acid was effective in reducing the number of hip fractures, but this effect was not

    sufficiently large to offset the differences in drug costs. Therefore, even though patients without

    osteoporosis treatment experienced a higher number of hip fractures, the hip fracture savings was

    smaller than the higher drug (and adverse event) costs.

    4.2.2.4 Incremental Cost-Effectiveness Ratio

    All treatment strategies were more effective, but more costly than placebo. Although risedronate

    was less costly, it was also slightly less effective than alendronate. Hence, alendronate is the

    most cost-effectiveness treatment strategy with an incremental cost-effectiveness ratio (ICER) of

    $30,400 per QALY gained compared to placebo. Zoledronic acid was the most effective but also

    most costly. The ICER for zoledronic acid was $14.8 million compared to alendronate, which

    would not be a cost-effective option for a third-party payer. Other treatment arms, including

    risedronate, denosumab and raloxifene strategies were all dominated, which meant they were less

    effective and more costly than alendronate.[Table 5]

  • 36

    4.2.2.5 Net Monetary Benefit & Cost-Effectiveness Frontier

    Net monetary benefit was also calculated to summarize the value of each treatment in monetary

    terms based on the different willingness-to-pay (WTP) thresholds from $50,000 to $100,000

    [Table 6]. Alendronate had the highest net monetary benefit for the WTP thresholds from

    $50,000 to $100,000.

    The cost-effectiveness frontier showed that risedronate, raloxifene and denosumab were not on

    the cost-effectiveness frontier; therefore, they were not cost-effective. Alendronate and

    zoledronic acid were on the frontier, and therefore were cost-effective options. However,

    zoledronic acid had a much higher ICER compared to alendronate.[Figure 8]

    4.2.3 Uncertainties

    We represented parameter uncertainty in our model at various cost-effectiveness ratios using a

    cost-effectiveness acceptability curve (CEAC). It used a joint distribution of costs and effects to

    evaluate uncertainty by identifying where the ICER falls in relations to the cost-effectiveness

    plane and the cost-effectiveness threshold.[118] The probabilities represented the proportion of

    the iterations or scatter plot points that fall to the south and east region of the cost-effectiveness

    plane; meaning the proportion of times that the strategy has the highest net benefit at various

    levels of willingness-to-pay.[118]

    The CEAC [Figure 9] showed that alendronate had 72% probability of being the most cost-

    effective treatment in the model iterations at a $50,000 per QALY gained willingness-to-pay

    threshold.[117] Results for other willingness-to-pay thresholds are provided in Figure 2. If the

    willingness-to-pay threshold was at $30,000 per QALY gained, placebo has 70% probability to

  • 37

    be the most cost-effective strategy, while alendronate reduced to have 25% probability to be the

    most cost-effective treatment. Lastly, the $40,000 per QALY gained threshold was the crossover

    between placebo and alendronate strategies. At a willingness-to-pay threshold of $40,000, both

    alendronate and placebo has about 50% probability of having the highest net benefit.

    4.3 Sensitivity Analyses

    4.3.1 Deterministic Sensitivity Analyses

    One-way deterministic sensitivity analyses were performed varying costs of bisphosphonates and

    denosumab, costs of treating hip fracture in the first year, BMD change affected by denosumab

    or risedronate, rate of ONJ with denosumab, and utility of experiencing a hip fracture.[Appendix

    4] These parameters were selected based on the potential costs and quality of life impact that

    patients may experience during treatment. Given that the costs of generic medications can

    decrease to 90% [119], and injectable to 65%,[120] reduced costs were tested. There were two

    parameter values which made risedronate the most cost-effective strategy over the lifetime

    horizon compared to its basecase value: 1) reduction in costs per month by at least 22.5%, which

    equated to approximately less than $9.90 per month (included markup and dispensing fee); and

    2) improvement of risedronate BMD change by about 45%.

    If the BMD percentage change in denosumab was improved by 45% from baseline value, then its

    ICER would be $194,000/QALY gained compared to alendronate. A detailed list of parameters

    and summary of the results of the deterministic sensitivity analyses is provided in Table 8.

  • 38

    Starting ages of patients entering the model were also varied at 55 and 75 years. At age 55,

    alendronate was found to be the most cost-effective strategy. However, at age 75, risedronate

    was the most cost-effective strategy.

    BMD percentage change from total hip site was used for re-analysis instead of femoral neck site.

    A limitation was that in the network meta-analysis, risedronate was not evaluated because of its

    lack of evidence at this bone site. The results showed alendronate was still a cost-effective

    treatment strategy compared to placebo, followed by raloxifene with extended dominance.

    Denosumab and zoledronic acid were also cost-effective, but with ICERs of $6 million and $14

    million, respectively.

    4.4 Expected Value of Perfect Information

    In a budget constraint healthcare system, a decision that is made based on current information

    and the payer’s willingness-to-pay threshold, has a cost associated with making the wrong

    decision. Expected value of perfect information (EVPI) provided the costs associated with

    making a decision if all uncertainties were removed.

    For a willingness-to-pay threshold of $50,000, the costs of gaining perfect information was $33

    per patient. If the threshold was lowered to $25,000, then the costs would increase to $272 per

    patient.[Figure 10] The graph shows the various willingness-to-pay thresholds and the costs

    associated with obtaining perfect information in order to make a decision with certainty.

    Population expected value of perfect information were calculated to be between $710,000 and

    $925,000 for willingness-to-pay thresholds of $50,000 to $125,000.

  • 39

    The full calculations for maximum and average net benefit for 500 samples with 25,000

    iterations are presented in Appendix 5. It provided detailed information of maximum net

    monetary benefit for each strategy at a $50,000 willingness-to-pay threshold.

    The cumulative distribution plots illustrated the uncertainty around making a wrong decision

    based on various willingness-to-pay thresholds. We found that at the $50,000 willingness-to-pay

    threshold between placebo and alendronate, there was about 13% of the time that a wrong

    decision was made to fund i.e., had a negative incremental net health benefit. At $100,000 or

    $125,000 willingness-to-pay thresholds, there was no economic value in reducing uncertainty

    since none of the sample iterations had a negative incremental net health benefit.[Figure 11]

    Similarly, the results for the incremental net health benefit between alendronate and zoledronic

    acid found no economic value in reducing uncertainty because no sample iteration showed a

    negative incremental net health benefit.

  • 40

    5. Discussion and Conclusions

    5.1 Network Meta-Analysis

    We conducted a NMA evaluating all available preventive treatments for osteoporosis in men

    with non-metastatic prostate cancer. Our results showed that all treatments were effective in

    reducing the rate of bone loss when compared to placebo. Treated patients’ BMD change from

    baseline ranged from -1.2% to 6.0%. Some treatments appeared to be supported by stronger

    evidence. Denosumab and zoledronic acid showed improvement in BMD across all sites (~3% at

    TH and FN, ~6% at LS sites). The lower bound of the 95% CrI did not include zero at all sites

    for these two drugs. Similarly, when we used SUCRA to determine treatment rank probability,

    we found that ZA consistently ranked amongst the top two treatments at all sites. Denosumab

    was ranked amongst the top two at the LS and TH sites.

    In the two studies that evaluated the effect of preventive therapy on fracture risk, denosumab and

    toremifene were effective in reducing vertebral fracture risk. In a systematic review that

    evaluated bisphosphonates to prevent fragility fractures, it was found that there was no evidence

    of a difference in effect on fractures between treatments in this class of drugs [121].

    Raloxifene was ranked highest based on SUCRA of 0.8628 compared to other treatments at the

    total hip site. Although raloxifene was ranked highest, based on the relative comparison, there is

    no evidence of significantly better BMD improvement compared to other treatments. A recent

    study[122] evaluated the possible reasons for SUCRA’s uncertainty. The reasons for the

    uncertainty in ranking raloxifene to be most effective could be that the data between the

    comparisons were scarce from the limited number of patients in each arm (20 patients). Another

  • 41

    explanation could be that the population size is too small and did not have sufficient power to

    show statistical difference.

    Factors other than treatment efficacy, such as patient preferences, adverse events and costs are

    also important when choosing a treatment. However, we did not systematically evaluate these in

    this network meta-analysis.

    In terms of patient preferences, ZA was given intravenously every several months (3 to 12

    months) which required a visit to a hospital or clinic. This included travel, waiting, set up and

    monitoring time, and a 15-minute infusion. Denosumab was given subcutaneously every 6

    months. The advantage over ZA is that patients can inject themselves. However, some patients

    may prefer not to self-administer, and some may need additional education by a healthcare

    professional or through homecare. Other bisphosphonates can be taken orally.

    The risk of adverse effects can also influence treatment choice. The relative risk of ONJ with

    fewer ZA infusions per year (as in our population for prevention of bone loss) versus 4 mg every

    4-weeks (for reducing risk of skeletal-related event in metastatic cancer) is 0.002 [123, 124]. The

    incidence of ONJ with all frequencies of ZA is 0-90, with oral bisphosphonates is 1.04-69 [125]

    and with denosumab is 0-30.2 [125], per 100,000 patient-years.

    Out-of-pocket costs borne by patients for infusion therapy (i.e., travelling costs, parking fees,

    time lost from work) and other costs assumed by payer (i.e., nursing time, drug costs, ancillary

    equipment) should also be considered. With respect to drug costs, in the US [126], ZA 4mg/5mL

    costs $60/infusion ($60 to $240/year), denosumab 60mg costs $1075/injection ($2150/year) and

    oral risedronate 35mg costs $25/week ($1300/year) or alendronate 70mg costs $0.40/week

    ($21/year).

  • 42

    Our analyses suggested that IV ZA was a reasonable alternative, followed by oral

    bisphosphonates. Our results did not show that one drug was unequivocally more effective than

    another. All drugs appeared to be effective in reducing the rate of bone loss. In fact, they were

    almost universally shown to be associated with improved BMD, and credible intervals showed a

    significant degree of overlap between agents. Therefore, we believe that the most important

    policy implication of this work is that, because preventive therapy is effective, men who are at

    risk for fracture should receive some form of osteoporosis treatment. Choosing the optimal drug

    should be determined on the basis of efficacy, as reported here, but also on the basis of safety,

    patient preferences, patient and health system costs, and local availability.

    5.1.1 Strengths and Limitations

    Our study has both strengths and limitations. Network meta-analysis facilitates comparison of

    treatments that have not been evaluated in head-to-head trials. We used a well-documented

    surrogate endpoint (i.e., BMD change) to estimate fracture risk, which allowed more studies to

    be included in the NMA than would have been possible had we used fracture as our main

    outcome. The limitations are the lack of direct comparisons, and that some of the indirect

    comparisons were either based on single trials only or that they included few patients in the

    studies. As a result, the credible intervals between comparisons were often wide. Lastly, we were

    bound to draw inference on fracture risk based on BMD as a surrogate endpoint.

    A comprehensive economic evaluation provided additional evidence supporting decision making

    for men with prostate cancer receiving ADT.

  • 43

    5.2 Cost-Effectiveness Analysis

    We conducted a cost-utility analysis that evaluated the cost-effectiveness of alendronate,

    risedronate, raloxifene, zoledronic acid, denosumab and no treatment to reduce the number of hip

    fractures while experiencing an adverse event with osteoporosis treatments in patients 65 years

    or older with prostate cancer who were on continuous ADT. Simulated patients experienced the

    most number of incident hip fractures with placebo, and the least number with zoledronic acid,

    followed by alendronate. Alendronate was found to be the most cost-effective strategy compared

    to placebo. Zoledronic acid had a $14 million/QALY gained compared to alendronate, which is

    far above a conventional payer’s willingness to pay. Even though zoledronic acid may be

    marginally more effective, its costs seemed to outweigh its benefit. Both denosumab and

    raloxifene were less effective and more costly than oral bisphosphonates; hence, these strategies

    were dominated.

    The major drivers of the model were costs and BMD improvements of risedronate and

    denosumab. We tested several variables in one-way sensitivity analyses [Table 3] and

    alendronate was no longer the dominant strategy if the monthly cost of risedronate was either

    reduced or BMD change was improved within the plausible range. Similarly, when denosumab’s

    cost was reduced by 90% to about $7 per month from $67 per month (including markup and

    dispensing fee), its ICER was reduced to $57,600/QALY gained compared to alendronate. Since

    the effectiveness of the treatments was dependent on the BMD improvements from a NMA,[46]

    we tested the BMD change for denosumab as well. If BMD change was improved by at least

    45% relative to its baseline value, denosumab had an ICER of $194,000/QALY gained compared

    to alendronate.

  • 44

    We also calculated the relative risk of denosumab in the simulation compared to the published

    study.[47] The RCT found that fracture at any site happened in fewer patients in the denosumab

    group (38 [5.2%]) than in the placebo group (53 [7.2%]), with a relative risk of 0.72 (95% CI,

    0.48 to 1.07). The number of hip fractures in this simulation between placebo and denosumab led

    to a calculated relative risk of 0.80 (95% CI 0.74 to 0.87). The relative risk was similar between

    this simulation and the RCT. In the sensitivity analysis, we also varied the improvement of

    percentage BMD change with denosumab by up to ±90%, and the result remained the same with

    alendronate being the most cost-effective strategy.

    Canadian and US Guidelines recommend that men who are receiving ADT should be assessed

    for fracture risk, and that to prevent fractures, osteoporosis therapy should be considered.[34, 38,

    127] Other factors to guide pharmacologic therapy selection is patient preference for

    treatment.[38] Alendronate is the most cost-effective treatment and it is convenient to administer.

    It is only taken once weekly orally versus denosumab, which is administered subcutaneously

    every six months, or zoledronic acid, which was given intravenously every three months (as per

    most studies in men with prostate cancer).[128-130] Other concerns for patients are potentially

    the time and out of pocket costs (i.e., parking) to have an intravenous injection in a hospital, or

    rarely, the fear of injection. However, for patients with potential compliance issues, bedridden

    patients where injections are administered by a nurse or caregiver, or patients who cannot

    swallow tablets, injections given every 3 months or longer may be preferred. From a patient’s

    perspective, treatment convenience is a potentially important consideration.

    The older, off-patent drugs such as oral bisphosphonates were most cost-effective. There is very

    little efficacy difference between alendronate and risedronate.[46, 131] At the current prices, the

    newer drugs such as denosumab or zoledronic acid were not cost-effective.

  • 45

    5.2.1 Strengths and Limitations

    The strength of this analysis was that we evaluated all potential treatment options for reducing

    hip fracture incidences, and the potential serious adverse events that could impact quality of life

    and costs.

    One limitation of this analysis was excluding the impact of vertebral fractures. The calculations

    used throughout this analysis were based on BMD at the femoral neck site; however, the use of

    BMD at the lumbar spine site to calculate the reduction of vertebral fracture needs more

    substantiation. This made it difficult to accurately account for vertebral fractures, especially with

    up to one-half of vertebral fractures being radiographically detected but clinically asymptomatic.

    Another analysis found alendronate to be cost-effective for reducing incidence of hip fractures,

    and concluded that its analysis may have underestimated the benefits of alendronate given that