15
7/31/2018 1 The need for QI in Medical Oncology Treatment Guidance and Assessment [email protected] Robert Jeraj Professor of Medical Physics, Human Oncology, Radiology and Biomedical Engineering Director of Translational Imaging Research Program University of Wisconsin Carbone Cancer Center, Madison, WI University of Ljubljana, Slovenia Disclosures AIQ Solutions, Founder and President Types of imaging …... BEFORE DURING AFTER THERAPY DIAGNOSIS and STAGING TARGET DEFINITION TREATMENT ASSESSMENT EARLY LATE Qualitative imaging (Diagnostics) Quantitative imaging (Quantitative Imaging Biomarkers)

The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

1

The need for QI in Medical Oncology Treatment Guidance and Assessment

[email protected]

Robert Jeraj Professor of Medical Physics, Human Oncology,

Radiology and Biomedical Engineering

Director of Translational Imaging Research Program

University of Wisconsin Carbone Cancer Center, Madison, WI

University of Ljubljana, Slovenia

Disclosures

AIQ Solutions, Founder and President

Types of imaging

…...

BEFORE DURING AFTER THERAPY

DIAGNOSIS

and STAGING

TARGET

DEFINITION

TREATMENT ASSESSMENT

EARLY LATE

Qualitative imaging (Diagnostics)

Quantitative imaging (Quantitative Imaging Biomarkers)

Page 2: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

2

QUANTITATIVE

IMAGING BIOMARKERS (QIB)

Biomarkers and surrogate endpoints

Biomarkers are characteristics that are objectively measured and evaluated as an indicator of normal biologic processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention

Biomarkers as surrogate endpoints are biomarkers that are intended to substitute for clinical endpoints. Surrogate endpoints are expected to predict clinical benefit (or harm or lack of benefit or harm) based on epidemiologic, therapeutic, pathophysiologic, or other scientific evidence.

Types of biomarkers

…...

BEFORE DURING AFTER THERAPY

DIAGNOSIS

and STAGING

TARGET

DEFINITION

TREATMENT ASSESSMENT

EARLY LATE

Page 3: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

3

What does make imaging biomarkers

QUANTITATIVE?

Raunig et al 2015, Stat Methods Med Res 24(1): 27-67

Repeatability and reproducibility

Repeatability: The ability of the QIB to repeatedly

measure the same feature under identical or near-

identical conditions.

– These studies are often referred to as

test–retest, scan–rescan, or coffee-break

experiments

Reproducibility: The reliability of the QIB

measuring system in different conditions that might

be expected in a preclinical study or clinical trial or

in clinical practice (e.g. multiple sites)

Raunig et al 2015, Stat Methods Med Res 24(1): 27-67

TREATMENT

RESPONSE ASSESSMENT

Page 4: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

4

Response assessment today

WHO (1979, 1981)1,2

– anatomic

– bidimensional measurement of lesion

RECIST (2000, 2009)3,4 – Response Evaluation Criteria In Solid Tumors – anatomic

– CT/MR based

– unidimensional measurement of lesion

– 4 response categories (CR, PR, SD, PD)

• Complete Response: disappearance

• Partial Response: >30% decrease

• Stable Disease: in between

• Progressive Disease: >20% increase

1 WHO handbook 1979 2 Miller et al. 1981 3 Therasse et al. 2000 4 Eisenhauer et al. 2009

Anatomical vs functional response

Functional CR

Anatomical CR

PET-based response assessment

EORTC, NCI Recommendations (1999, 2005) 1,2

– SUV-based approach

– SUVmean and SUVmax

– Response categories with thresholds (CR, PR, SD, PD)

– Problems • SUVmean – collapse information, sensitivity issues

• SUVmax – noise contamination

• fails to use all available functional data • distribution

• heterogeneity

• no response threshold validation

• few sensitivity studies

• alternative measures

PET Response Criteria in Solid Tumors (PERCIST) (2009) 3

– SUVpeak 1 Young et al. 1999 2 Shankar et al. 2006 3 Wahl et al. 2009

Page 5: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

5

What about harmonization?

0

2

4

6

8

10

12

14

0 6 13

Maxim

um

SU

V

Months

Lung Lesion

Original

DVCT

DVCT

D710

D710

D710

+89%

+74%

-39%

-76%

What about harmonization?

0

2

4

6

8

10

12

14

0 6 13

Maxim

um

SU

V

Months

Lung Lesion

Harmonized

Original

DVCT

DVCT

D710

D710

D710

+89%

+74%

-39%

-76%

D710

D710 +19%

+3%

+13% -62%

METASTATIC

PROSTATE CANCER

Page 6: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

6

Metastatic disease

Metastasis: μετά (“next”) + στάσις (“placement”)

Bone metastases present in 90% of

advanced prostate cancer patients (Bubendorf et al, Hum Pathol. 2000)

– Relative 5-year survival rate is about 30% (SEER 2006-2012)

Despite its clinical importance, little is known about the

principal determinants of metastasis (Mehlen and Puisleux, Nature 2006;

Gupta and Massagué, Cell 2006)

Common destinations from solid

tumors: bones, brain,

lungs, liver

Image source: Wikimedia commons

(Gupta and Massagué, Cell 2006; Tofe et al, J Nucl Med 1975)

90%

Destinations of

Metastasis

Bone Other

70%

How can molecular imaging help?

Whole body imaging provides a complete snapshot of the progression of multiple metastases

The role of imaging in monitoring treatment response is not well established (Wallace et al, J Cancer 2014)

– Often based on responses of a few lesions (Wahl et al, J Nucl Med 2009; Doot et al, Transl Oncol 2010)

• Limited assessment in disease burden

18F-NaF PET/CT, imaging of bone metabolism, ideal for monitoring bone metastases – 100% sensitivity and 70-100% specificity

(Even-Sapir et al, J Nucl Med 2006)

– Quantitative and reproducible whole body scans (Kurdziel et al, J Nucl Med 2012)

Advanced prostate

cancer patient

PERFORMANCE

CHARACTERISTICS OF QIB

Page 7: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

7

NaF PET/CT Scan

Lesion

segmentation

Scan 1

NaF PET/CT Scan

Scan 2

Image feature

quantification

Registration

Lesion

matching

Difference

quantification

Lesion

segmentation

Image feature

quantification

Quantitative Total Bone Imaging (QTBI)

Yip et al, Phys Med Biol 2014; Jeraj and Liu, U.S. Patent 9,161,720

NaF PET/CT acquisition Uptake Localization

SUV

30

0

Lesion Segmentation

Quantitative Total Bone Imaging (QTBI)

NaF PET/CT #1

Quantitative Total Bone Imaging (QTBI)

NaF PET/CT #2

Articulated

Registration

Lesion-based

Response

Progressing

Stable

Responding

SUV

30

0

Page 8: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

8

Multicenter trial of metastatic castrate-resistant prostate cancer patients

– received pre-treatment test-retest 18F-NaF PET/CT scans

Site Patients Bone lesions

University of Wisconsin Carbone Cancer Center 18 265

Memorial Sloan Kettering Cancer Center 11 78

National Cancer Institute 6 68

All 35 411

Test/retest scans

(3-5 days apart) Standardized Uptake Value (SUV)

metrics extracted from an ROI

SUVmax – maximum uptake

SUVmean – average uptake

SUVtotal – total uptake

Repeatability of QTBI (NaF PET/CT)

Test Retest

15 50

What is our quantitative accuracy?

15 50

SUV Feature SUV

64.5 SUVmax 63.7

29.7 SUVmean 28.9

453 SUVtotal 478

SUV Feature SUV

48.2 SUVmax 28.8

22.8 SUVmean 19.4

286.4 SUVtotal 92.7

Low repeatability

High repeatability

What is our quantitative accuracy?

Test Retest

Page 9: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

9

Bland-Altman plots assess measurement agreement (Bland and Altman, J Bioph Stat 2007)

Imaging sites:

UWCCC

MSKCC

NCI UWCCC significantly narrower

variance (p < 0.001)

most repeatable metric *log-transformed

i: individual lesion

How much is repeatability?

Test-retest limits of agreement (LOA) can be used to define significant

changes in imaging features in individual lesions (Lin et al, J Nucl Med 2016)

Limits of agreement: SUVmax

Increasing

Expected

Decreasing

Feature Response

All lesions

Defining limits of agreement

Imaging site affects repeatibility

n pROI 1UW 18

2MSK 10 3NCI 6

All 34

Page 10: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

10

Lesion location affects repeatability

Location No. lesions (%)

Skull 9 (2.19)

Ribs 78 (18.98)

Cervical 17 (4.14)

Thoracic 62 (15.09)

Lumbar 54 (13.14)

Sacrum 31 (7.54)

Shoulder 34 (8.27)

Sternum 15 (3.65)

Arm 13 (3.16)

Pelvis 70 (17.03)

Leg 28 (6.81)

Total 411 (100%)

SUVmax SUVmean SUVtotal

Coefficient of variation (%)

Atlas-based segmentation identified lesion location (Yip , Perk, and Jeraj, 2014)

Repeatability poorest in ribs

CLINICAL IMPLICATIONS OF

QIB IN PROSTATE CANCER

Does this make an impact?

56 mCRPC patients receiving multiple NaF PET/CT scans

early in treatment

– Taxane-based therapy (N = 16)

– Androgen Receptor (AR) pathway inhibitors (N = 40)

QTBI used for Quantitative NaF PET/CT analysis

cycle 1 cycle 2 cycle 3

AR-directed

Docetaxel

Baseline NaF PET/CT Mid-Tx NaF PET/CT

Page 11: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

11

Global correlation to PFS

Baseline

Significant NaF PET/CT correlates

Mid-Tx

Δ (%)

Harmon et al 2017, J Clin Oncol, 7(1):9370

30

0

SUV

months

Mid-Treatment SUVtotal

Mid-Tx functional burden

High NaF burden Low NaF burden

2900 g/mL·cm3

57 lesions

3900 g/mL·cm3

30 lesions

Harmon et al 2017, J Clin Oncol, 7(1):9370

0.5 1.0 1.5 2.0 2.5 3.0

Standardized Hazard Ratio

Variable HR p-value (adj. p)

NL 1.41 0.007 (0.043)

Sacral NL 1.71 0.001 (0.015)

Pelvic NL 1.72 <0.001 (<0.001)

SUVmax 1.49 0.014 (0.053)

Lumbar SUVmax 1.63 0.004 (0.035)

Pelvic SUVmax 1.71 0.002 (0.020)

SUVmean 1.70 0.007 (0.043)

Lumbar SUVmean 1.84 <0.001 (<0.001)

SUVtotal 1.54 0.002 (0.020)

Lumbar SUVtotal 1.58 0.005 (0.038)

Thoracic SUVtotal 2.04 0.001 (0.015)

Regional correlation to PFS

Page 12: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

12

Lesion-level heterogeneity

Does heterogeneity in lesion response

impact our prediction?

43 patients with paired baseline and mid-Tx

3228 lesions tracked across scans

75.2 lesions/patient (range: 3 – 315)

Classify lesion response based on local

test-retest analysis (volume dependent)

Record the proportion of lesions contained

within each response classification group

example patient

Lin et al 2016, J Nuc Med, 57: 1872

Lesion-level heterogeneity

New

Progressing

Stable

Responding

Disappeared

Pro

po

rtio

n (

%)

Relative Lesion Burden

based on ∆iSUVtotal

Nlesions

example patient

Lin et al 2016, J Nuc Med, 57: 1872

40/43 patients exhibit response heterogeneity regardless of burden

Lesion-level heterogeneity

Pro

po

rtio

n o

f N

lesio

ns (

%)

Relative Lesion Burden based on ∆iSUVtotal

All Patients

Page 13: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

13

40/43 patients exhibit response heterogeneity regardless of burden

Non-favorable response dominates progression events!

Lesion-level heterogeneity

Proportion of iSUVtotal favorably

(iCR+ iPR) responding lesions

Ra

dio

gra

ph

ic P

rog

ressio

n-F

ree

Su

rviv

al

P = 0.01

Prop. of iSUVmean non-favorably

(iPD+ iND) responding lesions

>31%

<5%

NaF #1

NaF #2

NaF #3

NaF #4

Sample patient

73 year old, diagnosed in 2/2012 w/ PSA=764 and diffused bony mets (biopsy confirmed)

Treated with combined androgen blockade; developed CRPC by 12/2012

Palliative XRT (T6-L1) in 1/2013

Docetaxel

Abiraterone

Pain to right pelvis

Sample patient

B1

Baseline Scan

T1

12 week response

T2

54 week response

progression scan

Abiraterone

0 12wks 31wks 54wks 62wks

Docetaxel

T3

62 week response

Page 14: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

14

“Oligo-resistance”

PD

ND

SD

PR

CR

B1

Baseline Scan

T1

12 week response

T2

54 week response

progression scan

Abiraterone

0 12wks 31wks 54wks 62wks

Docetaxel

T3

62 week response

Evolution of resistance

Medivation/Pfizer: MDV3100-18 clinical trial

All imaging aspects (coordination, analysis) managed by AIQ

SUVtotal response

Highest

responding

lesion

Bridging the gap…

Highest

non-responding

lesion

PCF Global Challenge 2014

Jeraj, Liu, Tomlins, Perlman, Simoncic

-4 +4

Fold-change [log2]

Page 15: The need for QI in Medical Oncology Treatment Guidance and ...amos3.aapm.org/abstracts/pdf/137-41815-446581-136024.pdf · measure the same feature under identical or near-identical

7/31/2018

15

Summary

Quantitative Imaging Biomarkers (QIB) require:

– Characterization of instrument+analysis variability

• Repeatability and Reproducibility

– Analysis of performance characteristics

• Limits of Agreement Response Thresholds

QIB enable radically different approach to

treatment response evaluation

– Impact on clinical decision making

– Impact on drug development

Thanks to:

Image-guided Therapy Group (UW)

– Enrique Cuna

– Peter Ferjancic

– Daniel Huff

– Christie Lin

– Mauro Namias

– Tim Perk

– Alison Roth

– Matt Scarpelli

– Urban Simoncic

– Marusa Turk

– Damijan Valentinuzzi

– Amy Weisman

– Former students and postdocs…

Medical Physics Research Group (Slovenia)

Funding

– PCF, NIH, Medivation, MVI, Pfizer, UWCCC, State of Wisconsin

Medical Oncology/Hematology

– Glenn Liu

– Doug McNeel

– Ryan Mattison

– Mark Albertini

– Ruth O’Regan

– …

Radiology

– Scott Perlman

– Tyler Bradshaw

– Chris Jaskowiak

Human Oncology

– Paul Harari

– Mark Ritter

Medical Physics

– Ed Jackson

UWCCC TIR, CTD2, DOTs

UW WONIX, NIX