UNDIFFERENTIATED SARCOMAS: …...Figure 3.7 RT-PCR confirmation of the CIC-DUX4 fusion for t(4;19) pilot study 47 Figure 3.8 Identification of the CIC-DUX4 fusion by RT-PCR screening

UNDIFFERENTIATED SARCOMAS: UNDERSTANDING

EARLY SARCOMA DEVELOPMENT

A DETAILED GENOMIC AND TRANSCRIPTOMIC ANALYSIS OF

PAEDIATRIC UNDIFFERENTIATED SARCOMAS

BY

Cassandra Graham

A thesis submitted in conformity with the requirements for the degree of Master of Science

Department of Laboratory Medicine and Pathobiology University of Toronto

Date of Defense: November 3rd, 2010

© Copyright by Cassandra Graham, 2011

ii

ABSTRACT

A Detailed Genomic and Transcriptomic Analysis of Paediatric Undifferentiated Sarcomas

Cassandra Graham

Master of Science, 2010

Department of Laboratory Medicine and Pathobiology

University of Toronto

November 3rd, 2010

Paediatric undifferentiated soft tissue sarcomas (USTSs) are a diagnostically challenging

group of neoplasms. We hypothesized that USTSs contain distinct subgroups that can be

identified based on their morphology, genomic aberrations and expression profiles. We sought

to characterize genomic aberrations within primitive round cell (PRC) sarcomas which may

underlie aberrant expression patterns. Using molecular and cytogenetic analyses, we identified 5

of 18 CIC-DUX4-positive PRC sarcomas. The consistent involvement of the CIC-DUX4 fusion

in a subset of PRC sarcomas suggests a central role for the fusion transcript in such tumours.

These analyses also identified a cohort of CIC-DUX4-negative USTSs with no established

genetic markers. We performed integrative copy number and expression profiling, and identified

significant genomic and transcriptomic changes. We propose that these genes are involved in

biological pathways that are important to the initiation and progression of undifferentiated

sarcoma, and may provide novel insights into the biological events responsible for

sarcomagenesis.

iii

ACKNOWLEDGEMENTS

I would like to express my heartfelt appreciation to those people that have made the completion

of this thesis a reality.

This Master’s thesis is dedicated to my parents, whose never-ending patience and unconditional

support have allowed me the latitude to explore all of my personal and academic pursuits. I

thank my brother and sister for their endless teasing and loving support, which always allow me

to keep things in perspective. I would also like to thank my supportive extended family, and my

Toronto family at 470 Markham Street, who have been there for me every step of the way.

I would like to express my sincerest gratitude to my supervisors, Dr. Gino Somers and Dr. Maria

Zielenska. I am so grateful for the confidence with which you have always supported my

research. The encouragement and guidance that I have received from you were invaluable to the

successful completion of this thesis, and have helped me grow significantly both academically

and personally.

I would also like to acknowledge Dr. Jeremy Squire, who has provided me with so many

opportunities, and without whom I may not have chosen this path. I would like to thank Dr.

Maisa Yoshimoto, Dr. Georges Maire, Jane Bayani and Paula Marrano who have mentored me

for the past 6 years and have served as knowledgeable resources throughout the course of this

project. Without our lunch dates, I might have gone crazy. To Devina Ramsaroop, I couldn’t

have asked for a better labmate or partner in crime at our lab meetings and conferences.

I wish to acknowledge Dr. Cynthia Hawkins and Dr. Mary Shago, for providing me with

direction, advice and genuine enthusiasm over the course of this project. I would also like to

iv

thank the following friends at Sick Kids for all of their help with this thesis: Susan Chilton-

MacNeill, Dr. Bekim Sadikovic, Michael Ho, Pawel Buczkowicz and Dr. Paul Thorner.

Lastly I would like to thank Graeme Mask, who has been so encouraging and supportive over the

past 2 years – I couldn’t have done it without you.

My deepest appreciation to each and every one of you.

v

TABLE OF CONTENTS

ABSTRACT ii

ACKNOWLEDGEMENTS iii

TABLE OF CONTENTS v

LIST OF ABBREVIATIONS vii

LIST OF TABLES AND FIGURES ix

CHAPTER ONE: INTRODUCTION 1

1.1 Overview of Paediatric Soft Tissue Sarcomas 2

1.2 Classification of Paediatric Soft Tissue Sarcomas 2

1.3 Diagnostic Importance of Chromosomal Translocations in Sarcomas 3

1.4 Overview of Paediatric Undifferentiated Soft Tissue Sarcomas 5

1.4.1 Morphological Subtypes of USTS 6

1.4.2 Models of Sarcomagenesis: Cell of Origin & Acquisition of Malignant

Properties 7

1.4.3 Molecular Studies of USTS 15

1.5 Hypothesis, Objectives & Expected Outcomes 18

CHAPTER TWO: MATERIALS & METHODS 19

2.1 Tumour Specimens 20

2.2 DNA and RNA Extraction 20

2.3 Cytogenetic and Spectral Karyotyping (SKY) analyses 21

2.4 Array Comparative Genomic Hybridization (CGH) 22

2.5 Interphase Fluorescence in situ Hybridization (FISH) 23

2.5.1 Isolation of DNA from Bacterial Artificial Chromosomes (BACs) 23

2.5.2 Pilot Project Four-Colour Interphase FISH 24

2.5.3 FISH Validation of Copy Number Gain of EPHA3 25

2.6 Reverse Transcriptase Polymerase Chain Reaction (RT-PCR) for

Detection of CIC-DUX4 26

2.7 Copy Number Arrays 27

2.8 Expression Arrays 28

2.9 Analysis and Integration of Microarray Data 28

2.9.1 Partek Genomics Suite: Data Analysis and Integration 28

2.9.2 Ingenuity Pathway Analysis: Network Identification and

Pathway Analysis 30

2.10 Quantitative Polymerase Chain Reaction (qPCR) 30

2.10.1 Comparative CT Analysis of qPCR Data 31

2.11 Immunohistochemistry (IHC) 31

vi

2.11.1 IHC for CD99 32

2.11.2 IHC for EPHA3 32

CHAPTER THREE: RESULTS & DISCUSSION 33

3.1 Clinicopathological Features of Tumour Samples 34

3.2 t(4;19) Pilot Project 36

3.2.1 Genomic Imbalances in Undifferentiated Sarcomas 36

3.2.2 Karyotypes of Undifferentiated Sarcomas 39

3.2.3 Detailed Comparison of Cytogenetic Karyotyping and Array CGH 40

3.2.4 Four-Colour FISH Validation of 19q Rearrangement and Detection

of Translocation Binding Partners 42

3.2.5 Detection of CIC-DUX4 Fusion Transcripts by RT-PCR 46

3.2.6 Discussion of t(4;19) Pilot Project 48

3.3 Screening of USTS Cohort for the CIC-DUX4 Fusion Transcript 49

3.3.1 Detection of the CIC-DUX4 Fusion by RT-PCR 50

3.3.2 DNA Sequence Analysis of the CIC-DUX4 Positive PCR Products 52

3.3.3 Discussion of Overall Screening of USTS cohort of CIC-DUX4

Fusion Transcript 55

3.4 Discovery of Novel Markers for CIC-DUX4-Negative Tumours 58

3.4.1 DNA Copy Number Analysis 59

3.4.2 Gene Expression Analysis 60

3.4.3 Integration of Copy Number and Gene Expression Analyses 62

3.4.4 Gene Network Analysis 64

3.4.5 Identification of Candidate Genes 65

3.4.6 Gene Expression Validation of Candidate Genes 66

3.4.7 Copy Number Validation of EPHA3 68

3.4.8 Protein Validation of EPHA3 68

3.4.9 Discussion of Discovery of Novel Markers for

CIC-DUX4-Negative Tumours 71

CHAPTER FOUR: SUMMARY, GENERAL CONCLUSIONS & FUTURE DIRECTIONS 77

4.1 Summary & General Conclusions 78

4.2 Future Directions 79

REFERENCES 81

vii

LIST OF ABBREVIATIONS

ARMS Alveolar Rhabdomyosarcoma

BAC Bacterial Artificial Chromosome

CGH Comparative Genomic Hybridization

CSC Cancer Stem Cell

DAPI 4’,6’-diamidino-2-phenylindole

DEPC Diethylpyrocarbonate

DNA Deoxyribonucleic Acid

EDTA Ethylenediaminetraacetic acid

ERMS Embryonal Rhabdomyosarcoma

ES/PNET Ewing Sarcoma/Primitive Neuroectodermal Tumour

FFPE Formalin-Fixed Paraffin-Embedded

FISH Fluorescence in situ Hybridization

IHC Immunohistochemistry

IPA Ingenuity Pathway Analysis

MPNST Malignant Peripheral Nerve Sheath Tumour

MSC Mesenchymal Stem Cell

NSC Neural Stem Cell

PGS Partek Genomic Suite

PRC Primitive Round Cell

qPCR Quantitative Polymerase Chain Reaction

RMS Rhabdomyosarcoma

RNA Ribonucleic Acid

RT-PCR Reverse Transcriptase-Polymerase Chain Reaction

viii

SNP Single Nucleotide Polymorphism

SSC Saline Sodium Citrate

SDS Sodium Dodecyl Sulfate

SKY Spectral Karyotyping

SS Synovial Sarcoma

t() Translocation

TCAG The Centre for Applied Genomics

USTS Undifferentiated Soft Tissue Sarcoma

ix

LIST OF TABLE AND FIGURES

Chapter 1

Figure 1.1 Subclassification of paediatric soft tissue sarcomas 4

Table 1.1 Sarcomas characterized by recurrent chromosomal translocations 5

Figure 1.2 Euclidean clustering of sarcoma tumours based on eight stem cell markers 11

Figure 1.3 Immunohistochemical staining of CD56 12

Figure 1.4 Decoding the early events of sarcoma development 14

Chapter 2

Table 2.1 Primer sequences used for RT-PCR screening of USTS cohort 26

Chapter 3

Figure 3.1 Histological features and CD99 staining of USTSs used in

t(4;19) pilot study 34

Table 3.1 Clinicopathological features, CD99 staining patterns, and CIC-DUX4

fusion transcript status of the cohort of tumours used in study. 35

Figure 3.2 Composite analysis of genomic imbalances in USTSs used in

t(4;19) pilot study 38

Table 3.2 Summary of USTS karyotypes by G-banding and SKY analyses 39

Figure 3.3 SKY of USTSs with 19q13 rearrangement 40

Figure 3.4 Comparative analysis of array CGH and SKY results 42

Figure 3.5 Four-colour FISH validation of 19q rearrangement and detection

of translocation binding partners in USTS1 44

Figure 3.6 Four-colour FISH validation of 19q rearrangement and detection

of translocation binding partners in USTS2 46

Figure 3.7 RT-PCR confirmation of the CIC-DUX4 fusion for t(4;19) pilot study 47

Figure 3.8 Identification of the CIC-DUX4 fusion by RT-PCR screening 51

x

Figure 3.9 Nucleotide sequences of CIC and DUX4 sequences denoting

primer sequences and translocation breakpoints 53

Figure 3.10 Nucleotide and predicted amino acid sequences of the different

CIC-DUX4 fusions 54

Figure 3.11 Predicted protein structure of CIC-DUX4 fusion protein variants 55

Figure 3.12 Overall genomic profiling views from PGS of all USTS samples

interrogated for copy-number changes 60

Figure 3.13 Cluster analysis from PGS of all USTS samples interrogated for

changes in gene expression 61

Figure 3.14 Tumour-specific integration of copy number and gene expression

analyses 62

Figure 3.15 Integration of cumulative copy number and gene expression analyses 63

Figure 3.16 Gene network analysis of copy number and gene expression changes 64

Table 3.3 Candidate genes with USTS-specific genomic and transcriptomic changes 65

identified by integrative and gene network analyses.

Figure 3.17 Changes in Gene expression of candidate genes 67

Figure 3.18 FISH Validation of Copy Number Gain of EPHA3 69

Figure 3.19 IHC for EPHA3 70

1

CHAPTER ONE

INTRODUCTION

2

1. INTRODUCTION

1.1 Overview of Paediatric Soft Tissue Sarcomas

Paediatric soft tissues sarcomas (STSs) are a diverse group of mesenchymal tumours arising in

the connective tissues of the body. These tumours are generally found to be quite aggressive,

particularly if the tumour cells are poorly differentiated, and often present with early hematogenous

metastasis [1-6]. Collectively, STSs account for up to 10% of all paediatric cancers, making them

proportionately more common among children than adults (1% of all adult cancers) [5, 7].

Sarcomas have long been a challenging group of tumours to diagnose and classify due to the

morphological similarities between established subgroups, and the relative rarity of these

neoplasms [8]. However, advances in immunohistochemical analysis and the development of

molecular and cytogenetic techniques have allowed for the discovery of mechanisms by which we

can differentiate between different sarcoma subtypes [9]. Treatment of paediatric STSs consists of

surgery and preoperative chemotherapy. Chemotherapy has been shown to effectively manage

localized, chemosensitive tumours, and when the tumour is caught early enough, surgery can be

curative [6]. Though advances in therapeutics have improved long-term survival, the prognosis of

chemoresistant and metastasized sarcomas remains very poor, with a 5-year survival rate of less

than 60% [6, 10].

1.2 Classification of Paediatric Soft Tissue Sarcomas

Morphologically, sarcomas are divided into rhabdomyosarcomas (RMS), the most common

subtype of STS accounting for up to 50% of paediatric sarcomas, and non-rhabdomyomatous

sarcomas (NRSTSs) (Figure 1.1) [11]. NRSTSs include a very large number of pathologically

diverse tumours including Ewing sarcoma/primitive neuroectodermal tumour (ES/PNET),

3

malignant fibrous histiocytoma (MFH), synovial sarcoma (SS), and many others [11]. At the

genomic level, sarcomas are divided into two major categories (Figure 1.1). The first category is

composed of sarcomas with simple, near-diploid karyotypes which consistently present with

recurrent chromosomal rearrangements [12]. Members of this group include ES/PNET which are

associated with rearrangements of the EWS gene including t(11;22) and t(21;22); alveolar

rhabdomyosarcoma (ARMS), which is associated with t(1;13) and t(2;13); and SS which is

associated with t(X;18) resulting in an SYT-SSX gene fusion product [13-15]. To date,

approximately 41 gene fusions have been associated with 17 different subtypes of sarcomas [16].

The second category is composed of tumours that have very complex karyotypes, but in which no

reproducible chromosomal aberrations have been identified. This group includes malignant

peripheral nerve sheath tumour (MPNST), embryonal rhabdomyosarcoma (ERMS) and

osteosarcoma (OS) [12].

1.3 Diagnostic Importance of Chromosomal Translocations in Sarcomas

The accurate subclassification of paediatric sarcomas has important therapeutic and prognostic

implications [17, 18]. Subclassification is achieved using a variety of diagnostic techniques

including morphological, immunohistochemistal, molecular and cytogenetic analyses [19-21]. In

some cases, the combination of specific morphological features (e.g. biphasic histology for SS, or

spindled and myxoid histology for RMS), together with positive staining for specific antibodies

(e.g. CD99 for ES [19], or myogenin for RMS [22]) allows for the identification of specific

sarcoma subtypes [23]. However certain tumours are particularly difficult for the pathologist to

diagnose due to significant overlap in the histologic and immunohistochemical features of the

tumours [24]. Because the treatment protocols and prognoses of these different sarcoma subtypes

vary immensely, a correct diagnosis is crucial.

4

The discovery of specific chromosomal translocations associated with sarcoma subtypes has

markedly improved the diagnostic accuracy of paediatric sarcomas (Table 1.1) [23-27]. Tumours

with these translocations generally show low genomic complexity, with well-defined recurrent

chromosomal translocations that result in fusion genes with oncogenic fusion protein products. Not

only has the identification of these fusion genes improved the precision of sarcoma

subclassification, it has provided an increased understanding of sarcoma biology by providing a

logical starting point for more thorough functional studies of tumour development [27, 28]. The

identification of specific translocations harboured by different tumours has played a pivotal role in

differentiating between different neoplasms. As such, it has become imperative to use molecular

Figure 1.1. Subclassification of Pediatric Soft Tissue Sarcomas. Morphologically,

pediatric sarcomas are divided into rhabdomyosarcomas (RMS) and non-rhabdomyomatous

sarcomas (NRSTS) (Top). At the genomic level sarcomas are divided into two categories

(Bottom); those with simple, near-diploid karyotypes which consistently present with

recurrent chromosomal rearrangements and those with very complex karyotypes, but in

which no recurrent chromosomal aberrations have been identified.

5

and cytogenetic techniques, in conjunction with traditional histopathological techniques, to identify

these chromosomal translocations so that a correct diagnosis can be made.

TUMOUR TRANSLOCATION FUSION PRODUCT

Alveolar Rhabdomyosarcoma t(2;13)(q35;q14) PAX3-FOXO1A

t(1;13)(p36;q14) PAX7-FOXO1A

Alveolar Soft- Part Sarcoma t(X;17)(p11.2;q25) TFE3-ASPL

Clear-cell Sarcoma t(12;22)(q13;q12) EWS-ATF1

Congenital Fibrosarcoma t(12;15)(p13;q25) ETV6-NTRK3

Desmoplastic Small Round

Cell Tumour t(11;22)(p13;q12) EWS-WT1

Ewing Sarcoma/PNET

t(11;22)(q24;q12) EWS-FLI1

t(21;22)(q22;q12) EWS-ERG

t(7;22)(p22;q12) EWS-ETV1

t(17;22)(q21;q12) EWS-ETV4

t(2;22)(q33;q12) EWS-FEV

Myxoid Chondrosarcoma t(9;22)(q22;q12) EWS-NR4A3

Myxoid Liposarcoma t(12;16)(q13;p11) FUS-CHOP

t(12;22)(q13;q12) EWS-CHOP

Synovial Sarcoma t(X;18)(p11;q11) SYT-SSX1

Table 1.1. Several sarcoma subtypes are characterized by specific chromosomal translocations that

produce fusion oncogenes with protein products. (Adapted from [11]).

1.4 Overview of Paediatric Undifferentiated Soft Tissue Sarcomas

Despite the advancement of diagnostic applications which allow for the identification of

sarcoma subtypes, approximately 5% of sarcomas remain unclassifiable [29-32]. These tumours,

termed undifferentiated soft tissue sarcomas (USTSs), show no specific lineage differentiation,

exhibit no well established or consistent histologic or immunohistochemical profile, and harbour no

compelling recurrent molecular aberrations associated with the traditional sarcoma subtypes [32,

33]. Furthermore, the karyotypes of these sarcomas are largely variable, with some tumours having

a very complex genomic make-up, and other tumours harbouring a near-diploid genome [34]. As

6

such, a diagnosis of undifferentiated sarcoma is largely considered a diagnosis of exclusion,

evoking much debate as to whether this group is composed of tumours with common histological

and biological features, or if these tumours represent a heterogeneous group of primitive

mesenchymal tumours that can differentiate along various soft tissue lineages [9]. USTSs have

been found to be most commonly located in the extremities (46%), the trunk (38%), and the head

and neck (16%) [8, 29]. Treatment protocols included surgery, VAC (vincristine, adriamycin,

cyclophosphamide) chemotherapy and radiation therapy [30]. At the present time, patients

diagnosed with USTS have a 40-50% risk of developing distant metastases within 5 years of

diagnosis [4]. USTSs generally have a poorer prognosis compared to RMSs [8, 30, 35, 36], with a

44% overall 5-year survival rate [8, 30], though these results may be skewed due to small sample

size and inaccurate diagnoses.

1.4.1 Morphological Subtypes of USTS

Due to the rarity of these tumours, only a few studies have been published which seek to

more fully elucidate the pathology behind a USTS diagnosis. Somers et al initiated a study which

sought to examine the clinical and pathological features of the largest cohort of USTSs studied at

the time, in an attempt to identify specific characteristics which could be useful for the diagnosis,

prognosis and treatment of USTS tumours [32]. This study was unable to identify any specific

features shared by the majority of these tumours, but noted that many of the tumours showed a

sheet-like proliferation of densely packed round to plump spindled tumour cells [32]. In a similar

study, Alaggio et al. [9] examined the histology of a smaller cohort of USTSs, and furthered this

notion, determining that USTSs can be broken down into two morphological groups. The first

group contains tumours with a spindled/myxoid morphology, that have elongated cells arranged in

poorly formed bundles or fascicles [9, 32] (see Figure 3.1c taken from Somers et al., 2006 [32]).

7

The second group, which our lab has entitled paediatric primitive round cell (PRC) sarcomas,

contains tumours composed of primitive mesenchymal cells showing no evidence of differentiation,

that are arranged in sheets or nests [9, 32] (see Figure 3.1b taken from Somers et al., 2006 [32]).

These PRC sarcomas represent the most primitive paediatric sarcomas described to date, and thus

form an attractive model in which to study the early events involved in sarcomagenesis.

1.4.2 Models of Sarcomagenesis: Cell of Origin and Acquisition of Malignant Properties

Much research has gone into studying the molecular mechanisms of sarcomagenesis, or the

initiation and development of sarcoma. While significant progress has been made, this complex

process remains highly debated. Two generally accepted models of tumourigenesis are the ‘cancer

stem cell’ (CSC) hypothesis and the ‘dedifferentiation’ hypothesis. The dedifferentiation

hypothesis suggests that cells with more primitive features, such as ability to self-renew and

differentiate down different lineages, may arise in an adult through the process of dedifferentiation

[37]. Overlapping with this model is the stochastic model of tumour formation which proposes that

any cell from a tumour is capable of limitless growth and the ability to spread throughout the body

[38]. These theories are less accepted by modern cancer biologists due to the heterogeneous nature

of most tumours, as well as evidence to the contrary that all tumour cells can re-initiate tumour

formation. The CSC hypothesis suggests that tumours are initiated and sustained by a fraction of

cancer cells derived from tissue stem cells which maintain pluripotency and the capacity to self-

renew [39-41]. In general, these cells are thought to be resistant to chemotherapy, and thus the cells

that are responsible for relapse and metastasis. This theory appears to hold true for numerous

cancer types including leukemia [42, 43], pancreatic cancer [44], and brain tumours [45], in which

specific tumour- initiating cells have been identified. The CSC hypothesis is not without criticisms

8

however, as certain cancers, such as melanoma, have been identified which seem to harbour

numerous cells that are able to initiate and maintain a tumour [40, 46].

Though controversial, a number of publications have suggested that some sarcoma subtypes

may also follow this CSC hypothesis [41, 47-49]. Other studies however, have found that the

ability of some sarcoma-initiating cells to actually initiate tumour formation in vitro and in

xenograft models is inconsistent, and varies from tumour to tumour [50]. Thus, the initiation and

progression of sarcomas remains a complex process that is poorly understood. Crucial to our

understanding of sarcomagenesis are two central and related themes: the cell of origin and the

mechanisms by which cancer-initiating cells acquire malignant properties. Due to the complete

lack of lineage differentiation in the majority of the tumour population, PRC sarcomas are

hypothesized to be the most primitive paediatric STSs that have been identified, and understanding

the molecular processes responsible for the initiation and progression of these tumours will provide

novel insights into the critical biological events responsible for the development of sarcomas in

general.

Cell of Origin

The discovery of the cell of origin is extremely important in understanding the molecular

mechanisms involved in the formation of different sarcoma subtypes. Due to the fact that sarcomas

are mesenchymal-derived tumours, recent studies have focused on mesenchymal stem cells (MSCs)

as the cells of origin for sarcomas [37]. MSCs are capable of differentiating along numerous

differentiation pathways including adipocytic, chondrocytic, osteogeneic and myogenic lineages.

Recent studies have made progress in identifying the supposed cells of origin of specific paediatric

sarcomas, particularly mxyoid liposarcoma, ES/PNET and RMS. Data from several sources has

supported the notion that these tumours arise from mesenchymal stem cells (MSCs), with

9

expression of the tumour-specific fusion proteins in mesenchymal stem cells (TLS/CHOP in

myxoid liposarcoma [51]; PAX3/FKHR in ARMS [52]; EWS/FLI1 in ES [53-55]) inducing

phenotypic changes similar to those seen in the parent tumour. Furthermore, inhibition of the

EWS/FLI1 fusion gene in ES cell lines has been shown to shift the expression profiles of these cells

lines towards the MSC expression profile [56]. These cell lines were then shown to be able to

differentiate along both adipogenic and osteogenic lineages [56]. While increasing amounts of

evidence are being published to support the notion of MSCs as the cell of origin for many sarcoma

subtypes, this concept has been challenged by theories suggesting ES may emanate from the

neuroectodermal lineage. ES has been shown by immunohistochemistry (IHC) to express neural

and neuroectodermal markers, and one study has shown that the expression of the EWS/FLI1 fusion

transcript in neural crest-derived stem cells (NSCs) induces phenotypic and migratory changes [56,

57].

Traditionally, paediatric USTSs have been classified with ERMSs for the purposes of

treatment and therapeutic trials because these tumours often share a similar response to therapy [8,

58, 59]. More recent reports have suggested that at least some paediatric USTSs have more Ewing-

like characteristics, including sheet-like, round cell morphology, focal CD99 positivity (albeit

cytoplasmic), and occasional ultrastructural features of primitive neural differentiation [9, 33, 34,

60]. As previously mentioned, there is much debate as to whether MSCs or NSCs serve as the cell

of origin of many sarcomas. A recent study from our laboratory applied antibodies against several

MSC and NSC-associated proteins to a variety of sarcoma subtypes (ES, USTS, MPNST, ARMS,

ERMS, SS) in an attempt to determine whether differential expression of stem cell-associated

proteins could be used to aid in the subclassification of paediatric sarcomas [61]. It was found that

based on the expression of eight stem cell-associated proteins, paediatric sarcomas cluster into two

major subgroups (Figure 1.3). The first cluster included ES and USTS, and the second cluster

10

contained the majority of the remaining tumours. Lack of expression of CD56 was significantly

associated with the ES/USTS cluster (Figure 1.4). Interestingly, CD56 is a neural stem cell marker,

and the lack of expression in ESs and USTSs is suggestive of a mesenchymal stem cell origin for

these tumours [61]. However, the results of this study were not conclusive enough to definitively

establish a cell of origin for either ES or USTS. On the other hand, the results from this study do

indicate that USTS is much more closely related to ES than to RMS. Thus, the cell of origin of

most primitive sarcomas remains elusive. However based on the primitive nature of these tumours,

their pathological similarity to ES, and the increasing amounts of evidence supporting MSCs as the

cell of origin for ES, it is likely that MSCs serve as the cell of origin for USTSs.

11

Fig

ure 1

.2.

Eu

cli

dean

Clu

sterin

g o

f S

arcom

a T

um

ou

rs

Base

d o

n E

igh

t S

tem

Cell

Mark

ers.

B

ased

on t

he

expre

ssio

n o

f ei

ght

stem

cel

l-as

soci

ated

pro

tein

s, a

s m

easu

red b

y I

HC

, ped

iatr

ic s

arco

mas

clu

ster

into

tw

o m

ajor

sub

gro

ups.

T

he

firs

t cl

ust

er i

ncl

udes

ES

and U

ST

S,

and t

he

seco

nd c

lust

er c

onta

ins

the

rem

ainin

g t

um

ours

. B

lue

repre

sents

rel

ativ

e under

expre

ssio

n;

red r

epre

sents

rel

ativ

e over

expre

ssio

n;

gre

y r

epre

sents

mid

-lev

el e

xpre

ssio

n;

and

whit

e re

pre

sents

no d

ata.

(A

dap

ted

fro

m [

61])

.

12

Acquisition of Malignant Properties

The striking specificity of individual translocations for different sarcoma subtypes leads one

to question the functional basis of a specific gene rearrangement in a given tumour. More

specifically, does the acquired gene fusion directly influence the histology of the tumour, or does

the histology of the tumour influence which cell types are able to support the oncogenic activity of

the gene fusion product [28]? These concepts speak to the timing of translocation acquisition;

whether fusion transcripts are acquired at later stages of cellular differentiation and tumour

Figure 1.3. Immunohistochemical Staining of CD56. Immunohistochemical

staining of CD56 in USTS (A), ES (B), MPNST (C) and SS (D). Lack of

expression of CD56, a neural stem cell marker, was significantly associated with

the ES/USTS cluster, where positive staining was found in all other tumour types.

A-D, CD56 immunostain with hematoxylin counterstain, original magnification x

400. (Adapted from [61]).

13

formation, and simply facilitate tumour development, or if the acquisition of a translocation occurs

in more primitive cells, and is the critical event in the formation of a specific histologic subtype of

sarcoma. Conflicting evidence has been shown in ARMS. Some studies have suggested that the

critical event for RMS sarcomagenesis occurs after the tumour cells have acquired myoblastic

properties [37, 62]; whereas others suggest that the malignant phenotype occurs in the more

primitive MSCs [28, 37]. The latter theory is being more widely supported, as numerous studies

have now shown that PAX3 is an important myogenic regulator [63, 64], and that dysregulated

PAX3 (in the form of the ARMS fusion transcript PAX3-FOXO1) represses myogenic

differentiation [64-66]. The ability of PAX3 to commit cells to the myogenic lineage is however

only seen in specific cell types such as those of mesenchymal origin, and cannot induce muscle

commitment in endothelial cells [67]. Thus it is conceivable that dysregulation of PAX3 through a

genomic rearrangement in a very primitive cell may generate a skeletal muscle-derived tumour

[37]. Overall much evidence is supporting the notion that the critical events responsible for tumour

formation occur in more primitive cells as opposed to fully differentiated cells. Furthermore, the

degree of tumour differentiation is being linked to the point of cellular differentiation at which a

specific translocation is acquired [37]. For example, if the translocation is acquired early in lineage

commitment, the resulting tumour is much more primitive, however if the translocation is acquired

later in lineage commitment, the tumour will be well-differentiated. This has led our lab to develop

a hypothetical model of USTS molecular aberration acquisition (Figure 1.5), in which PRC

sarcomas acquire molecular aberrations at a much earlier stage of differentiation relative to other

sarcoma subtypes. The critical events leading to the development of most sarcoma subtypes are

believed to occur in mesenchymal progenitor cells. USTSs, on the other hand, are believed to be

the proliferation of uncommitted mesenchymal cells. It is this difference in the timing of

acquisition of molecular aberrations that results in the more primitive morphology of USTSs.

14

Gaining a better understanding of this process in more primitive tumour types may offer insight

into the specific mechanisms of sarcomagenesis.

Figure 1.4. Decoding the early events of sarcoma development. Many sarcomas, such as

rhabdomyosarcoma, show evidence of differentiation down a specific connective tissue

lineage. Undifferentiated sarcoma shows no specific lineage differentiation, leading to the

hypothesis that the critical molecular events leading to undifferentiated sarcoma development

are likely happening at an earlier stage of differentiation compared to other sarcoma

subtypes.

15

1.4.3 Molecular Studies of USTS

Identifying diagnostic and prognostic markers which help to more accurately identify

sarcoma subtypes is invaluable. Unfortunately, as with the morphological studies of USTS, the

molecular studies of these tumours have been rather uncommon and sporadic. However, in the past

5 years progress has been made in terms of characterizing some of the molecular abnormalities

arising in these tumours, although small sample sizes and minimal clinical correlation have been

limiting factors in these studies.

Immunohistochemical Screens

In general, immunohistochemical screening of USTS have been unsuccessful because of the

uniform negativity seen for most immunohistochemical markers, and inconsistent positivity for

others. However, Somers et al. [32] recently performed a thorough molecular analysis of 13

USTSs and identified certain molecular features of this specific USTS cohort.

Immunohistochemistry identified positivity for vimentin (92%), CD117 (92%), VEGF (69%),

HER-2 (54%), WT1 (46%), and Cox-2 (31%), among others. Alaggio et al. [9] performed a similar

immunohistologial study on 7 USTS tumours and found variable positivity for smooth muscle

actin, muscle-specific actin, CD34, cytokeratin, S-100, CD117, nestin, FLI1, CD105, survivin and

INI1. Discrepancies between these studies are likely a result of the small sample sizes used in each

study. While some of these immunohistochemical results are promising, more confirmatory studies

are necessary in order to determine the importance of these findings.

INI1

A specific type of sarcoma called malignant rhabdoid tumour has been found to harbour a

characteristic loss or mutation of the INI1 gene on chromosome 22q11.2. The INI1 protein is part

16

of the SWI/SNF chromatin-remodelling complex. This complex can act as either a transcriptional

activator or repressor, and has been found to be constitutively expressed in all cells [68, 69]. In a

recent study Kreiger et al [33] sought to investigate whether USTSs show a loss of INI1 protein

expression due to underlying genomic abnormalities in the INI1 gene. By immunohistochemisty,

the study found 5/17 (29%) cases of USTS showing loss of nuclear expression of INI1. They

further identified by fluorescence in situ hybridization (FISH), reverse transcription-polymerase

chain reaction (RT-PCR) and/or mutational analysis that 4 of these 5 tumours harboured genetic

abnormalities of the INI1 gene. The report goes on to suggest that this category represents a unique

subgroup of tumours. Unfortunately others have found data inconsistent with these results,

suggesting that a category of USTSs showing INI1 loss may not be significant [9, 32].

IGF2

A recent publication by Somers et al. applied gene expression array analysis and post-array

validation to a series of USTSs and identified consistent upregulation of the excitatory components

of the insulin-like growth factor pathway. Particularly high expression of the insulin-like growth

factor 2 (IGF2) ligand was identified and confirmed using RT-PCR. IHC identified overexpression

of the IGF2 protein in 19 of 21 tumours (90%) with 2 distinct staining patterns (diffuse cytoplasmic

(16/19) and punctuate perinuclear (3/19)). Convincing and consistent upregulation of this pathway

in USTS suggests that the IGF signalling pathway may be a critical early event in sarcomagenesis,

however functional studies are necessary to confirm these studies.

CIC-DUX4 translocation

Over the past decade there have been sporadic case reports of tumours with PRC

morphology that harbour a rearrangement involving chromosome 4 and chromosome 19. In 1996,

17

Richkind et al. [70] reported a balanced t(4;19)(q35;q13.1) translocation in a tumour diagnosed as a

malignant extraskeletal sarcoma. This translocation did not resurface in the literature again until

2006, when Kawamura-Saito et al. [60] reported two primitive Ewing-like sarcomas harbouring

t(4;19)(q35;q13) translocations. This group further deduced that in both cases, this translocation

resulted in the fusion of the CIC gene (19q13) to the C-terminal portion of the DUX4 gene (4q35).

Finally they found that overexpression of the CIC-DUX4 transcript in vitro increased anchorage-

dependent colony formation in murine NIH3T3 fibroblasts and directly induced over-expression of

downstream ETS-family transcription factors [60]. A more detailed description of the CIC and

DUX4 genes, as well as the CIC-DUX4 translocation, can be found in section 3.3.3. Most recently,

Rakheja et al. [71] published the 4th

case of a PRC sarcoma harbouring a t(4;19) translocation.

Unfortunately no group has attempted to characterize the prevalence of this translocation in a larger

cohort of USTS.

18

1.5 Hypothesis, Objectives & Expected Outcomes

The accurate sub-classification of sarcomas is of utmost importance in order to determine an

accurate prognosis and to plan optimal therapy for the patient. Based on the limited literature that

has been published on USTSs, I hypothesize that paediatric undifferentiated sarcomas contain

distinct subgroups that can be identified based on their morphology, genomic aberrations and

expression profiles. To test this hypothesis, three objectives are proposed. Firstly, we seek to

develop an assay with which to screen our cohort of USTSs for the t(4;19)(q35;q13.1), and

ultimately determine the prevalence of this translocation in our cohort of USTSs. Secondly we seek

to characterize genomic aberrations within the t(4;19)(q35;q13.1)-negative USTSs, looking

specifically for any novel recurrent regions of gain, loss or rearrangement. Lastly, we seek to

characterize the expression profiles of the t(4;19)(q35;q13.1)-negative USTSs with the aim of

identifying a unique expression signature. With these objectives, we hope to identify specific

genomic changes which will aid in the characterization and diagnosis of USTSs, establish unique

genomic and protein expression profiles for USTSs for diagnostic and prognostic use, and to

identify potential genes or proteins for targeted treatment.

19

CHAPTER TWO

MATERIALS & METHODS

20

2. MATERIALS & METHODS

2.1 Tumour Specimens

An electronic search of the pathology database at the Hospital for Sick Children was

performed to identify children diagnosed with USTS between 1987 and 2007. Primary tumours

involving viscera and bone, and intradural tumours, were excluded. Pretreatment biopsy specimens

were studied if available, and all tumours underwent extensive immunohistochemical and

molecular genetic screening before being included in the series of USTS (see [32] for methodology

used). Thirteen of the 22 USTSs (USTS1-4, USTS6-11, USTS16-18) were part of previous studies

looking at the clinical, pathologic and cytogenetic features of paediatric USTSs [31, 32, 34, 72].

Three of the USTSs were originally diagnosed as atypical Ewing sarcomas (USTS20-22), due to

the fact that none of these tumours harboured a demonstrable rearrangement of the EWS gene by

RT-PCR, but had PRC morphology. Primary tumour samples were used when available; for one

patient, the primary tumour specimen was not available and therefore one post-therapy specimen

was studied. Frozen tissue from five additional paediatric USTSs (USTS12-15, USTS19) were

obtained from the files of the Co-Operative Human Tissue Network (Columbus, OH). Clinical

details were not available for one such tumour. All studies were performed in accordance with the

guidelines of the Hospital for Sick Children’s Research Ethics Board.

2.2 DNA and RNA Extraction

High molecular weight genomic DNA was extracted from snap-frozen tissue digested in a

proteinase K buffer and purified by standard phenol/chloroform methods [73]. All samples were

treated with RNase A (Roche Diagnostics, Laval, QC, Canada) for 30 minutes at 37oC. Total RNA

was extracted from snap-frozen tumour tissue samples using the TRIzol reagent according to the

21

manufacturer’s instructions (Invitrogen, Carlsbad, CA), and treated with RNase-free DNase I

(Message Clean; GeneHunter Corporation, Nashville, TN) for 30 minutes at 37oC to remove DNA

contaminants. RNA extraction from formalin-fixed paraffin-embedded (FFPE) tissue samples was

performed as previously described [31]. In brief, tissue sections were deparaffinized in xylene and

subsequently washed in absolute ethanol. Following centrifugation, the tissue pellet was air-dried

and digested overnight at 55oC in 350 μl of RNA lysis buffer (20mM Tris pH 7.5, 10mM EDTA,

1% SDS, 500mg of Proteinase K). RNA extraction was then carried out using the TRIzol reagent

according to the manufacturer’s instructions (Invitrogen, Carlsbad, CA), and treated with RNase-

free DNase I (Message Clean; GeneHunter Corporation, Nashville, TN) for 30 minutes at 37oC to

remove DNA contaminants. Quality and concentration of extracted RNA and DNA samples were

quantified by the NanoDrop ND 3300 fluorospectrometer (NanoDrop products, Wilmington, DE).

2.3 Cytogenetic and Spectral Karyotyping (SKY) Analyses

Tumour specimens were processed for short-term culture in high-glucose Dulbecco’s

modified Eagle’s medium (4.5g/L) supplement with 15% fetal calf serum, 1% penicillin

streptomycin, and 1% L-glutamine [34]. The cultures were harvested within 5 days using 0.1

mg/mL colcemid (Gibco/BRL) for 2-4 hours, hypotonically treated with 0.075M KCl, and fixed in

3:1 methanol:acetic acid [34, 74]. The slides were dropped and aged for 2 to 3 days, after which

they were stained with Giemsa [34]. By convention, a minimum of 10 metaphases were analyzed

using conventional G-banding methods according to standard procedures, and were immediately

destained and dehydrated for subsequent fluorescence SKY analysis. The SKY analysis was

performed on the tumour metaphase cells according to the manufacturer’s instructions (Applied

Spectral Imaging, Carlsbad, CA) and as previously published [34, 74, 75]. Briefly, slides were

destained, formalin fixed, and denatured in 70% formamide/2XSSC at 75oC for 40 seconds. The

22

SKYPaints were denatured, preannealed and hybridized to the denatured slides for 24 hours at

37oC. Posthybridization washes and detections were conducted according to the manufacturer’s

instructions. Spectral images were acquired with an SD 200 spectral bioimaging system (ASI,

Migdal Ha’Emek, Israel), and were analyzed using ASI SkyView software (version 1.2). When

sufficient metaphase cells of adequate quality were present, 10 metaphase cells were analyzed.

2.4 Array Comparative Genomic Hybridization (CGH)

All tumour specimens were selected to include only those tissue samples containing >80%

of identifiable neoplastic tissue by routine histological characterization. The Agilent human

genome CGH microarray 44 k (USTS1, USTS16) and 244 k (USTS2) (Agilent Technologies, Santa

Clara, CA) were used for the array CGH experiments. Three micrograms of human genomic DNA

from multiple anonymous male donors (Promega, Madison, WI) and 3 μg of test genomic DNA

sample were digested with AluI (5 units) and RsaI (5 units) (Promega, Madison, WI) for a

minimum of 2 hours at 37oC. Digestion quality was assessed by the DNA 1000 LabChip Kit

(Agilent 2100 Bioanalyzer). Labeling reactions were performed using the Agilent genomic DNA

labeling kit PLUS according to the manufacturer’s instructions. Briefly, the reference and sample

DNA were labeled with 1.5-3 mmol/L Cy5-dUTP or Cy3-dUTP (Agilent), and purified using a

Centricon YM-30 filter (Millipore, Billerica, MA). Probe mixture of Cy3-labeled sample DNA,

Cy5-labeled reference DNA, 50 μL of 1.0 mg/mL of human Cot-1 DNA (Invitrogen Canada,

Burlington, ON, Canada), 52 μL of Agilent 10x blocking agent and 260 μL of Agilent 2X

hybridization buffer was denatured at 100oC for 1 minute 30 seconds and incubated at 37

oC for 30

minutes. The probe was applied to the array using an Agilent microarray hybridization chamber,

and hybridized for 40 hours at 65oC in a rotating oven (Robbins Scientific, Sunnyvale, CA) at 20

rpm. Dye swaps were performed for both USTS1 and USTS2 samples. Arrays were washed

23

according to the manufacturer’s recommendations, and then air dried and scanned using an Agilent

2565AA DNA microarray scanner. For each tumour, the data analysis of array CGH was

performed by applying rank segmentation, with a significance threshold of 1.0x10-6

, a maximum

contiguous probe spacing of 1,000 kb, and a minimum of 5 probes per segment (Nexus Copy

Number v.4; BioDiscovery, El Segundo, CA). Genomic imbalances were assigned as either gain

[log(3/2) or threshold of 0.2] or loss [log(1/2) or threshold of -0.3], considering only genomic

intervals >100 kb. The aberrations on X and Y chromosomes were excluded from the analysis to

eliminate the sex mismatching bias.

2.5 Interphase Fluorescence in situ Hybridization (FISH)

Interphase FISH was used in the pilot project portion of this thesis for the purposes of

identifying the translocation partners for chromosome 19 in samples USTS1 and USTS2.

Interphase FISH was also used in the exploratory array portion of this thesis in an attempt to

validate copy number changes of the EPHA3 gene in USTS12-15 and USTS19.

2.5.1 Isolation of DNA from Bacterial Artificial Chromosomes (BACs)

DNA was isolated from individual BAC clones grown overnight in a 20ml culture of Luria-

Bertani (LB) medium containing 12.5 μg/ml chloramphenicol (Sigma-Aldrich Canada Limited,

Oakville, ON, Canada). BAC DNA was isolated using the Qiagen Plasmid Midi Kit as per the

manufacturer’s instructions. Briefly, cell pellets were resuspended in Tris-EDTA (50mM Tris-

Chloride, pH 8.0, 10mM EDTA) and lysed in sodium hydroxide (200mM NaOH, 1% SDS).

Following neutralization with potassium acetate (3M KOAc, pH 5.5), BAC DNA was precipitated

in isopropanol for 1 hour at -80oC, and resuspended in DEPC water. Samples were then treated

with RNase A (Roche Diagnostics, Laval, QC, Canada) for 30 minutes at 37oC to remove RNA

24

contaminants and DNA was isolated by standard phenol-chloroform methods, precipitated with

isopropanol and resuspended in sterile water. Quality and concentration of extracted DNA was

quantified by the NanoDrop ND 3300 fluorospectrometer (NanoDrop products, Wilmington, DE).

2.5.2 Pilot Project Four-Colour Interphase FISH

For the pilot project portion of this study, a four-colour interphase FISH method was

applied to USTS1 and USTS2 paraffin sections to validate the genomic rearrangement involving

chromosomes 4 and 19. The following BAC clones were used for BACs located at chromosome 19

(a-e), chromosome 4 (f,g), and chromosome 20 (h): (a) 19p13.12~p13.11 (16.00-16.42 Mb, control

probe): RP11-121I1 and RP11- 451E20; (b) 19q13.2 (47.02-47.33 Mb, probe A): RP11-688M4 and

RP11-108I20; (c) 19q13.2 (47.26-47.44 Mb, probe B): RP11-317E13; (d) 19q13.2~q13.31 (47.52-

47.91 Mb, probe C): RP11-374A11 and RP11- 1029C16; (e) 19q13.2 (47.44-47.61 Mb, probe E):

CTC- 790D18; (f) 4q33 (17.05-17.07 Mb): RP11-157C21 and RP11-242A14; (g) 4q35.2 (19.10-

19.11 Mb, 111 kb upstream of DUX4, probe D): RP11-521G19; and (h) 20p12.3 (7.39-7.55 Mb,

probe F): RP11-19D2. The linear order and approximate distances are based on the March 2006

assembly (Hg18) of the University of California, Santa Cruz, genome browser

(http://www.genome.ucsc.edu). DNA from all BAC clones was extracted by standard methods and

labeled with Vysis Spectrum Green-dUTP, Spectrum Red-dUTP, Spectrum Orange-dUTP (Abbott

Molecular, Des Plaines, IL), or Cy5-dUTP (PerkinElmer Life and Analytical Sciences, Waltham,

MA), using a Vysis nick-translation kit according to manufacturer’s instructions. The presence of

CIC, DEDD2, and ERF sequences and correct chromosome location of all BAC clones were

verified by PCR and by hybridization to metaphase spreads from normal peripheral lymphocytes,

respectively. Samples were scanned using the Zeiss Axio Imager.M1 microscope (Carl Zeiss

25

Canada, Toronto, ON) equipped with appropriate filter sets and analyzed with ISIS imaging

software (MetaSystems, Altlussheim, Germany).

2.5.3 FISH Validation of Copy Number Gain of EPHA3

A BAC clone was selected that covers the EPHA3 gene region on 3p11.2 (89.13-89.30 Mb,

RP11-23D18). A commercially available CEP3 DNA probe labeled with Spectrum Orange-dUTP

and specific for the centromeric region 3p11.1-q11.1 (Vysis

Inc., Downers Grove, IL, USA) was

used as a control for both locus-specificity and ploidy analyses. DNA from all BAC clones was

extracted by standard methods and labeled with Spectrum Green–dUTP using the Vysis nick-

translation kit according to manufacturer’s instructions. Locus-specificity of the selected BAC

clones was verified by hybridization to metaphase spreads

from normal peripheral lymphocytes.

Five micrometer histological formalin-fixed and paraffin-embedded tissue sections were pre-treated

as follows: deparaffinization with a series of xylene followed by immersion in 100% ethanol,

incubation in 10mM NaCitrate (pH 6.4) at 80oC for 90 minutes followed by 2XSSC rinse, pepsin

digestion (4mg/ml pepsin in 100ml 0.01N HCl) at 45oC for 20 minutes followed by 2XSSC rinse

and a final ethanol dehydration series (70%, 90% and 100%, 2 minutes each) [76]. Slides and

probes were co-denatured on a hot plate at 75oC for 10 minutes and incubated overnight at 37

oC.

Following a post-hybridization rapid-wash technique consisting of one wash in 0.4xSSC and 0.3%

NP-40 at 72 °C for 3 min, followed by a 5-min wash at room temperature in 2XSSC and 0.1% NP-

40, slides were mounted in DAPI/Antifade (Vector Laboratories, Burlington, ON, Canada) and

visualized with a Zeiss Axioscope fluorescence microscope (Carl Zeiss Canada). The establishment

of EPHA3 gene copy number status was defined relative to the CEP 3 DNA probe gene copy

number.

26

2.6 Reverse Transcriptase Polymerase Chain Reaction (RT-PCR) for Detection of CIC-DUX4

Fusion Transcripts

To test for the presence of the CIC-DUX4 translocation, RT-PCR amplification was

performed using the GeneAmp RNA PCR core kit (Applied Biosystems, Foster City, CA).

Complementary DNA was synthesized by reverse transcription (RT) using 1-2μg of total RNA and

primed using random hexamers. The RT reaction consisted of 15 minute incubation at 42oC

followed by inactivation at 99oC for 5 minutes. Specific amplification of the CIC-DUX4 fusion

transcript was performed using combinations of CIC forward primers and DUX4 reverse primers at

a concentration of 0.2μM for each primer (Table 2.1).

Primer Name Primer Sequence (5'-->3')

DUX4RTr2 [60] TGAGGGGTGCTTCCAGCG

DUX4-R2 ATGCCTTGCATCTGCCC

CIC4120 [60] TGAGTTGCCTGAGTTTCG

CIC2F ATCATGCAGGCTGCCACT

CIC3F CCCTGGAGCTGAGGCT

CIC4F ACTGGCACCGCTGCT

Table 2.1. Primer sequences for the RT-PCR reactions described in the

materials and methods. (Adapted from [30]).

27

Primer sets were designed such that the PCR products were small enough so that the CIC-

DUX4 fusion transcript could be detected from RNA extracted from paraffin, and that primer sets

would detect all known breakpoints. The cycling parameters for each PCR reaction consisted of 40

cycles of 94oC for 30 seconds, 62

oC for 45 seconds, 72

oC for 45 seconds, followed by a final

elongation step at 72oC for 7 minutes. RT-PCR products were size-fractionated by gel

electrophoresis on a 1.5% agarose gel. To determine RNA integrity, cDNA from the same RT

reaction was also amplified with primers from the endogenous housekeeping gene

phosphoglycerate kinase 1 (PGK1). A single product of 247bp was visualized on a 1.5% agarose

gel with ethidium bromide staining (data not shown). A full-length CIC-DUX4 cDNA subcloned

into pGEM (Promega, Madison, WI; a kind gift from Dr. Takuro Nakamura, Department of

Carcinogenesis, Japanese Foundation for Cancer Research, Japan) was used as a positive control for

the CIC-DUX4 fusion transcript.

2.7 Copy Number Arrays

Copy number analysis was performed using the Affymetrix Genome-Wide Human SNP 6.0

array platform (Affymetrix, Santa Clara CA, USA). High molecular weight genomic DNA was

extracted from five snap-frozen t(4;19)(q35;q13.2)-negative USTS tumours (USTS12-15, USTS19)

as described in section 2.2. 500ng of genomic DNA was labeled and hybridized to the array as per

the manufacturer’s instructions at the Centre for Applied Genomics (The Hospital for Sick

Children, Toronto ON, Canada). Quality control was performed by Contrast QC calculation. Data

collected by the International HapMap project [77] was used as the normal control reference

(Affymetrix SNP 6.0 data on 270 normal samples).

28

2.8 Expression Arrays

Genomic RNA expression analysis was performed using the Affymetrix Gene 1.0 ST array

platform (Affymetrix, Santa Clara CA, USA). Total RNA was extracted from five snap-frozen

t(4;19)(q35;q13.2)-negative USTS tumours as described in section 2.2. To test the purity of the

RNA samples, total RNA was run on the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa

Clara, CA). The software for this program provided an RNA Integrity Number (RIN) for each

sample that allowed for estimation of the integrity of the total RNA extracted from each sample

[78]. Normal Adult Human Dermal Fibroblasts were obtained from Cedarlane Canada and

ScienCell Research Laboratories (Cedarlane Canada, Burlington On, Canada; ScienCell Research

Laboratories, Carlsbad CA, USA) and used as normal reference controls. Total RNA (200ng) from

normal human fibroblasts and tumour samples was analyzed as per manufacturer’s instructions at

the Centre for Applied Genomics (The Hospital for Sick Children, Toronto ON, Canada).

2.9 Analysis and Integration of Microarray Data

For the purposes of array analysis and integration two programs were used: Partek

Genomics Suite (PGS) and Ingenuity Pathway Analysis (IPA).

2.9.1 Partek Genomics Suite: Data Analysis and Integration

Gene copy number and RNA expression analyses were performed using the PGS Software.

Data from copy number array experiments and RNA expression array experiments were received in

the form of .cel files (GCOS 1.3 software), and were imported into, analyzed, and integrated using

the PGS software.

The analysis of copy number changes followed the PGS Copy Number Workflow, which

compares signal log2 ratios to a reference of 270 normal HapMap samples [79]. The PGS Genomic

29

Segmentation tool was used to detect amplified and deleted segments with the following

parameters: minimum continuous probes set at 10, p-value threshold of 0.01, and signal to noise

ratio of 0.1. Regions of significant gain or loss were annotated to the corresponding genes on the

Affymetrix Gene 1.0 Array using the publically available HuGene-1_0-st-v1.na24.hg18.-

transcript.csv file. A gene list was subsequently created which contained all genes that were

significantly gained or lost in at least 3/5 (60%) of tumours.

The analysis of the expression array data used the PGS one-way analysis of variance

(ANOVA) tool at P < 0.05 and a cut-off of ±2-fold enrichment. Changes in gene expression were

cumulatively analyzed across all tumours versus normal fibroblast controls. Significantly over- and

under-expressed genes were annotated using the Affymetrix Gene 1.0 Array, which in turn uses the

publically available HuGene-1_0-st-v1.na24.hg18.-transcript.csv file. A gene list was subsequently

created which contained all genes that were significantly over- and under-expressed cumulatively

across all tumours analyzed.

The integration of genes lists of significantly gained and lost, and significantly over- and

under-expressed genes was performed using the PGS Venn analysis tool. The genes in the cross-

section area of the Venn diagram represent genes found to have significant changes in both copy

number and gene expression. From the list of genes found in the Venn diagram cross-section,

genes that showed copy number gain and under-expression or copy number loss and over-

expression were excluded from further analysis (5 of 51 genes). Copy number and gene expression

analysis excluded sex chromosomes to avoid bias in the identification of significant regions owing

to sex differences between some tumour samples and controls.

30

2.9.2 Ingenuity Pathway Analysis: Network Identification and Pathway Analysis

Functional identification of gene networks was performed using the IPA program as

previously described [80, 81]. Excel (.xls) tables representing genomic changes (as per analysis of

the Affymetrix SNP 6.0 microarray data) and differentially expressed genes (as per analysis of the

Affymetrix Gene 1.0 ST microarray data relative to normal fibroblasts) in the tumour samples were

imported as individual experiments using the IPA Core Analysis tool. The analysis was performed

using the Ingenuity Knowledge Database for Humans only, and was limited to known direct

molecular interactions.

2.10 Quantitative Polymerase Chain Reaction (qPCR)

A quality control RT-PCR using primers for the PGK1 housekeeping gene demonstrated

that RNA from all five samples used for the microarray studies were of suitable quality for RT-PCR

experiments. Thus, RNA from the five tumour samples and two normal adult human fibroblasts

were reverse transcribed into complementary DNA using 1μg of total RNA and primed using

random hexamers. The RT reaction consisted of a 15 minute incubation at 42oC followed by

inactivation at 99oC for 5 minutes. Real-time PCR was performed as previously published [82, 83].

0.5μl of cDNA was used for real-time PCR using the SYBR green master mix as per

manufacturer’s instructions and primers at a concentration of 50μM each on the Roche

Lightcycler480 (Hoffmann-La Roche Limited, Mississauga, ON, Canada). Each combination of

tumour sample and gene primer sets was run in triplicate. Reactions were incubated for 5 minutes

at 95oC, followed by 50 cycles of 95

oC for 15 seconds, 60

oC for 40 seconds, and 72

oC for 15

seconds. Product specificity was confirmed by melt-point analysis. Primer sets for the four genes

of interest (EPHA3, SNX10, ADAM9, CDC73) as well as the housekeeping control were obtained

from SABiosciences (SABiosciences: A Qiagen Company, Frederick MD, USA). Data was

31

analyzed by the comparative CT method (see below) using the TATA box-binding protein (TBP) as

the housekeeping control.

2.10.1 Comparative CT Analysis of qPCR Data

Comparative CT analysis, or 2-ΔΔC

T, is a method of presenting quantitative gene expression

[83]. This method calculates the relative gene expression of a gene of interest compared to an

internal control gene using the CT, or threshold cycle, of each test sample [83]. The CT is defined

as the PCR cycle at which the fluorescent signal of the reporter dye crosses an arbitrarily placed

threshold [83]. Plots of cDNA dilutions versus delta CT were performed, and the resultant plots

were consistently close to zero, indicating that the amplification efficiencies of the target and

housekeeping genes were similar for all gene primers. The following formula [83] was used to

determine relative fold-change for genes of interest in each tumour sample:

2-ΔΔC

T = [(CT gene of interest in tumour – CT internal control gene in tumour) – (CT gene of interest

in normal – CT internal control gene in normal)]

Where TBP was used as the internal control reference gene, and pooled adult human dermal

fibroblasts were used as the normal reference sample.

2.11 Immunohistochemistry (IHC)

IHC for CD99 was performed in all tumours in order to determine CD99 expression status.

IHC for EPHA3 was performed on the t(4;19)(q35;q13.1)-negative tumours used in the array

portion of the study in order to attempt to confirm the gain of EPHA3 in some of these tumours.

32

2.11.1 IHC for CD99

Immunohistochemistry was performed on dewaxed slides cut at 4-µm using the Ventana

Benchmark XTTM autostainer (Ventana Medical Systems, Tucson, Arizona, USA) and the

Ventana i-VIEW LSAB DAB kit. The CD99 antibody (Leica-Novocastra, Richmond IL; clone

HO36-1.1) was used at 1:50 following heat-induced epitope retrieval. CD99 staining was

performed on all tumours.

2.11.2 IHC EPHA3

Immunohistochemistry with an EPHA3 antibody (1:300; Abnova, Walnut CA, USA) was

performed on 4-µm thick sections of each tumour. Tumours used for IHC were the same as those

used for array analysis (USTS12-15, USTS19). Optimal conditions of heat-induced epitope

retrieval using the Ventana 3,3’diaminobenzidine tetrachloride kit according to the manufacturer’s

instructions (Ventana Medical Systems Inc., Tucson, AZ). Sections were scored using a previously

established scoring system [32]. Intensity of staining was scored as 1 for low, 2 for moderate, and

3 for high, where the positive control was used as the standard for high intensity. Distribution of

staining was as 1 if less than 10% of cells were positive, 2 if 11-50% of cells were positive, and 3 if

greater than 50% of cells were positive. A combined score of 4 or more was considered positive,

and a combined score of less than 4 was considered negative. Optimization was performed on

tissues identified as highly expressing EPHA3 by the Human Protein Atlas version 6.0

(www.proteinatlas.org) [84-87]. Tissues identified as most favorable for optimization were testis,

kidney, placenta, and breast carcinoma. Ultimately breast carcinoma was identified as the optimal

positive control reference tissue.

33

CHAPTER THREE

RESULTS & DISCUSSION

34

3. RESULTS & DISCUSSION

3.1 Clinicopathological Features of Tumour Samples

In total 22 USTSs were used over the course of this thesis work. Table 3.1 summarizes the

clinicopathological features of the tumours; for one patient, the age and sex was not available. The

clinicopathological features of 13 tumours have been previously described [31, 32, 34, 72]. Briefly,

17 cases were composed of sheets of primitive round to plump spindle cells; one case was

composed of nests of PRCs; and four cases were composed of pure spindle cells within a myxoid,

collagenous or cellular fascicular background (Figure 3.1). There was a slight male predominance

(12:9) and the mean and median age was 9.3 and 11 years, respectively. The most common sites of

involvement were the trunk (50%), the extremities (27%) and the head & neck (23%). CD99

showed cytoplasmic positivity in 3 cases (USTS13-15) and membranous positivity in 5 cases

(USTS1-3, USTS20-21) (Figure 3.1).

Figure 3.1. Histological features and CD99 staining pattern of USTSs used in t(4;19) pilot study.

(A,B) USTS1 and USTS2 exhibited a primitive round cell phenotype. (C) USTS16 exhibited a more

spindled and myxoid phenotype. (D) CD99 staining of USTS2 exhibited weak and inconsistent membranous

staining. (A-C) Hematoxylin-eosin stain, x 200, hematoxylin and eosin. (D) CD99 antibody (Dako,

Carpinteria, CA, 1:50) with hematoxylin countertain. Magnification x 400. (Adapted from [32]).

35

Sample Sex Age Site of Primary Morphology CD99 CIC-DUX4

status

USTS1 F 16 yr Trunk (abdominal wall) PRC, sheets Pos (m) POS

USTS2 M 14 yr H&N PRC, sheets Pos (m) POS

USTS3 F 11 yr Trunk (flank) PRC, sheets Pos (m) POS

USTS4 F 9 yr Trunk (paraspinal) PRC, nested Neg POS

USTS5 F 11 yr Trunk (inguinal) PRC, sheets Neg POS

USTS6 F 12 yr Trunk (chest wall) PRC, sheets Neg NEG

USTS7 F 14 yr Trunk (back) PRC, sheets Neg NEG

USTS8 M 13 yr H&N (neck) PRC, sheets Neg NEG

USTS9 M 12 yr Extremity (knee) PRC, sheets Neg NEG

USTS10 M 3 yr Extremity (toe) PRC, sheets Neg NEG

USTS11 M 4 yr Trunk (shoulder) PRC, sheets Neg NEG

USTS12 M 9 m Trunk (retroperitoneal) PRC, sheets Neg NEG

USTS13 M 9 m Extremity (arm) PRC, sheets Pos (c) NEG

USTS14 F 11 yr H&N (neck) PRC, sheets Pos (c) NEG

USTS15 n/a n/a Extremity (thigh) PRC, sheets Pos (c) NEG

USTS16 M 12 yr Trunk (paraspinal) SC, myxoid Neg NEG

USTS17 M 11 yr H&N (orbit) SC, myxoid Neg NEG

USTS18 F 7 yr Trunk (chest wall) SC, fascicular Neg NEG

USTS19 M 10 yr Trunk (flank) SC, collagenous Neg NEG

USTS20 M 13 yr Extremity (leg) PRC, sheets Pos (m) NEG

USTS21 M 11 yr Extremity (toe) PRC, sheets Pos (m) NEG

USTS22 F 9 m H&N (orbit) PRC, sheets Neg NEG

Table 3.1. Clinicopathological features, CD99 staining pattern, and CIC-DUX4 fusion

transcript status of the cohort of tumours used in the study. USTS, undifferentiated soft tissue

sarcoma; N/A, not available; H&N, head and neck; PRC, primitive round cell; SC, spindled cell; pos,

positive; neg, negative; m, membranous; c, cytoplasmic. (Table modified from [31]).

36

3.2 t(4;19) Pilot Project

The following data has been published as follows: Yoshimoto M., Graham C., Chilton-MacNeill

S. et al., Cancer Genet Cytogenet. 2009;195:1-11 [31].

Prior to the commencement of this thesis project, our laboratory initiated a pilot project to

investigate the presence of a 19q13 rearrangement in our cohort of USTSs. This study applied

high-throughput array comparative genomic hybridization (CGH) together with spectral

karyotyping (SKY), four-colour fluorescence in situ hybridization (FISH), and reverse

transcriptase-polymerase chain reaction (RT-PCR) to a series of three paediatric USTS samples

(USTS1, USTS2, USTS16). Two of these samples have PRC morphology with CD99 positivity,

and the third sample has a spindled and myxoid appearance and is negative for CD99. G-banding

and SKY analyses were initiated and completed prior to my addition to the study group. Array

CGH studies had been initiated prior to my addition to the study, however analysis of the array

CGH results were completed by both myself and Dr. Maisa Yoshimoto. I subsequently played a

major role in the development and application of the four-colour FISH and RT-PCR assays.

3.2.1 Genomic Imbalances in Undifferentiated Sarcomas

High-resolution array CGH analysis was performed to identify: (a) copy number changes of

a whole chromosome; (b) copy number changes affecting a chromosomal arm; (c) small genomic

imbalances (>750kb or ≤ 10Mb); and (d) cryptic microaberrations (>100kb or ≤ 750kb) in the

paediatric USTS samples [34]. From comparison analysis of genomic gains and losses among the

three tumour samples, USTS16 exhibited the least genomic copy number imbalance relative to the

other samples (Figure 3.2). USTS1 and USTS2 exhibited the highest frequency of genomic

complexity (15 and 26 respectively), whereas USTS16 exhibited only 1 imbalance by array CGH.

37

USTS2 harboured the most genomic losses (17/26), compared with USTS1 (3/15) and USTS16

(0/1). Although small genomic imbalances were widely detected in USTS1 (7/15), the high-

resolution genome-wide analysis also identified chromosome-arm (2/15) and whole chromosomal

copy number changes (1/15). USTS2 showed chromosome-arm copy number changes, small

genomic imbalances and cryptic microaberrations (1/26, 10/26, and 15/26, respectively). These

cryptic microaberrations were randomly distributed across the genome. Each genomic imbalance

interval in USTS1 and USTS2 was converted to the corresponding cytoband at 850-band resolution.

Notably, the 19q13 region was consistently rearranged in both USTS1 and USTS2, but not

USTS16. This common 19q13 region rearranged in USTS1 and USTS2 was further interrogated

using SKY and four-colour FISH.

38

Fig

ure 3

.2.

Com

posi

te a

naly

sis

of

gen

om

ic i

mb

ala

nces

in t

hree U

ST

S s

am

ple

s, w

ith

wh

ole

gen

om

ic p

rofi

les

of

(A)

US

TS

16,

(B)

US

TS

1,

an

d (

C)

US

TS

2.

Reg

ions

of

gai

n a

re s

how

n o

n t

he

right

side

of

each

ver

tica

l pro

file

and a

re h

ighli

ghte

d a

s gre

en

sect

ions

on t

he

chro

moso

mal

rep

rese

nta

tions.

Reg

ions

of

loss

are

show

n o

n t

he

right

side,

and a

re h

ighli

ghte

d a

s re

d.

Aber

rati

ons

wer

e as

sign

ed a

s ei

ther

gai

n [

log(3

/2)

or

thre

shold

of

0.2

] or

loss

[lo

g(1

/2)

or

thre

shold

of

0.3

]. G

enom

ic i

mbal

ance

s w

ere

ass

igned

as e

ither

gai

n [

log(3

/2)

or

thre

shold

of

0.2

) o

r lo

ss [

log

(1/2

) o

r th

resh

old

of

0.3

], t

akin

g i

nto

acc

ou

nt

on

ly g

eno

mic

inte

rval

s >

10

0 k

b.

(Adap

ted f

rom

[30])

.

39

3.2.2 Karyotypes of Undifferentiated Sarcomas

Metaphase cells from the 3 tumours were characterized by G-banding (data not shown) and

SKY analysis. All of the tumours demonstrated both balanced and unbalanced structural

rearrangements. The karyotypes are given in Table 3.2.

Sample Karyotype

USTS16

a 46,XY,t(1;15)(q?31;q?21),t(2;6)(q?33;q12~q14),der(8)t(8;20)(p23;q11.2),t(11;20)(q12

~q13;p11.2),del(20)(q11.2)

USTS1b 47,XX,i(1)(q10),der(4)t(4;19)(q33~q35;q13.1),+8,t(15,17)(q24;p11.2~p12),der(19)t(19

;20)(q13.1;p11.2),der(22)t(20;22)(q13;q13)

USTS2 46,XY,del(4)(q33),der(6)t(6;19)(p21.1;p13.3),der(7)t(7;19)(q36;q13.2),der(13)t(6;13)(

p21.1;p11.2),del(19)(p13.3),der(19)t(6;19)(p21.3;q13.2)

SKY analysis confirmed that the 19q13 region was consistently rearranged in the PRC sarcoma

samples. In sample USTS1, chromosome 19 was involved in two unbalanced translocations, one

with chromosome 4 and one with chromosome 20 (Figure 3.3A). In sample USTS2, chromosome

19 was involved in unbalanced translocations with chromosome 6 and chromosome 7 (Figure

3.3B). In both cases it was subsequently necessary to determine which chromosome was the

binding partner for the region of interest on chromosome 19 (19q13.1).

Table 3.2. Summary of the undifferentiated sarcoma karyotypes according to G-banding and

spectral karyotyping analyses. a Cytogenetic analysis previously described by Selvarajah et al

[33]. b Cytogenetic analysis previously described by Somers et al [36].

40

3.2.3 Detailed Comparison of Cytogenetic Karyotyping and Array CGH

Comparison of the array CGH profiles to the SKY results showed that apparent balanced

rearrangements affecting chromosomes 1, 2, 6, 8, 11, 15, and 20 were associated with no

acquisition of genomic imbalances in USTS16 (data not shown). In contrast, both USTS1 and

USTS2 showed unbalanced translocations associated with genomic imbalances. Array CGH

profiling of USTS1 revealed loss and gain of the short and long arm of chromosome 1, respectively.

These genomic imbalances were consistent with the presence of i(1q), as previously described from

Figure 3.3. Representative spectral karyotyping (SKY) image of two USTS samples

showing multiple structural alterations in a diploid karyotype. (A) SKY and G-band

(data not shown) analysis of USTS1 showing a complex karyotype including

isochromosome of 1q, trisomy of chromosome 8 and rearrangements of chromosomes 4,

15, 17, 19, 20 and 22. (Adapted from [33]). (B) SKY analysis of USTS2 showing

chromosomal rearrangements on chromosomes 4, 6, 7, and 19. (Adapted from [30, 36]).

41

SKY analysis [34]. In addition, the extra copy of chromosome 8 was detected by both array CGH

and SKY analyses [88]. The derivative chromosomes, such as der(4)t(4;19)(q33~q35;q13.1),

der(19)t(19;20)(q13.1;p11.2), and der(22)t(20;22)(q13;q13) were also associated with gain and loss

of genomic regions (Figure 3.4A). For example, genomic gains of chromosome 20 and genomic

loss of chromosome 22q were likely generated through unbalanced translocation mechanisms

involving chromosomes 20 and 22, respectively. The der(19)t(19;20)(q13.1;p11.2) unbalanced

translocation was accompanied by a genomic loss on chromosome 19. USTS2 also showed

genomic imbalances consistent with unbalanced chromosomal rearrangements. Small genomic

losses on chromosome 4 were associated with a deletion at the 4q33 region (Figure 3.4B).

Genomic gain of chromosomal arm 6p was likely generated through mechanisms involving

unbalanced translocations. USTS2 exhibited the recurrent 19q13 region rearrangement, and the

der(19)t(6;19)(p21.3;q13.2) and der(7)t(7;19)(q36;q13.2) unbalanced translocation were

accompanied by a genomic loss and gain on chromosome 19q13 (Figure 3.4B). These recurrent

19q abnormalities may lead to haploinsufficiency or gene fusion events affecting one or more genes

mapping to either the deleted or gained interval, the translocation breakpoints, or flanking genomic

regions. Notably, array CGH analysis indicated at least two candidate genes, CIC and ERF,

potentially affected by aberrations on chromosome 19.

42

3.2.4 Four-colour FISH Validation of 19q Rearrangement and Detection of Translocation Binding

Partners

In order to further investigate the binding partners of the region of interest on chromosome

19, four-colour interphase FISH analysis was performed on corresponding formalin-fixed, paraffin-

embedded sections. This approach was also used to validate the genomic imbalances and to test the

sensitivity of the platform in detecting a small gain of interest in proximity to the microdeletion.

Copy number changes for the chromosomal 19q13 region frequently showing alterations were

recorded in 50 tumour cells for USTS1 and USTS2.

43

USTS1

As predicted by cytogenetic analysis, the USTS1 tumour cells did not exhibit co-

localization of the control probe (pink) with probe C (yellow). Instead, an extreme nuclear

separation of these two probes was apparent and concurrent with the detection of losses of probe A

and B (green and red signals, respectively), confirming the presence of the 19q abnormalities

described by array CGH and SKY (Figure 3.5A). The BAC probes hybridizing to the unaffected

chromosome 19 showed the normal pink, green, red and yellow co-localization pattern.

Subsequently, we sought to investigate the binding partners of the chromosomal 19q

rearrangement in USTS1. Based on hybridization in the control sample (data not shown) and

tumour samples, the detection of chromosomal 19q rearrangement was defined when the distance

between two signals was more than three times the estimated signal diameter [89]. Fusion FISH

strategy was used in the analysis of 19q rearrangement, with BAC DNA probes, producing two-

colour fusions on the derivative chromosome. To determine the translocation partner for USTS1,

the four-colour FISH approach consisted of BAC DNA probe sets proximal to the breakpoint

region involving the CIC gene on chromosome 19 (probe E, red signal), the region upstream of the

DUX4 gene on chromosome 4 (probe D, green signal), a control region on chromosome 19 (probe

A, pink signal), and region of interest on chromosome 20 (probe F, yellow signal). Co-localization

of probes for the CIC gene and the DUX4 gene confirmed the presence of a t(4;19) translocation

(Figure 3.5B). Lack of co-localization of the red and yellow signals indicates that CIC is not

involved in a rearrangement with chromosome 20p.

44

Figure 3.5. Four-colour FISH validation of 19q abnormalities and detection of

translocation binding partners in USTS1. (A) FISH probes used to validate genomic

imbalances in USTS1 (arbitrarily designated as probes A-C; see Materials and Methods for

details). Gene and BAC locations are taken from the UCSC Genome Browser (May 2004

assembly, Hg17; http://www.genome.ucsc.edu). (B) Representative FISH images

confirming deletions on chromosome 19 (green and red signals). Extreme nuclear

separation of the yellow and pink signals is indicative of a rearrangement involving 19q,

confirming array CGH and SKY results. BAC probes hybridizing to the unaffected

chromosome 19 showed the normal pink, green, red, and yellow colocalization pattern. (C)

FISH probes used to determine binding partner of 19q13 in USTS1 (arbitrarily designated

as probes D-F; see Materials and Methods for details). Gene and BAC locations are taken

from the UCSC Genome Browser (May 2004 assembly, Hg17;

http://www.genome.ucsc.edu). (D) FISH analysis identifies a t(4;19)(q35;q13.1)

translocation in USTS1. Probes for the CIC gene (probe E, red signal) showed

colocalization with a probe just upstream of DUX4 (probe D, green signal), confirming the

presence of the CIC-DUX4 fusion rearrangements. Lack of colocalization of the red and

yellow (probe F) signal indicates that CIC is not involved in a rearrangement with 20p. All

captured signals were converted to default false colours: Spectrum Orange to red, Spectrum

Red to pink, and Cy5 to yellow. (Adapted from [30]).

45

USTS2

As predicted by array CGH, the USTS2 tumour showed cell populations with extra copies

of probes A and B (green and red signals, respectively), ranging from three to four copies for the

chromosomal loci analyzed, and hemizygous loss of probe C (yellow signal) (Figure 3.6A). The

BAC probes hybridizing to the unaffected chromosome 19 showed the normal pink, green, red and

yellow co-localization pattern.

To determine the translocation partner for USTS2, four-colour FISH was performed using

probes for the CIC gene, a control region on chromosome 19, the region of interest on chromosome

6 and the region of interest on chromosome 7. Surprisingly, a lack of co-localization was seen for

chromosome 19 with either chromosome 6 or 7 (data not shown). To be thorough, the FISH assay

used to detect the t(4;19) translocation in USTS1 was applied to both USTS2 and USTS16. No co-

localization of the CIC and flanking DUX4 probes was seen in USTS16 (data not shown).

However, in USTS2 the CIC probe signal showed co-localization with the probe immediately

upstream of DUX4 (probe D, green signal), indicating that this sample harboured a similar t(4;19)

translocation (Figure 3.6B). The probe for DUX4 falls within a region of hemizygous deletion on

chromosome 4 (as detected by array CGH), thus co-localization is indicative of a fusion of the CIC

gene to the unaffected chromosome 4. Therefore, both USTS1 and USTS2 showed evidence of a

t(4;19) CIC-DUX4 fusion event by FISH analysis.

46

3.2.5 Detection of CIC-DUX4 Fusion Transcripts by RT-PCR

A t(4;19)(q35;q13.1) translocation had been previously identified in 4 samples with a

similar PRC morphology over the past 15 years [60, 70, 71]. One group determined that in 2 cases,

Figure 3.6. Four-colour FISH validation of 19q abnormalities and detection of

translocation binding partners in USTS2. (A) FISH probes used to validate genomic

imbalances in USTS2 (arbitrarily designated as probes A-C; see Materials and Methods for

details). Gene and BAC locations are taken from the UCSC Genome Browser (May 2004

assembly, Hg17; http://www.genome.ucsc.edu). (B) Representative FISH images showing

hemizygous loss of the yellow signal and gain of the red and green signals, confirming array

CGH results. BAC probes hybridizing to the unaffected chromosome 19 showed the normal

pink, green, red, and yellow colocalization pattern. (C) FISH probes used to validate binding

partner for 19q13 in USTS1, applied to USTS2 (arbitrarily designated as probes D-F; see

Materials and Methods for details). Gene and BAC locations are taken from the UCSC

Genome Browser (May 2004 assembly, Hg17; http://www.genome.ucsc.edu). (D) FISH

analysis identifies the presence of a t(4;19)(q35;q13.1) rearrangement in USTS2. The red

signal (probe E, CIC) showed extreme nuclear separation from the chromosome 19 control

probe signal (purple); however, this signal showed colocalization with a probe immediately

upstream of DUX4 (probe D, green signal). The probe for DUX4 falls within a region of

hemizygous deletion on chromosome 4; thus, colocalization is indicative of a fusion of the

CIC gene to the unaffected chromosome 4. All captured signals were converted to default

false colours: Spectrum Orange to red, Spectrum Red to pink, and Cy5 to yellow. (Adapted

from [30]).

47

this translocation resulted in the fusion of the CIC gene (19q13) to the C-terminal portion of the

DUX4 gene (4q35). The primer sequences and positive control were obtained from [60], and RT-

PCR was run with the CIC4120 and DUX4RTr2 primers. Of the three tumours analyzed by RT-

PCR, the two tumours with PRC morphology (USTS1 and USTS2) were confirmed to harbour a

CIC-DUX4 fusion transcript (Figure 3.7). One of the two positive CIC-DUX4 fusion products was

consistent in size with published findings; however the other tumour sample (USTS2) contained a

novel variant CIC-DUX4 fusion transcript. Automated DNA sequencing of gel-purified transcripts

from the positive control and USTS2 confirmed fusions of the CIC gene with the DUX4 gene.

Unfortunately USTS1 had insufficient material for sequence analysis. The positive control was

890bp, with exon 20 of the CIC gene fused to the end of exon 1 of the DUX4 gene. The novel

variant CIC-DUX4 transcript was 710bp, with a 50bp deletion in CIC and a 130bp deletion in

DUX4 relative to the positive control. Interestingly, the fusion gene event involving CIC and

DUX4 was detected by RT-PCR and FISH analysis in both PRC sarcoma cases, but by SKY

analysis in only one case.

Figure 3.7. Confirmation of the CIC-DUX4 fusion transcript in two of

the three USTS samples. RT-PCR products from the three cases (USTS16,

USTS1, and USTS2) were sized against a 100-bp ladder marker using

electrophoresis on a 1.5% agarose gel. Both USTS1 (lane 4) and USTS2

(lane 5) showed the presence of the CIC-DUX4 fusion transcript. The full-

length CIC-DUX4 cDNA was subcloned into a pGEM vector (Promega,

Madison, WI) and used as a positive control (lane 2). (Adapted from [30]).

48

3.2.6 Discussion of t(4;19) Pilot Project

This pilot project describes the results of a detailed analysis of the genomic aberrations in

three paediatric USTSs. Two of the tumours were of PRC morphology, and one tumour had

spindled, less cellular architecture with a focal hemangiopericytomatous-like pattern. All three

tumours had in common a lack of lineage-specific differentiation and no common translocations

associated with paediatric sarcomas. Both tumours with PRC morphology also exhibited CD99

positivity to varying degrees. Both tumours exhibiting the PRC phenotype with CD99

immunoreactivity harboured a recurrent rearrangement between 19q13 and 4q35, as detected by a

combination of SKY, array CGH, FISH and RT-PCR. As previously discussed, four cases of PRC

sarcomas have been previously reported that harbour similar rearrangements involving a

translocation between 19q13 and 4q35 [60, 70, 71]. One of these studies identified two tumours

with this rearrangement, and found that the fusion event between the CIC gene and the DUX4 gene

resulted in a protein with transforming properties. The human CIC gene codes for a protein that is

a member of the HMG-box superfamily of transcription factors [90]. The DUX4 gene is a double-

homeobox gene belonging to the family of double homeodomain transcriptional activators [91].

The biological roles of the CIC and DUX4 genes will be discussed in more detail in section 3.3.3.

As confirmed by RT-PCR, both t(4;19)-positive cases in this pilot project harboured the CIC-DUX4

fusion transcript. Notably, one tumour had a PCR product of a different size than those previously

published, suggesting a different site of rearrangement within one or both genes. Thus, both

tumours are members of the novel category of paediatric primitive sarcomas with a t(4;19)

rearrangement resulting in a CIC-DUX4 fusion transcript.

One particularly intriguing aspect of these analyses was that the original SKY analysis of

USTS2 did not reveal the presence of a t(4;19) rearrangement. By RT-PCR, and subsequent

49

sequence confirmation, a CIC-DUX4 fusion transcript was shown to be present in this sample.

Finer mapping of the genome of USTS2 did indeed confirm the presence of a t(4;19)(q35;q13)

rearrangement, together with a gain, followed by a loss of chromosome 19q at the CIC region. As

chromosomal breakage generally occurs as part of chromosomal rearrangements [92], it is probable

that a small region of CIC was fused to DUX4, and that the size of the chromosome 19 locus that

was translocated was below the threshold of detection for SKY analysis. Thus, the absence of

specific translocations by SKY analysis does not exclude the presence of cryptic or complex

rearrangements below the threshold of detection, and RT-PCR remains the method of choice for

confirming the presence of this fusion gene.

Ultimately, molecular and cytogenetic analyses identified two tumours with PRC

morphology that harbour a t(4;19)(q35;q13) translocation involving the CIC gene on chromosome

19 and the DUX4 gene on chromosome 4. One tumour of spindled and myxoid morphology was

not found to harbour this translocation. These initial findings suggest that the CIC-DUX4 fusion

events are limited to tumours with an undifferentiated round cell phenotype. More importantly, this

pilot study allowed us to optimize RT-PCR and FISH conditions that can be used to screen the

remaining USTSs in our cohort.

3.3 Screening of USTSs Cohort for CIC-DUX4 Translocation

The following data has been submitted for publication as follows: Graham et al., Pediatr Dev

Pathol, Submitted October 2010 [93].

In this portion of the study, an additional 19 tumours were screened for the CIC-DUX4

fusion transcript with an RT-PCR assay developed for both frozen and paraffin-based tissues. A

combination of primer sets were used which cover all known breakpoints of the CIC-DUX4 fusion

50

transcript. Some of the clinicopathological features of these tumours have been previously

described [32, 34, 72, 88]. Fifteen cases were composed of sheets of primitive round to plump

spindle cells, one case was composed of nests of PRCs; and three cases were composed of pure

spindle cells within a myxoid, collagenous or cellular fascicular background. CD99 showed

cytoplasmic positivity in three cases (USTS9-11) and membranous positivity in 3 cases (USTS3,

USTS20, USTS21). The ultrastructural features of USTS3, USTS4, USTS6-11 and USTS17 have

been previously described [34, 72]. For seven USTSs, tissue was not available for ultrastructural

analysis. The remaining 2 USTSs with PRC morphology showed sparse intracytoplasmic

neurosecretory granules, variable amounts of intracytoplasmic glycogen and cytoplasmic processes.

3.3.1 Detection of the CIC-DUX4 fusion by RT-PCR

Total RNA from ten fresh-frozen USTS samples and nine FFPE USTS samples were

subjected to RT-PCR analysis for the CIC-DUX4 fusion transcript. All samples were subjected to a

series of PCR reactions using a combination of CIC forward primers and DUX4 reverse primers

(Table 2.1). CIC4120 and DUX4RTr2 were selected as the optimal primer set for fresh-frozen

samples as they result in the largest PCR product, however all primer sets successfully amplified

specific PCR products (data not shown). CIC4F and DUX4-R2 were selected as the optimal primer

set for RNA extracted from FFPE tissue samples. Of the nineteen tumour samples analyzed by RT-

PCR, three were positive for the CIC-DUX4 fusion transcript (USTS3, USTS4 and USTS5) (Figure

3.8). All three CIC-DUX4 positive tumours had PRC morphology, and one tumour showed

membranous positivity for CD99. Two of the three positive CIC-DUX4 fusion products (USTS3,

USTS4) were consistent in size and sequence with previously published findings [60], whereas one

product (USTS5) contained a novel variant of the CIC-DUX4 fusion transcript.

51

Figure 3.8. Identification of the CIC-DUX4 fusion by reverse transcriptase-polymerase chain

reaction (RT-PCR) screening. (A) Total RNA samples from 10 fresh-frozen USTS tumours were

subjected to RT-PCR analysis for the CIC-DUX4 fusion transcript using the CIC4120 forward and

DUX4RTr2 reverse primers. These products were sized against a 100-bp ladder marker using

electrophoresis on a 1.5% agarose gel. USTS5 (lane 3) showed the presence of the CIC-DUX4 fusion

transcript. The full-length CIC-DUX4 cDNA was subcloned into a pGEM vector (Promega, Madison,

WI) and used as a positive control (lane 2). (B) Total RNA from 9 formalin fixed paraffin-embedded

USTS tumours was subjected to RT-PCR analysis for the CIC-DUX4 fusion transcript using the CIC4

forward and DUX4-R2 reverse primers. These products were sized against a 100-bp ladder marker

using electrophoresis on a 1.5% agarose gel. USTS3 (lane 3) and USTS4 (lane 4) showed the presence

of the CIC-DUX4 fusion transcript. The full-length CIC-DUX4 cDNA was subcloned into a pGEM

vector (Promega, Madison, WI) and used as a positive control (lane 2). (Adapted from [93]).

52

3.3.2 DNA Sequence Analysis of the CIC-DUX4 Positive PCR Products

Gel-purified transcripts from the positive control, USTS3, USTS4 and USTS5 were directly

sequenced using automated DNA sequencing. Sequencing confirmed fusions of the CIC gene with

the DUX4 gene in all three cases, as well as the positive control. Using the CIC4120 forward

primer and DUX4rtR2 reverse primer, the positive control was 890bp. Using the same primer set

USTS5 was 821bp, with an additional 10bp in CIC and a 75bp deletion in DUX4 relative to the

positive control.

Using the CIC4F forward primer and the DUX4-R2 reverse primer, the positive control was

170bp. USTS3 and USTS4 shared the same breakpoint as the positive control. In all cases, exon

20 of the CIC gene was fused to exon 1 of the DUX4 gene. Combining the sequencing analysis of

these three additional cases with the sequencing analysis from the pilot project resulted in the

identification of three breakpoints associated with the CIC-DUX4 fusion transcript in our cohort

(Figure 3.9). Based on the sequence analysis results, the amino acid structure of the fusion

proteins was predicted (Figure 3.10). This results in an in-frame fusion between CIC and DUX4

with the CIC open reading frame. This fusion leaves intact virtually all functional regions of the

CIC gene, including the DNA-binding high-mobility group (HMG)-box and fifteen of sixteen

putative MAPK phosphorylation sites [94-98]. On the other hand, this translocation results in the

loss of the majority of the DUX4 functional regions, including both DNA-binding homeodomains

(Figure 3.11).

53

CIC mRNA Exons 17-20

...GGTCCTGTCAGAAGTGGACTTCGAAGAGCGCTTTGCTGAGTTGCCTGAGTTTCGGCCTGAGGAGGTGCTGCCCTCCCCCACCCTGCAG

CIC4120

TCTCTGGCCACCTCACCCCGGGCCATCCTGGGCTCTTACCGCAAGAAGAGGAAGAACTCCACGGACCTGGATTCAGCACCCGAGGACCCCA

CCTCGCCCAAGCGCAAGATGAGAAGACGCTCCAGCTGCAGCTCGGAGCCCAACACCCCCAAGAGTGCCAAGTGCGAGGGGGACATCTTCAC

CTTTGACCGTACAGGTACAGAAGCCGAGGACGTGCTTGGGGAGCTAGAGTATGACAAGGTGCCATACTCCTCCCTGCGGCGCACCCTGGAC

CAGCGCCGGGCCCTGGTCATGCAGCTCTTTCAGGACCATGGCTTCTTCCCGTCAGCCCAGGCCACAGCCGCCTTCCAGGCCCGCTATGCAG

ACATCTTTCCCTCCAAGGTTTGTCTGCAGTTGAAGATCCGTGAGGTGCGCCAGAAGATCATGCAGGCTGCCACTCCCACGGAGCAGCCCCC

CIC2F

TGGAGCTGAGGCTCCTCTCCCTGTACCGCCCCCCACTGGCACCGCTGCTGCCCCTGCCCCCACTCCCAGCCCCGCAGGGGGCCCTGACCCC

CIC3F CIC4F

ACCTCACCCAGCTCGGACTCTGGCACGGCCCAGGCTGCCCCGCCACTGCCTCCACCCCCAGAGTCGGGGCCTGGACAGCCTGGCTGGGAGG

GGGCTCCCCAGCCCTCCCCCCCACCCCCAGGTCCCTCCACAGCTGCCACAGGCAGGTGA

- Positive Control, USTS3 & USTS4 breakpoint

- USTS1 breakpoint

- USTS5 breakpoint

DUX4 mRNA

...CCGCCCCCGCGCTGCAGCCCAGCCAGGCCGCGCCGGCAGAGGGGGTCTCCCAACCTGCCCCGGCGCGCGGGGATTTCGCCTACGCCGC

CCCGGCTCCTCCGGACGGGGCGCTCTCCCACCCTCAGGCTCCTCGGTGGCCTCCGCACCCGGGCAAAAGCCGGGAGGACCGGGACCCGCAG

CGCGACGGCCTGCCGGGCCCCTGCGCGGTGGCACAGCCTGGGCCCGCTCAAGCGGGGCCGCAGGGCCAAGGGGTGCTTGCGCCACCCACGT

CCCAGGGGAGTCCGTGGTGGGGCTGGGGCCGGGGTCCCCAGGTCGCCGGGGCGGCGTGGGAACCCCAAGCCGGGGCAGCTCCACCTCCCCA

GCCCGCGCCCCCGGACGCCTCCGCCTCCGCGCGGCAGGGGCAGATGCAAGGCATCCCGGCGCCCTCCCAGGCGCTCCAGGAGCCGGCGCCC

DUX4R2

TGGTCTGCACTCCCCTGCGGCCTGCTGCTGGATGAGCTCCTGGCGAGCCCGGAGTTTCTGCAGCAGGCGCAACCTCTCCTAGAAACGGAGG

CCCCGGGGGAGCTGGAGGCCTCGGAAGAGGCCGCCTCGCTGGAAGCACCCCTCAGCGAGGAAGAATACCGGGCTCTGCTGGAGGAGCTTTA

DUX4RTr2

G

- Positive Control, USTS3 & USTS4 breakpoint

- USTS1 breakpoint

- USTS5 breakpoint

Figure 3.9 Nucleotide sequences of CIC and DUX4 sequences denoting primer sequences and translocation

breakpoints. (Top) Partial nucleotide sequence of the CIC gene, with CIC forward primer sequences underlined,

and different translocation breakpoints denoted by coloured arrows. (Bottom) Partial nucleotide sequence of the

DUX4 gene, with DUX4 reverse primer sequences underline, and different translocation breakpoints denoted by

coloured triangles. Arrows represent direction in which the genes are fused to one another. (Adapted from [93]).

54

USTS1 Fusion

... GCC ACT CCC ACG GAG CAG CCC CCT GGA GCT GAG GCT CCT CTC CCT GTA CCG CCC CCC

A A T P T E Q P P G A E A P L P V P P P

ACT GGC ACC GCT GCT GCC CCT GCC CCC CTC CCC TGC GGC CTG CTG CTG GAT GAG CTC CTG

T G T A A A P A P L P C G L L L D E L L

GCG AGC CCG GAG TTT CTG CAG CAG GCG CAA CCT CTC CTA GAA ACG GAG GCC CCG GGG GAG

A S P E F L Q Q A Q P L L E T E A P G E

CTG GAG GCC TCG GAA GAG GCC GCC TCG CTG GAA GCA CCC CTC AGC GAG GAA GAA TAC CGG

L E A S E E A A S L E A P L S E E E Y R

GCT CTG CTG GAG GAG CTT TAG

A L L E E L *

Positive Control, USTS3 and USTS4 Fusion



ACT GGC ACC GCT GCT GCC CCT GCC CCC ACT CCC AGC CCC GCA GGG GGC CCT GAC CCC ACC

T G T A A A P A P T P S P A G G P D P T

TCA CCC AGC TCG GAC TCT GGG GGT GGA CCC CAA GCC GGG GCA GCT CCA CCT CCC CAG CCC

S P S S D S G G G P Q A G A A P P P Q P

GCG CCC CCG GAC GCC TCC GCC TCC GCG CGG CAG GGG CAG ATG CAA GGC ATC CCG GCG CCC

A P P D A S A S A R Q G Q M Q G I P A P

TCC CAG GCG CTC CAG GAG CCG GCG CCC TGG TCT GCA CTC CCC TGC GGC CTG CTG CTG GAT

S Q A L Q E P A P W S A L P C G L L L D

GAG CTC CTG GCG AGC CCG GAG TTT CTG CAG CAG GCG CAA CCT CTC CTA GAA ACG GAG GCC

E L L A S P E F L Q Q A Q P L L E T E A

CCG GGG GAG CTG GAG GCC TCG GAA GAG GCC GCC TCG CTG GAA GCA CCC CTC AGC GAG GAA

P G E L E A S E E A A S L E A P L S E E

GAA TAC CGG GCT CTG CTG GAG GAG CTT TAG

E Y R A L L E E L *

USTS5 Fusion



ACT GGC ACC GCT GCT GCC CCT GCC CCC ACT CCC AGC CCC GCA GGG GGC CCT GAC CCC ACC

T G T A A A P A P T P S P A G G P D P T

TCA CCC AGC TCG GAC TCT GGC ACG GCC CAG CAA GGC ATC CCG GCG CCC TCC CAG GCG CTC

S P S S D S G T A Q Q G I P A P S Q A L

CAG GAG CCG GCG CCC TGG TCT GCA CTC CCC TGC GGC CTG CTG CTG GAT GAG CTC CTG GCG

Q E P A P W S A L P C G L L L D E L L A

AGC CCG GAG TTT CTG CAG CAG GCG CAA CCT CTC CTA GAA ACG GAG GCC CCG GGG GAG CTG

S P E F L Q Q A Q P L L E T E A P G E L

GAG GCC TCG GAA GAG GCC GCC TCG CTG GAA GCA CCC CTC AGC GAG GAA GAA TAC CGG GCT

E A S E E A A S L E A P L S E E E Y R A

CTG CTG GAG GAG CTT TAG

L L E E L *

Figure 3.10. Nucleotide and predicted amino acid sequences of the different CIC-DUX4 fusions. (Top) In-frame

fusion between CIC and DUX4 in USTS1. (Middle) In-frame fusion between CIC and DUX4 in the positive control,

USTS3 and USTS4. 2 additional glycine residues are present at the fusion point which do not natively belong to

either CIC or DUX4. (Bottom) In-frame fusion between CIC and DUX4 in USTS5. Legend: CIC nucleotide sequence,

DUX4 nucleotide sequence, nucleotide sequence not belonging to CIC or DUX4, CIC amino acid sequence, DUX4

amino acid sequence, amino acid sequence not belonging to CIC or DUX4.

55

3.3.3 Discussion of Overall Screening of USTS cohort of CIC-DUX4 Fusion Transcript

Paediatric USTSs are a very poorly understood group of tumours on which very little data

had been published. Recently however, several groups have identified a particular variant of USTS

having a PRC phenotype, and harbouring recurrent translocations involving chromosomes 4q35 and

19q13 [31, 60, 70, 71]. One such study determined that in two cases, this rearrangement resulted in

the fusion of the CIC gene on chromosome 19q13.1 and the DUX4 gene on chromosome 4q35 [60].

Characterization of the genes involved in the t(4;19)(q35;q13.1) translocation has provided the

necessary information to develop an RT-PCR approach for the detection of the fusion transcript

resulting from this translocation. As such, the aims of this study were two-fold: (i) to identify

Figure 3.11. Predicted protein structure of CIC-DUX4 fusion protein variants.

Predicted structure of the CIC–DUX4 protein is based on the results of sequence analysis and

protein prediction. This fusion leaves intact most of the functional regions of the CIC gene,

including the DNA-binding high-mobility group-box and the majority of the putative MAPK

phosphorylation sites. However, this translocation results in the loss of the majority of the

DUX4 functional regions, including both DNA-binding homeodomains. (Adapted from [93]).

56

primer sets and RT-PCR conditions that could be used to effectively detect the CIC-DUX4 fusion

transcript on RNA extracted from both fresh-frozen and FFPE tissue and (ii) to screen a large

cohort of paediatric USTSs for the presence of this rearrangement.

Overall our institution has screened 22 USTS tumours and identified 5 of 18 (28%) tumours

with PRC morphology that harbour the CIC-DUX4 fusion transcript. Three of these 5 tumours

show CD99 positivity. Given that CIC-DUX4-positive tumours have been previously described as

‘atypical Ewing sarcomas’ [31, 88] or ‘Ewing-like sarcomas’ [60], the distinction of ESs and CIC-

DUX4-positive tumours on morphological and immunohistochemical grounds appears to be

challenging. Both exhibit some degree of CD99 positivity and ultrastructural evidence of neural-

type differentiation. However, previous reports have highlighted that the PRC sarcomas appear less

differentiated, with some lacking evidence of neural differentiation [31, 71] and an overall

inconsistent CD99 staining pattern [31, 60, 71]. Nevertheless, definitive diagnosis rests upon

cytogenetic and molecular diagnostic assays for identification of this specific rearrangement.

Interestingly, 4 distinct breakpoints were identified in the 5 positive cases (Figure 3.9),

though we were unable to determine the specific breakpoint in USTS1 due to insufficient material

for sequencing. The breakpoint is variable in both CIC and DUX4, suggesting that the structure of

both of these genes may facilitate multiple rearrangements with different genomic sequences [99].

In all cases, however, exon 20 of the CIC gene was fused to exon 1 of the DUX4 gene, leaving

intact the functional regions of CIC, but resulting in a loss of all DNA-binding portions of DUX4

(Figure 3.10). This suggests that while a specific nucleotide breakpoint may not be necessary, the

fusion between specific higher-level functional domains may be of particular importance.

57

The human CIC gene is an ortholog of the Drosophila capicua gene, and is a member of the

HMG-box superfamily of transcription factors [90]. This gene has 20 exons encoding a protein of

1608 amino acids [100], and contains an N-terminal DNA-binding HMG-box and sixteen possible

MAPK phosphorylation sites [94-98, 101]. Expression of CIC is predominantly limited to

immature granule cells in the central nervous system, and is important for their development [90,

101]. CIC has been shown to be involved in mediating two oncogenic signaling pathways, EGFR

and Wnt, by transcriptional repression. In cancer, CIC has been shown to be differentially

expressed in medulloblastomas [101], and has been shown to play a role in the pathogenesis of

PRC sarcomas [31, 60]. Furthermore, CIC has been shown to play a role in spinocerebellar ataxia

type 1 (SCA1) through association of the CIC protein with ATAXIN-1 (ATXN-1) [102, 103].

Binding of ATXN-1 to the N-terminal region of CIC alters the repressional activity of the CIC

protein [103].

The DUX4 gene is a double-homeobox gene belonging to the family of double

homeodomain transcriptional activators [91]. DUX4 is located within the tandem repeat locus

D4Z4 on chromosome 4 [104], and contains two DNA-binding homeoboxes at its N-terminus [91].

DUX4 has been implicated in the pathogenesis of facioscapulohumeral muscular dystrophy, in

which the D4Z4 copy number is significantly reduced [104, 105], and in ERMS where DUX4 is

involved in a chromosomal rearrangement with the EWSR1 gene [106]. Under normal conditions

DUX4 is believed to be involved in myogenic differentiation and cell-cycle control [106], and over-

expression of this gene in cultured primary myoblasts has been shown to induce caspase activity

and promote cell death [107].

58

The CIC-DUX4 fusion preserves the majority of the CIC gene, including the DNA-binding

HMG-box and the majority of the MAPK phosphorylation sites, but both DUX4 homeobox

domains were lost. Preliminary evidence suggests that the chimeric protein has transforming

properties, and acts as a strong transcriptional activator of downstream CIC targets, with CIC-

DUX4 showing 130-fold enhancement of transcriptional activity against wild-type CIC [60, 108].

Nonetheless, full characterization of the CIC-DUX4 fusion product will be required to fully

understand the functional role that this rearrangement plays in oncogenesis.

The CIC-DUX4-positive cohort forms a distinct subcategory of tumours with PRC

morphology and variable membranous CD99 positivity. Of note are another 17 cases of USTS

lacking the CIC-DUX4 transcript by RT-PCR. Such tumours may harbour variants of the t(4;19)

rearrangement not detected in the current analysis, or may represent a different category of

primitive sarcomas altogether.

3.4 Discovery of Novel Markers for CIC-DUX4-Negative Tumours

The t(4;19) pilot project and screening portions of this project allowed for the identification

of 5 novel cases of paediatric PRC sarcomas that harbour the CIC-DUX4 fusion transcript.

Furthermore, this screening allowed for the identification of a cohort of CIC-DUX4-negative

USTSs with no established genetic markers. As such we sought to characterize novel genomic

aberrations that may play a role in the deregulation of gene expression in these CIC-DUX4-negative

tumours. Recent developments in microarray technologies have revolutionized the way in which

we study the relationships between genomic changes and gene expression in cancer cells, allowing

us to study this relationship on a whole genome scale [81]. Identifying changes in gene expression,

as well as the mechanisms responsible for these changes, serves to significantly enhance our

59

understanding of oncogenesis [109]. Moreover, the integration of platforms that identify genomic

and transcriptomic aberrations will aid in determining genetic variations that control the

development and progression of cancer. As such, in order to study the effects of genomic

mechanisms on sarcoma-related gene networks, we used an integrative approach for genome-wide

profiling of genomic and gene expression changes. To do so we have tested DNA and RNA from

five CIC-DUX4-negative tumours on the Affymetrix Human SNP 6.0 microarray and the

Affymetrix Gene 1.0 ST expression microarray, respectively.

3.4.1 DNA Copy Number Analysis

In order to identify genome-wide copy number changes in each tumour sample, high

molecular weight genomic DNA was sent to The Centre for Applied Genomics (TCAG) microarray

facility to be run on the Affymetrix Genome-Wide Human SNP Array 6.0 microarray platform

(Affymetrix Inc., Santa Clara, CA, USA). Copy number analysis was conducted in 5 primary

tumour samples – USTS12, USTS13, USTS14, USTS15 and USTS19. Copy number analysis was

performed using the PGS software, using the publicly available normal control SNP data from the

HapMap consortium [79] as the reference for determining copy number alterations. While copy

number analysis revealed gross heterogeneity between the individual tumour samples (Figure 3.12),

numerous smaller regions could be identified which showed similar patterns of aberration across

the majority of the tumour samples. Tumour specific copy number changes ranged between 250

(USTS13) and 10019 (USTS12). In all tumours there was more copy number gain relative to loss.

A cumulative gene list was created that contained the genes showing statistically significant copy

number changes (P ≤ 0.05) in at least 3 of the 5 tumour samples.

60

3.4.2 Gene Expression Analysis

To characterize the expression profiles of these tumours, total RNA from each tumour as

well as two adult human dermal fibroblast samples was sent to the TCAG microarray facility to be

run on the Affymetrix Gene 1.0 ST microarray platform (Affymetrix Inc, Santa Clara, CA, USA).

The RNA extracted from USTS15 was not of high enough quality to be used for the expression

microarray analysis. As such the tumour-specific integration analysis, cumulative expression

microarray analysis and cumulative integration analysis did not include USTS15. Expression

Figure 3.12. Overall genomic profiling view from Partek Genomic Suite (PGS). High

molecular weight DNA from five t(4;19)(q35;q13.2)-negative USTS tumour samples were

tested on the genome-wide human SNP 6.0 microarray platform , and results were analyzed

using PGS to identify tumour-specific and cumulative copy number alterations. Copy number

analysis was normalized against the publicly available data collected by the International

HapMap consortium. A cumulative gene list was created that contained genes showing

statistically significant copy number changes (P ≤ 0.05) in at least 3 of the 5 tumour samples.

The x-axis contains chromosomes 1-22, X and Y in order.

61

analysis was performed using the PGS software, using human fibroblasts as reference RNA

samples for determining differential expression. Clustering analysis showed distinct clustering of

tumours relative to the normal human fibroblast samples (Figure 3.13). Tumour specific changes

ranged between 726 genes in USTS14 and 1370 genes in USTS13. An overall cumulative analysis

was performed, which compared the cumulative expression profiles of all tumours to the combined

profiles of the normal fibroblasts. This analysis produced a gene list with genes showing at least

two-fold, statistically significant (P ≤ 0.05) differential gene expression, cumulatively across the

tumours.

Figure 3.13. Clustering analysis shows distinct clustering of tumours relative to normal

human fibroblast samples. (A) Hierarchical clustering using average Euclidean cluster analysis

on 4 tumours tested on the Gene 1.0 ST expression microarray platform. Tumour samples (blue)

clustering separately from the normal fibroblast control expression profiles. (B) Principal

component analysis (PCA) of individual tumour samples and normal human fibroblast control

samples from the expression analysis. The PCA tool in PGS subjects the raw expression data to

3-dimensional clustering analysis. The PCA-generated ellipsoids denote separate clustering

between normal and tumour samples in the gene expression analysis.

62

3.4.3 Integration of Copy Number and Gene Expression Analyses

Tumour-Specific Integration

Microarray and statistical analyses using PGS software allowed for the identification of

genomic and transcriptional alterations in individual tumour samples. Genes with significant

changes (P ≤ 0.05) in copy number and gene expression in individual tumour samples were

analyzed using the PGS-Venn analysis tool. Integration of these data revealed gene-specific

overlap in each tumour (Figure 3.14A). The number of overlapping genes in each tumour sample

varied from 9 genes in USTS13 to 225 genes in USTS12. In order to characterize the correlation

between gene-specific changes in the Venn analysis intersects of each tumour, the genes were

plotted based on copy number and gene expression status (Figure 3.14B). This analysis clearly

showed the strongest correlation between copy number gain and over-expression.

Figure 3.14. Tumour-specific integration of copy number and gene expression analyses. (A)

Integration of expression and copy number data in individual USTS tumours. Genes with significant

(P≤0.05) changes in copy number relative to the HapMap and gene expression relative to normal

human fibroblasts in individual tumours were analyzed using the Partek Genomics Suite Venn

Analysis tool. (B) Distribution of significant tumour-specific changes. The line graph represents the

percentage of genes (y-axis) identified with specific genomic and transcriptomic changes (x-axis).

63

Cumulative Integration

Microarray and statistical analyses using the PGS software also allowed for the

identification of genomic and transcriptional alterations across all tumours. This analysis allowed

us to identify the most significant recurrent changes in USTSs as a group. Parameters for this

analysis were as follows: (1) Copy number changes present in at least 3 samples with P ≤ 0.05; and

(2) Transcriptional changes at a magnitude of at least 2-fold over- or under-expression with P ≤

0.05 cumulatively across all tumour samples relative to normal fibroblasts. Based on these

parameters, 438 genes were identified in the cumulative copy number gene analysis, and 2296

genes were identified in the gene expression analysis. Integration of these analyses was performed

using the PGS-Venn analysis tool, and resulted in the identification of 51 genes with overlapping

copy number and expression level changes (Figure 3.15A). These genes were plotted based on

copy number and gene expression status to characterize the correlation between gene-specific

changes in the Venn analysis intersect. This analysis displayed a strong correlation between copy

number gain and over-expression (66.7%), and copy number loss and under-expression (23.8%)

(Figure 3.15B).

Figure 3.15. Cumulative analysis of gene-specific changes in copy number and gene expression in

undifferentiated sarcoma. (A) Integration of cumulative copy number and expression analyses in USTS

tumours. Genes with significant (P≤0.05) changes in copy number relative to the HapMap, and gene expression

relative to normal human fibroblasts across all tumours were analyzed using the PGS Venn analysis tool. (B)

Distribution of significant overall genomic changes in undifferentiated sarcoma. The pie chart represents the

percentage of genes identified with specific genomic and transcriptomic changes.

64

3.4.4 Gene Network Analysis

In order to identify USTS-related gene networks, Ingenuity Pathway Analysis (IPA) of the

genes identified in the cumulative copy number and cumulative expression analyses was performed.

This analysis allowed us to identify significant biological pathways that have been disrupted at the

genomic and transcriptomic levels, and to identify genes of interest within those pathways. We

compared the most significantly affected biological functions from copy number analysis to the

most significantly affected biological functions from gene expression analysis (Figure 3.16). All

tumours showed disruption in copy number and gene expression in similar biological functions.

Overall the biological functions that were most significantly affected by the changes in copy

number and gene expression were cell death, cell movement, cellular growth and proliferation, cell

development, cell cycle, and cellular assembly and organization.

Figure 3.16. Biological functions significantly affected by changes in copy

number and gene expression. The top six most significantly affected biological

functions in individual USTS tumours detected using the Ingenuity Pathway

Analysis (IPA) Comparative Analysis tool. IPA analysis of copy number (dark

blue bars) and gene expression (light blue bars) . Yellow line is indicative of P-

value threshold of 0.05.

65

3.4.5 Identification of Candidate Genes

By combining the results from the PGS-based integrative analysis with the IPA network

analysis, a list of candidate genes were selected. This selection process focused on genes which

were either gained and over-expressed or deleted and under-expressed, and are directly involved in

the biological pathways identified in the network analysis. The candidate genes identified are

presented in Table 3.3.

Gene Genomic

Region

Copy

Number

Cumulative

Expression

Brief Description

EPHA3 3p11.2 Gain 12.11

· Transmembrane protein belonging to the Eph

receptor tyrosine kinase family.

· Eph receptors and ephrin ligands known to play

an important role in many biological functions

including axon guidance, cell migration,

angiogenesis and cytoskeletal regulation.

SNX10 7p15.2 Gain 2.3

· Member of sorting nexin family which play a role

in cellular functions including intracellular protein

trafficking, endocytosis and cell-to-cell signaling.

· May control endosome homeostasis.

ADAM9 8p11.22 Deletion -6.83

· Member of the ADAM (a disintegrin and

metalloprotease domain) family which have a role

in ectodomain shedding of membrane-bound

molecules.

· Known to cleave and release many molecules

associated with tumourigenesis and angiogenesis.

CDC73 1q25 Deletion -2.08

· Tumour suppressor involved in transcriptional

control pathways.

· Part of the PAF1 transcriptional regulatory

complex.

· Loss of function results in increased cell

proliferation and over-expression of proto-

oncogenes.

Table 3.3. Candidate genes with USTS-specific changes identified by integrative copy number

array, expression array, and gene network analysis.

66

3.4.6 Gene Expression Validation of Candidate Genes

Integrative microarray analysis identified significant genomic and transcriptomic changes in

the following genes: (1) a gain in chromosome 3p11.2 and over-expression of EPHA3, (2) a gain in

chromosome 7p15.2 and over-expression of SNX10, (3) a loss in chromosome 8p11.22 and under-

expression of ADAM9, and (4) a loss in chromosome 1q25 and under-expression of CDC73. To

validate the results of the expression analysis (Figure 3.17A), quantitative PCR was performed in

all 5 tumours (Figure 3.17B). Over-expression of the EPHA3 gene was found in 4/5 tumours, with

an average of 11.3-fold over-expression relative to fibroblasts. Cumulative microarray expression

analysis found a 12.1-fold over-expression of the EPHA3 gene relative to fibroblasts. Over-

expression of the SNX10 gene was found in 3/5 tumours, with an average of 3.7-fold over-

expression relative to fibroblasts. Cumulative microarray analysis found 2.3-fold over-expression

of the SNX10 gene relative to fibroblasts. Under-expression of the ADAM9 gene was found in 5/5

tumours, with an average of 27.7-fold under-expression relative to fibroblasts. Cumulative

microarray analysis found 6.8-fold under-expression of the ADAM9 gene relative to fibroblasts.

Under-expression of the CDC73 gene was found in 5/5 tumours, with an average of 2.1-fold under-

expression relative to fibroblasts. Cumulative microarray analysis found 2.1-fold under-expression

of the CDC73 gene relative to fibroblasts.

67

Figure 3.17. Expression analysis of candidate genes. (A) PGS dot plot visualization of

candidate genes from expression microarray analysis. Significant over-expression of EPHA3 and

SNX10 (top), and under-expression of ADAM9 and CDC73 (bottom) were seen cumulatively

across the four samples tested. The y-axis represents the raw lo2 signal intensity for each probe

set. (B) Quantitative RT-PCR validation of candidate gene expression. The y-axis represents fold

enrichment values generated using the comparative CT method. TATA-box binding protein

(TBP) was used as the endogenous housekeeping reference gene and human adult fibroblast

samples were used as the control samples. Each qPCR reaction was performed in triplicate and

average values were used for enrichment calculations.

68

3.4.7 Copy Number Validation of EPHA3

Quantitative PCR confirmed over-expression of the EPHA3 gene in 4/5 USTS cases.

Ephrins and ephrin receptors have been found to play a role in the development of numerous types

of cancer [110-112]. Due to the high correlation between copy-number gain (3 copies; Figure

3.18A) and over-expression, subsequent qPCR validation, and the prominent role that EPHA3 has

been found to play in cellular development, we decided to further validate this potential biomarker

in our samples. We performed validation of the copy number status of EPHA3 by performing dual-

colour FISH on all five tumour samples using a commercial Spectrum-Orange labeled centromere 3

probe as a control. In order to confirm locus specificity and optimize the EPHA3-specific probe,

FISH was first performed on normal metaphase spreads and on FFPE normal tonsil tissue (Figure

3.18B and C). The expected low-level gain of EPHA3 (3 copies) was not identified in any of the 5

tumours samples. Instead, 2 copies of both the control probe (orange probe) and test probe (green

probe) were seen in the majority of the tumour cells (Figure 3.18D and E).

3.4.8 Protein Validation of EPHA3

Immunohistochemistry was performed on FFPE sections from all 5 tumours in order to

determine the protein expression patterns of the EPHA3 gene in these tumours. The EPHA3

antibody (Abnova) was applied to USTS12-15 and USTS19 using methodology described

previously. USTS12 and USTS13 showed weak, consistent cytoplasmic staining in <10% of cells;

the remaining 3 tumours showed no staining (Figure 3.19). All 5 tumours were thus scored as

negative.

69

Fig

ure

3.1

8.

FIS

H v

ali

da

tion

of

cop

y n

um

ber

ga

in o

f E

PH

A3

. (

A)

PG

S v

isu

aliz

atio

n o

f th

e h

eat

map

an

d p

rofi

le v

iew

of

5 t

um

ou

r

sam

ple

s sh

ow

ing

a s

ign

ific

ant

gai

n o

f a

29

-kb

reg

ion

in

all

5 t

um

ou

r sa

mp

les.

T

his

reg

ion i

s h

igh

lig

hte

d i

nsi

de

the

red

box

. (

B-D

)

Val

idat

ion

of

the

cop

y n

um

ber

sta

tus

of

EP

HA

3 w

as p

erfo

rmed

usi

ng

du

al-c

olo

ur

inte

rph

ase

FIS

H.

(B

) L

ocu

s sp

ecif

icit

y o

f th

e E

PH

A3

-

spec

ific

BA

C p

rob

e (R

P1

1-2

3D

18

) w

as p

erfo

rmed

on

no

rmal

met

aph

ase

spre

ads.

(C

) O

pti

miz

atio

n o

f th

e E

PH

A3

-sp

ecif

ic p

rob

e w

as

per

form

ed o

n F

FP

E n

orm

al t

on

sil

tiss

ue.

(D

) R

epre

sen

tati

ve

imag

e o

f d

ual

-co

lou

r in

terp

has

e F

ISH

an

aly

sis

of

EP

HA

3 i

n U

ST

S14

. F

ISH

was

per

form

ed u

sin

g t

he

Sp

ectr

um

-Gre

en l

abel

ed E

PH

A3

-sp

ecif

ic p

rob

e an

d a

co

mm

erci

al S

pec

tru

m-O

ran

ge

lab

eled

cen

tro

mer

e 3

pro

be

as

a co

ntr

ol.

L

ow

-lev

el g

ain

of

EP

HA

3 w

as n

ot

con

firm

ed i

n a

ny

of

the

tum

ou

r sa

mp

les,

as

2 c

op

ies

of

bo

th t

he

con

tro

l p

rob

e an

d t

he

test

pro

be

wer

e se

en i

n t

he

maj

ori

ty o

f th

e tu

mo

ur

cell

s.

70

Figure 3.19. Immunohistochemical staining for EPHA3. Positive control tissue

included breast, both normal (A) and cancer (B). The photomicrographs show strong

cytoplasmic positivity for EPHA3 in the normal glandular epithelium (A), as well as

moderate to weak cytoplasmic positivity in ductal carcinoma and infiltrating histiocytes

(B). The negative controls show negativity in the same tissues, as well as focally positive

inflammatory cells that stain for endogenous peroxidase (C, D). Representative sections

of two tumours (E, F; USTS12 & USTS13) show very focal, weak and inconsistent

cytoplasmic staining, scored as negative. Scale bar = 100 µm.

71

3.4.9 Discussion of Discovery of Novel Markers for CIC-DUX4-Negative Tumours

As previously described, paediatric USTS remains a diagnosis of exclusion, despite the

advancement of highly specific and sensitive diagnostic applications. The similarities that these

tumours share with other sarcoma subtypes (e.g. Ewing Sarcoma) has evoked much debate as to

whether this group comprises tumours with common histological and biological features

representing a distinct subgroup of sarcomas or if these tumours have simply not been accurately

sub-classified into known sarcoma subgroups [9]. Recent studies on the CIC-DUX4 fusion

transcript have suggested the former – that there are distinct morphological and molecular

subgroups which can be identified from within the USTS family [31, 60]. However, identification

of the subgroup of USTSs harbouring the CIC-DUX4 fusion transcript left a group of USTSs

devoid of any known molecular identifiers. As such, the final studies in this thesis project sought to

identify novel genomic markers in a series of CIC-DUX4-negative USTS tumours using an

integrative genomic and transcriptomic analysis. Using bioinformatic whole-genome approaches,

the genomic and transcriptomic profiles of 5 USTS tumours were analyzed, and a series of

biological networks were identified that may play a role in sarcomagenesis. While we did find that

there was gross heterogeneity between the tumours at both the DNA and RNA levels, significant

changes were identified that were present in the majority of the tumours studied. Cumulative

analysis of the tumours found that there was a correlation between copy number gain and

overexpression, as well as copy number loss and underexpression, with more than 90% of the genes

identified following these patterns. However, these molecular correlations are present in a

relatively small proportion of the genes identified in any given tumour sample. Many of the genes

found to have changes at the copy number level, did not exhibit changes at the expression level and

vice-versa. This can be partially explained as a result of other biological phenomena present in the

cellular environment such as the actions of regulatory elements and epigenetic mechanisms such as

72

DNA methylation [113, 114]. However, by identifying genes which do in fact show a

genomic/transcriptomic correlation, and investigating the biological pathways in which these genes

are involved, it may be possible to identify specific genes that are responsible for driving USTS

tumourigenesis.

Integrative analysis of changes in copy-number, gene expression, and USTS-related gene

networks, identified a set of candidate genes which may play a role in the early event involved in

USTS development. The genes identified from this analysis were EPHA3, SNX10, ADAM9 and

CDC73. EPHA3 is a transmembrane protein that belongs to the Eph receptor tyrosine kinase

(RTK) family, which can be subdivided into 2 groups (EphA and EphB receptors) based on

sequence homology of the extracellular domain (ECD) of the receptor [115]. The Eph-receptor

ECD is composed of a ligand-binding domain, a cysteine-rich region, and two fibronectin type III

repeats. The cytoplasmic domain is composed of a juxtamembrane domain, a classical tyrosine

kinase domain, a sterile α-motif and a PDZ-domain. The Eph-ephrin receptor-ligand interaction

serves as a guidance system which regulates cell positioning and the modulation of cell morphology

[116]. Eph receptors and ephrin ligands are known to play an important role in many biological

functions including axon guidance, cell migration, angiogenesis and cytoskeletal regulation [117-

119]. Over-expression of the Eph receptors may promote cellular growth by increasing

vascularization, disrupting cell-cell adhesion, and providing potential adherence in a new tumour

microenvironment [120]. As such, it is not surprising that over-expression of EPHA3 has been

implicated in the pathogenesis of numerous cancers including lung cancer, prostate cancer,

melanoma, rhabdomyosarcoma and lymphocytic leukemia [110, 112, 120, 121].

73

SNX10 is a member of the sorting nexin family which are characterized by the presence of

an extracellular PX domain [122]. There are 29 known or predicted members of the sorting nexin

family, and the function of the majority of these proteins remains unknown [123]. Sorting nexins

have been shown to play a role in numerous cellular functions including intracellular protein

trafficking, endocytosis and cell-to-cell signaling [124, 125]. SNX10 specifically has been shown

to be capable of generating giant vacuoles in mammalian cells, suggesting that this protein may

serve to control endosome homeostasis [126]. EPHA3 and SNX10 both showed significant copy

number gains in 4/5 USTS samples. Interestingly, preliminary analysis also found EPHA3 to be

gained in the fifth USTS sample, however this gain was not found to be statistically significant.

These genes were also found to be significantly over-expressed in the cumulative expression

analysis.

ADAM9 is a member of the ADAM (a disintegrin and metalloprotease domain) family,

which have a predominant role in ectodomain shedding of membrane-bound molecules [127].

ADAMs are critical regulators of cell-cell signaling during development, and have been shown to

specifically play a role in muscle development, neurogenesis, cell adhesion and cell migration

[128]. ADAM9 specifically is known to cleave and release many molecules associated with

tumourigenesis and angiogenesis [129], and as such dysregulation of this protein has been

implicated in the pathogenesis of breast, prostate and lung cancer, melanoma, adenocarcinoma, and

metastasis to the brain [127, 130-133].

CDC73, or parafibromin, is a tumour suppressor that has been found to be involved in

transcriptional and post-transcriptional control pathways. Human parafibromin binds to RNA

polymerase II as part of the PAF1 transcriptional regulatory complex, and facilitates the association

74

of 3’ mRNA processing factors with chromatin that is being actively transcribed [134]. It has been

documented that loss of function of CDC73 can lead to oncogenesis, though the mechanisms

underlying this phenomenon are poorly understood. Furthermore, it has been shown that down-

regulation of CDC73 expression leads to increased cell proliferation and increased expression of

known proto-oncogenes [135]. Not surprisingly, deletions and loss-of-function mutations in the

CDC73 gene have been associated with various cancers including pancreatic cancer and

parathyroid carcinoma [136, 137]. ADAM9 and CDC73 both showed copy number loss in 3/5

USTS samples, and significant under-expression in the cumulative expression analysis.

Quantitative PCR analysis confirmed that USTS12, USTS13, USTS15 and USTS18 showed over-

expression of the EPHA3 gene, USTS12, USTS14, USTS15 and USTS18 showed over-expression

of SNX10, and that all 5 tumours showed under-expression of ADAM9 and CDC73. Each of these

genes plays an important role in the USTS-related biological pathways identified by gene network

analysis.

However, EPHA3 was identified as the most biologically relevant marker in this study as

this gene showed the most significant correlation between genomic gain and over-expression. As

such, we sought to validate the copy-number gain identified in EPHA3 by dual-colour FISH.

Microarray analysis identified a low-level gain in a 29-kb region in EPHA3 in all samples, but this

gain was not found by FISH in any of the tumour samples. This does not however, negate the

results of the microarray analysis. This 29-kb region of EPHA3 encompasses the entire exon III

coding region, as well as portions of introns II and III. It is possible that this copy number gain is a

result of a segmental duplication of exon III of the EPHA3 gene. If this were the case, it is possible

that separate EPHA3 probe signals could not be seen due to the size of the duplication, and the

proximity of the duplicate signals to one another. It would have been possible to visualize this

75

signal if the low-level copy-number gain were a result of rearrangement of this region, though this

does not seem to be the case. Interestingly, the ephrin ligand-binding domain has been found to be

specifically restricted to the N-terminal extracellular region of EPHA3 encoded by exon III.

Duplication of exons coding for the ligand-binding domain have been similarly documented in

another RTK, epidermal growth factor receptor (EGFR), in human gliomas [138-140]. Duplication

of the ligand-binding domain of EGFR has been shown to induce constitutive receptor

phosphorylation, while retaining the ability to bind the EGF ligand. Furthermore, it has been

shown that this segmental duplication may prevent efficient ligand-mediated receptor

internalization, resulting in prolonged RTK signaling [138-140]. Hypothetically, the same may be

true for the EPHA3 receptor. Duplication of the EPHA3 ligand-binding domain may promote

oncogenesis by increasing receptor activation, and impairing receptor attenuation. In order to

elucidate the role of this duplication in EPHA3, detailed functional studies are essential.

Furthermore, the IHC for EPHA3 was negative in all 5 samples tested, suggesting that

EPHA3 is not overexpressed at the protein level in these samples. However, negative staining for

EPHA3 may be a result of a poor quality antibody, as optimization on the positive control tissues

proved challenging. These results may also be a result of poor quality tissue received from the

Cooperative Human Tissue Network, as both FISH and IHC were difficult to perform on these

FFPE slides. Alternatively, it is possible that the EPHA3 protein is simply not expressed, or is

expressed at very low levels in these tissues. More studies using a larger cohort of samples, as well

as a different EPHA3-specific antibody are necessary to the EPHA3 expression status in USTSs.

This microarray study was limited in terms of the number of samples that were studied, as

well as the number of genes that were validated. Further studies which include more samples and

more comprehensive validation will be imperative to determine the role of these genes in

76

sarcomagenesis. Importantly, the general pathways identified in this study may provide novel

insights into the critical biological events responsible for the development of USTS as well as

sarcomas in general.

77

CHAPTER FOUR

SUMMARY, GENERAL CONCLUSIONS

&

FUTURE DIRECTIONS

78

CHAPTER 4. SUMMARY, GENERAL CONCLUSIONS & FUTURE DIRECTIONS

4.1 Summary and General Conclusions

Paediatric sarcomas have long been a challenging group of tumours to accurately diagnose

due to the similarity in histology and clinical presentation between different subgroups [141].

Though different sarcoma subtypes may appear very similar at the histological level, the prognosis

and treatment of these subtypes varies immensely, making correct sub-categorization paramount for

optimal therapy. The development of ancillary techniques such as immunohistochemistry,

cytogenetics and RT-PCR has significantly enhanced the pathologists’ ability to make an accurate

diagnosis [142]. Until very recently no markers had been identified in USTS, and consequently

undifferentiated sarcoma has remained a diagnosis of exclusion.

In this study, extensive and detailed genomic analysis of paediatric undifferentiated soft

tissue sarcomas has identified a specific variant of USTS with PRC morphology harbouring the

CIC-DUX4 rearrangement. Using RT-PCR and FISH, we have established a reliable and specific

screening platform with which any tumour can be screened for this rearrangement. In our cohort of

22 USTS samples, we identified 18 tumours with PRC morphology, 5 of which harbour the CIC-

DUX4 rearrangement (Objective 1).

Following this screening we were left with 17 tumours in which no specific molecular

markers had been identified. In order to identify potentially novel molecular markers for these

tumours, we performed an integrative copy-number and expression microarray analysis to identify

genes with changes at both the DNA and RNA levels. This study identified specific genes

(EPHA3, SNX10, ADAM9 and CDC73) which may play an important role in sarcomagenesis.

More importantly this study identified key USTS-related genomic pathways that may play a pivotal

79

role in the acquisition of the USTS phenotype (Objectives 2 and 3). While it remains unclear if it

will be possible to sub-classify the 17 tumours without the CIC-DUX4 translocation, we have

certainly made progress into identifying pathways responsible for the development of USTS.

Although USTS currently remains a diagnosis of exclusion, the cumulative data surrounding

the CIC-DUX4 translocation suggests that a portion of these tumours may in fact be identifiable by

a distinct chromosomal rearrangement. In CIC-DUX4-negative tumours, we have made progress in

identifying specific genes and biological pathways that may play a role in the development and

progression of these tumours. These findings help to further define this novel genetic subset of

paediatric sarcomas and provide an additional diagnostic tool for their classification and diagnosis.

4.2 Future Directions

The results of this project have given rise to several issues that are of utmost importance to

address in future studies. This study identified a subset of USTSs with PRC morphology that

harbour the CIC-DUX4 fusion gene. These tumours show minimal evidence of differentiation,

suggesting that they are able to maintain a stem cell-like phenotype. Thus, it is believed that the

presence of the CIC-DUX4 fusion transcript is associated with transforming properties [60], as well

as the maintenance of the undifferentiated state. However, significantly more evidence is needed in

order to prove both of these concepts. Future work in our laboratory will seek to determine the

specific biological and pathological consequences of the CIC-DUX4 fusion transcript. To do so, it

will be necessary to identify specific target molecules of the CIC-DUX4 fusion protein, as well as

to identify overall biological pathways that affected by this fusion. Furthermore, the diagnosis of

CIC-DUX4-positive tumours would be greatly aided by the identification of novel diagnostic

markers for these tumours. For example, the development of an immunohistochemcial-based

80

diagnostic assay for these tumours would allow for a timely and cost-efficient diagnosis to be made

where RT-PCR is unavailable. Additionally, the testing of other sarcoma subtypes for the presence

of this fusion transcript with help further elucidate the diagnostic utility of the CIC-DUX4 fusion

transcript. Future studies on CIC-DUX4-positive tumours will help to further elucidate the

biological features of these tumours, the prognostic implications of carrying this translocation, and

potential therapeutic options for this challenging subgroup of sarcomas.

A more in-depth study of the CIC-DUX4-negative tumours will provide a greater

understanding of the specific mechanisms of sarcomageneis in these tumours. Analysis of a greater

number of CIC-DUX4- negative tumours will allow for the accurate determination of both genomic

and transcriptomic features of these tumours, and will certainly enhance our understanding of

paediatric primitive sarcomas. Furthermore, it will be necessary to perform functional studies on

the candidate genes in order to gain a better understanding of the roles that these genes play in

sarcomagenesis. Additional work will hopefully give rise to diagnostic markers that can be used to

recognize this challenging group of tumours.

Future work in this field will provide a greater understanding of the early events involved in

the acquisition of the sarcoma phenotype. Furthermore, the analysis of both the CIC-DUX4 fusion

transcript and dysregulated pathways identified in CIC-DUX4-negative tumours will help to

improve diagnosis and treatment of such aggressive and poorly-understood sarcomas. These

findings will have relevance not only for paediatric sarcomas, but for sarcomas in general.

81

REFERENCES

82

1. Rosenthal, T.C. and W. Kraybill, Soft tissue sarcomas: integrating primary care recognition

with tertiary care center treatment. Am Fam Physician, 1999. 60(2): p. 567-72.

2. Davicioni, E., D.H. Wai, and M.J. Anderson, Diagnostic and prognostic sarcoma

signatures. Mol Diagn Ther, 2008. 12(6): p. 359-74.

3. van de Rijn, M. and J.A. Fletcher, Genetics of soft tissue tumors. Annu Rev Pathol, 2006. 1:

p. 435-66.

4. Chibon, F., et al., Validated prediction of clinical outcome in sarcomas and multiple types

of cancer on the basis of a gene expression signature related to genome complexity. Nat

Med. 16(7): p. 781-7.

5. Mackall, C.L., P.S. Meltzer, and L.J. Helman, Focus on sarcomas. Cancer Cell, 2002. 2(3):

p. 175-8.

6. Riggi, N., M.L. Suva, and I. Stamenkovic, Ewing's sarcoma origin: from duel to duality.

Expert Rev Anticancer Ther, 2009. 9(8): p. 1025-30.

7. Wolden, S.L. and K.M. Alektiar, Sarcomas across the age spectrum. Semin Radiat Oncol.

20(1): p. 45-51.

8. Pawel, B.R., et al., Undifferentiated sarcomas of children: pathology and clinical behavior-

-an Intergroup Rhabdomyosarcoma study. Med Pediatr Oncol, 1997. 29(3): p. 170-80.

9. Alaggio, R., et al., Undifferentiated sarcoma: does it exist? A clinicopathologic study of 7

pediatric cases and review of literature. Hum Pathol, 2009. 40(11): p. 1600-10.

10. Ries LAG, S.M., Gurney JG, Linet M, Tamra T, Young JL, Bunin GR (eds). , Cancer

Incidence and Survival among Children and Adolescents: United States SEER Program

1975-1995. National Cancer Institute, SEER Program. NIH Pub. No. 99-4649. Bethesda,

MD, 1999.

11. Meyer, W.H. and S.L. Spunt, Soft tissue sarcomas of childhood. Cancer Treat Rev, 2004.

30(3): p. 269-80.

12. Helman, L.J. and P. Meltzer, Mechanisms of sarcoma development. Nat Rev Cancer, 2003.

3(9): p. 685-94.

13. Eilber, F.C. and S.M. Dry, Diagnosis and management of synovial sarcoma. J Surg Oncol,

2008. 97(4): p. 314-20.

14. Burchill, S.A., Molecular abnormalities in Ewing's sarcoma. Expert Rev Anticancer Ther,

2008. 8(10): p. 1675-87.

15. Barr, F.G., Gene fusions involving PAX and FOX family members in alveolar

rhabdomyosarcoma. Oncogene, 2001. 20(40): p. 5736-46.

83

16. Mitelman, F., B. Johansson, and F. Mertens, The impact of translocations and gene fusions

on cancer causation. Nat Rev Cancer, 2007. 7(4): p. 233-45.

17. Coffin, C.M., L.P. Dehner, and P.A. O'Shea, Pediatric Soft Tissue Sarcomas: A Clinical,

Pathological and Therapeutic Approach. 1997, Baltimore: Williams and Wilkins.

18. Rubin, B.P., Recent progress in the classification of soft tissue tumors: role of genetics and

clinical implications. Curr Opin Oncol, 2001. 13(4): p. 256-60.

19. Coindre, J.M., Immunohistochemistry in the diagnosis of soft tissue tumours.

Histopathology, 2003. 43(1): p. 1-16.

20. Fletcher, C.D.M., K.K. Unni, and F. Mertens, World Health Organisation Classification of

Tumors: Tumors of Soft Tissue and Bone. 2002, Lyon: IARC Press.

21. Sorensen, P.H., et al., A second Ewing's sarcoma translocation, t(21;22), fuses the EWS

gene to another ETS-family transcription factor, ERG. Nat Genet, 1994. 6(2): p. 146-51.

22. Cessna, M.H., et al., Are myogenin and myoD1 expression specific for rhabdomyosarcoma?

A study of 150 cases, with emphasis on spindle cell mimics. Am J Surg Pathol, 2001. 25(9):

p. 1150-7.

23. Kast, R., et al., Differentiation of small round blue cell tumors using Raman spectroscopy. J

Pediatr Surg. 45(6): p. 1110-4.

24. Tanas, M.R. and J.R. Goldblum, Fluorescence in situ hybridization in the diagnosis of soft

tissue neoplasms: a review. Adv Anat Pathol, 2009. 16(6): p. 383-91.

25. Ladanyi, M., C.R. Antonescu, and P. Dal Cin, Cytogenetic and molecular genetic pathology

of soft tissue tumors in Enzinger and Weiss's Soft Tissue Tumors, S.W. Weiss and J.R.

Goldblum, Editors. 2008, Mosby Elsevier: Philadelphia. p. 73-102.

26. Knuutila, S., Cytogenetics and molecular pathology in cancer diagnostics. Ann Med, 2004.

36(3): p. 162-71.

27. Ordonez, J.L., et al., The clinical relevance of molecular genetics in soft tissue sarcomas.

Adv Anat Pathol. 17(3): p. 162-81.

28. Skapek, S.X. and C.H. Chui, Cytogenetics and the biologic basis of sarcomas. Curr Opin

Oncol, 2000. 12(4): p. 315-22.

29. Sebire, N.J., et al., Aberrant immunohistochemical expression in nonrhabdomyosarcoma

soft tissue sarcomas of infancy: retrospective review of clinical material. Pediatr Dev

Pathol, 2002. 5(6): p. 579-86.

84

30. Qualman, S.J., et al., Intergroup Rhabdomyosarcoma Study: update for pathologists. Pediatr

Dev Pathol, 1998. 1(6): p. 550-61.

31. Yoshimoto, M., et al., Detailed cytogenetic and array analysis of pediatric primitive

sarcomas reveals a recurrent CIC-DUX4 fusion gene event. Cancer Genet Cytogenet, 2009.

195(1): p. 1-11.

32. Somers, G.R., et al., Pediatric undifferentiated sarcoma of the soft tissues: a

clinicopathologic study. Pediatr Dev Pathol, 2006. 9(2): p. 132-42.

33. Kreiger, P.A., et al., Loss of INI1 expression defines a unique subset of pediatric

undifferentiated soft tissue sarcomas. Mod Pathol, 2009. 22(1): p. 142-50.

34. Selvarajah, S., et al., Characterization of trisomy 8 in pediatric undifferentiated sarcomas

using advanced molecular cytogenetic techniques. Cancer Genet Cytogenet, 2007. 174(1):

p. 35-41.

35. Pappo, A.S., et al., Survival after relapse in children and adolescents with

rhabdomyosarcoma: A report from the Intergroup Rhabdomyosarcoma Study Group. J Clin

Oncol, 1999. 17(11): p. 3487-93.

36. Raney, R.B., et al., Rhabdomyosarcoma and undifferentiated sarcoma in the first two

decades of life: a selective review of intergroup rhabdomyosarcoma study group experience

and rationale for Intergroup Rhabdomyosarcoma Study V. J Pediatr Hematol Oncol, 2001.

23(4): p. 215-20.

37. Charytonowicz, E., et al., Alveolar rhabdomyosarcoma: is the cell of origin a mesenchymal

stem cell? Cancer Lett, 2009. 279(2): p. 126-36.

38. Mani, S.A., et al., The epithelial-mesenchymal transition generates cells with properties of

stem cells. Cell, 2008. 133(4): p. 704-15.

39. Subramaniam, D., et al., Cancer stem cells: a novel paradigm for cancer prevention and

treatment. Mini Rev Med Chem. 10(5): p. 359-71.

40. Tan, B.T., et al., The cancer stem cell hypothesis: a work in progress. Lab Invest, 2006.

86(12): p. 1203-7.

41. Suva, M.L., et al., Identification of cancer stem cells in Ewing's sarcoma. Cancer Res, 2009.

69(5): p. 1776-81.

42. Bonnet, D. and J.E. Dick, Human acute myeloid leukemia is organized as a hierarchy that

originates from a primitive hematopoietic cell. Nat Med, 1997. 3(7): p. 730-7.

43. Passegue, E., et al., Normal and leukemic hematopoiesis: are leukemias a stem cell disorder

or a reacquisition of stem cell characteristics? Proc Natl Acad Sci U S A, 2003. 100 Suppl

1: p. 11842-9.

85

44. Li, C., et al., Identification of pancreatic cancer stem cells. Cancer Res, 2007. 67(3): p.

1030-7.

45. Singh, S.K., et al., Identification of human brain tumour initiating cells. Nature, 2004.

432(7015): p. 396-401.

46. Quintana, E., et al., Efficient tumour formation by single human melanoma cells. Nature,

2008. 456(7222): p. 593-8.

47. Levings, P.P., et al., Expression of an exogenous human Oct-4 promoter identifies tumor-

initiating cells in osteosarcoma. Cancer Res, 2009. 69(14): p. 5648-55.

48. Naka, N., et al., Synovial sarcoma is a stem cell malignancy. Stem Cells. 28(7): p. 1119-31.

49. Murase, M., et al., Side population cells have the characteristics of cancer stem-like

cells/cancer-initiating cells in bone sarcomas. Br J Cancer, 2009. 101(8): p. 1425-32.

50. Jiang, X., et al., CD133 expression in chemo-resistant Ewing sarcoma cells. BMC Cancer.

10: p. 116.

51. Riggi, N., et al., Expression of the FUS-CHOP fusion protein in primary mesenchymal

progenitor cells gives rise to a model of myxoid liposarcoma. Cancer Res, 2006. 66(14): p.

7016-23.

52. Ren, Y.X., et al., Mouse mesenchymal stem cells expressing PAX-FKHR form alveolar

rhabdomyosarcomas by cooperating with secondary mutations. Cancer Res, 2008. 68(16):

p. 6587-97.

53. Miyagawa, Y., et al., Inducible expression of chimeric EWS/ETS proteins confers Ewing's

family tumor-like phenotypes to human mesenchymal progenitor cells. Mol Cell Biol, 2008.

28(7): p. 2125-37.

54. Riggi, N., et al., EWS-FLI-1 expression triggers a Ewing's sarcoma initiation program in

primary human mesenchymal stem cells. Cancer Res, 2008. 68(7): p. 2176-85.

55. Riggi, N., et al., Development of Ewing's sarcoma from primary bone marrow-derived

mesenchymal progenitor cells. Cancer Res, 2005. 65(24): p. 11459-68.

56. Tirode, F., et al., Mesenchymal stem cell features of Ewing tumors. Cancer Cell, 2007.

11(5): p. 421-9.

57. Hu-Lieskovan, S., et al., EWS-FLI1 fusion protein up-regulates critical genes in neural

crest development and is responsible for the observed phenotype of Ewing's family of

tumors. Cancer Res, 2005. 65(11): p. 4633-44.

86

58. Newton, W.A., Jr., et al., Classification of rhabdomyosarcomas and related sarcomas.

Pathologic aspects and proposal for a new classification--an Intergroup

Rhabdomyosarcoma Study. Cancer, 1995. 76(6): p. 1073-85.

59. Davis IJ, P.-A.A., Fisher DE. , Nonrhabdomyosarcomas and other soft tissue tumors.

Oncology of Infancy and Childhood. , ed. F.D. Orkin SH, Look AT, et al., editors. 2009:

Philadelphia: Saunders Elsevier.

60. Kawamura-Saito, M., et al., Fusion between CIC and DUX4 up-regulates PEA3 family

genes in Ewing-like sarcomas with t(4;19)(q35;q13) translocation. Hum Mol Genet, 2006.

15(13): p. 2125-37.

61. Sadikovic, B., Graham, C., Ho, M., Zielenska, M., Somers, G., Immunohistochemical

expression and cluster analysis of mesenchymal and neural stem cell-associated proteins in

pediatric soft tissue sarcomas. submitted to Pediatr Dev Pathol, 2010.

62. Keller, C., et al., Pax3:Fkhr interferes with embryonic Pax3 and Pax7 function:

implications for alveolar rhabdomyosarcoma cell of origin. Genes Dev, 2004. 18(21): p.

2608-13.

63. Tajbakhsh, S., et al., Redefining the genetic hierarchies controlling skeletal myogenesis:

Pax-3 and Myf-5 act upstream of MyoD. Cell, 1997. 89(1): p. 127-38.

64. Maroto, M., et al., Ectopic Pax-3 activates MyoD and Myf-5 expression in embryonic

mesoderm and neural tissue. Cell, 1997. 89(1): p. 139-48.

65. Epstein, J.A., et al., Pax3 inhibits myogenic differentiation of cultured myoblast cells. J Biol

Chem, 1995. 270(20): p. 11719-22.

66. Lang, D., et al., Pax3 functions at a nodal point in melanocyte stem cell differentiation.

Nature, 2005. 433(7028): p. 884-7.

67. Gang, E.J., et al., Pax3 activation promotes the differentiation of mesenchymal stem cells

toward the myogenic lineage. Exp Cell Res, 2008. 314(8): p. 1721-33.

68. Hoot, A.C., et al., Immunohistochemical analysis of hSNF5/INI1 distinguishes renal and

extra-renal malignant rhabdoid tumors from other pediatric soft tissue tumors. Am J Surg

Pathol, 2004. 28(11): p. 1485-91.

69. Judkins, A.R., et al., INI1 protein expression distinguishes atypical teratoid/rhabdoid tumor

from choroid plexus carcinoma. J Neuropathol Exp Neurol, 2005. 64(5): p. 391-7.

70. Richkind, K.E., S.G. Romansky, and J.Z. Finklestein, t(4;19)(q35;q13.1): a recurrent

change in primitive mesenchymal tumors? Cancer Genet Cytogenet, 1996. 87(1): p. 71-4.

87

71. Rakheja, D., et al., Translocation (4;19)(q35;q13.1)-associated primitive round cell

sarcoma: report of a case and review of the literature. Pediatr Dev Pathol, 2008. 11(3): p.

239-44.

72. Somers, G.R., et al., IGF2 is highly expressed in pediatric undifferentiated sarcomas and

reveals two distinct cytoplasmic trafficking patterns. Pediatr Dev Pathol, 2010. 13(3): p.

169-77.

73. Ausubel FM, B.R., Kingston RE, Moore DD, Seidman JG, Smith JA, Sturhl K. , Short

protocols in molecular biology. . 5th ed, ed. N.Y.J. Wiley. 2002.

74. Bayani, J., et al., Spectral karyotyping identifies recurrent complex rearrangements of

chromosomes 8, 17, and 20 in osteosarcomas. Genes Chromosomes Cancer, 2003. 36(1): p.

7-16.

75. Bayani, J., et al., Molecular cytogenetic analysis of medulloblastomas and supratentorial

primitive neuroectodermal tumors by using conventional banding, comparative genomic

hybridization, and spectral karyotyping. J Neurosurg, 2000. 93(3): p. 437-48.

76. Chin, S.F., et al., A simple and reliable pretreatment protocol facilitates fluorescent in situ

hybridisation on tissue microarrays of paraffin wax embedded tumour samples. Mol Pathol,

2003. 56(5): p. 275-9.

77. Sabeti, P.C., et al., Genome-wide detection and characterization of positive selection in

human populations. Nature, 2007. 449(7164): p. 913-8.

78. Schroeder, A., et al., The RIN: an RNA integrity number for assigning integrity values to

RNA measurements. BMC Mol Biol, 2006. 7: p. 3.

79. The International HapMap Project. Nature, 2003. 426(6968): p. 789-96.

80. Sadikovic, B., et al., Genome-wide H3K9 histone acetylation profiles are altered in

benzopyrene-treated MCF7 breast cancer cells. J Biol Chem, 2008. 283(7): p. 4051-60.

81. Sadikovic, B., et al., Identification of interactive networks of gene expression associated

with osteosarcoma oncogenesis by integrated molecular profiling. Hum Mol Genet, 2009.

18(11): p. 1962-75.

82. Durbin, A.D., et al., JNK1 determines the oncogenic or tumor-suppressive activity of the

integrin-linked kinase in human rhabdomyosarcoma. J Clin Invest, 2009. 119(6): p. 1558-

70.

83. Schmittgen, T.D. and K.J. Livak, Analyzing real-time PCR data by the comparative C(T)

method. Nat Protoc, 2008. 3(6): p. 1101-8.

84. Berglund, L., et al., A genecentric Human Protein Atlas for expression profiles based on

antibodies. Mol Cell Proteomics, 2008. 7(10): p. 2019-27.

88

85. Uhlen, M., et al., A human protein atlas for normal and cancer tissues based on antibody

proteomics. Mol Cell Proteomics, 2005. 4(12): p. 1920-32.

86. Nilsson, P., et al., Towards a human proteome atlas: high-throughput generation of mono-

specific antibodies for tissue profiling. Proteomics, 2005. 5(17): p. 4327-37.

87. Ponten, F., K. Jirstrom, and M. Uhlen, The Human Protein Atlas--a tool for pathology. J

Pathol, 2008. 216(4): p. 387-93.

88. Somers, G.R., et al., Primary subcutaneous primitive neuroectodermal tumor with

aggressive behavior and an unusual karyotype: case report. Pediatr Dev Pathol, 2004. 7(5):

p. 538-45.

89. Yoshimoto, M., et al., Absence of TMPRSS2:ERG fusions and PTEN losses in prostate

cancer is associated with a favorable outcome. Mod Pathol, 2008. 21(12): p. 1451-60.

90. Lee, C.J., et al., CIC, a member of a novel subfamily of the HMG-box superfamily, is

transiently expressed in developing granule neurons. Brain Res Mol Brain Res, 2002.

106(1-2): p. 151-6.

91. Gabriels, J., et al., Nucleotide sequence of the partially deleted D4Z4 locus in a patient with

FSHD identifies a putative gene within each 3.3 kb element. Gene, 1999. 236(1): p. 25-32.

92. Mathas, S., et al., Gene deregulation and spatial genome reorganization near breakpoints

prior to formation of translocations in anaplastic large cell lymphoma. Proc Natl Acad Sci

U S A, 2009. 106(14): p. 5831-6.

93. Graham, C., Chilton-MacNeill, S., Zielenska, M., Somers, G.R., The CIC-DUX4 fusion

transcript is present in a distinct subgroup of pediatric primitive round cell sarcomas.

Submitted to Pediatr Dev Pathol., 2010.

94. Beausoleil, S.A., et al., Large-scale characterization of HeLa cell nuclear phosphoproteins.

Proc Natl Acad Sci U S A, 2004. 101(33): p. 12130-5.

95. Olsen, J.V., et al., Global, in vivo, and site-specific phosphorylation dynamics in signaling

networks. Cell, 2006. 127(3): p. 635-48.

96. Cantin, G.T., et al., Combining protein-based IMAC, peptide-based IMAC, and MudPIT for

efficient phosphoproteomic analysis. J Proteome Res, 2008. 7(3): p. 1346-51.

97. Dephoure, N., et al., A quantitative atlas of mitotic phosphorylation. Proc Natl Acad Sci U S

A, 2008. 105(31): p. 10762-7.

98. Mayya, V., et al., Quantitative phosphoproteomic analysis of T cell receptor signaling

reveals system-wide modulation of protein-protein interactions. Sci Signal, 2009. 2(84): p.

ra46.

89

99. Hernandez, L., et al., Diversity of genomic breakpoints in TFG-ALK translocations in

anaplastic large cell lymphomas: identification of a new TFG-ALK(XL) chimeric gene with

transforming activity. Am J Pathol, 2002. 160(4): p. 1487-94.

100. Grimwood, J., et al., The DNA sequence and biology of human chromosome 19. Nature,

2004. 428(6982): p. 529-35.

101. Lee, C.J., W.I. Chan, and P.J. Scotting, CIC, a gene involved in cerebellar development and

ErbB signaling, is significantly expressed in medulloblastomas. J Neurooncol, 2005. 73(2):

p. 101-8.

102. Lim, J., et al., Opposing effects of polyglutamine expansion on native protein complexes

contribute to SCA1. Nature, 2008. 452(7188): p. 713-8.

103. Lam, Y.C., et al., ATAXIN-1 interacts with the repressor Capicua in its native complex to

cause SCA1 neuropathology. Cell, 2006. 127(7): p. 1335-47.

104. Wijmenga, C., et al., Chromosome 4q DNA rearrangements associated with

facioscapulohumeral muscular dystrophy. Nat Genet, 1992. 2(1): p. 26-30.

105. Dixit, M., et al., DUX4, a candidate gene of facioscapulohumeral muscular dystrophy,

encodes a transcriptional activator of PITX1. Proc Natl Acad Sci U S A, 2007. 104(46): p.

18157-62.

106. Sirvent, N., et al., Fusion of EWSR1 with the DUX4 facioscapulohumeral muscular

dystrophy region resulting from t(4;22)(q35;q12) in a case of embryonal

rhabdomyosarcoma. Cancer Genet Cytogenet, 2009. 195(1): p. 12-8.

107. Kowaljow, V., et al., The DUX4 gene at the FSHD1A locus encodes a pro-apoptotic

protein. Neuromuscul Disord, 2007. 17(8): p. 611-23.

108. de Launoit, Y., et al., The PEA3 group of ETS-related transcription factors. Role in breast

cancer metastasis. Adv Exp Med Biol, 2000. 480: p. 107-16.

109. Farber, C.R. and A.J. Lusis, Integrating global gene expression analysis and genetics. Adv

Genet, 2008. 60: p. 571-601.

110. Clifford, N., et al., The EphA3 receptor is expressed in a subset of rhabdomyosarcoma cell

lines and suppresses cell adhesion and migration. J Cell Biochem, 2008. 105(5): p. 1250-9.

111. Wykosky, J. and W. Debinski, The EphA2 receptor and ephrinA1 ligand in solid tumors:

function and therapeutic targeting. Mol Cancer Res, 2008. 6(12): p. 1795-806.

112. Timar, J., et al., Melanoma genomics reveals signatures of sensitivity to bio- and targeted

therapies. Cell Immunol, 2006. 244(2): p. 154-7.

90

113. Sakabe, N.J. and M.A. Nobrega, Genome-wide maps of transcription regulatory elements.

Wiley Interdiscip Rev Syst Biol Med. 2(4): p. 422-37.

114. Sadikovic, B., et al., Cause and consequences of genetic and epigenetic alterations in

human cancer. Curr Genomics, 2008. 9(6): p. 394-408.

115. Himanen, J.P., N. Saha, and D.B. Nikolov, Cell-cell signaling via Eph receptors and

ephrins. Curr Opin Cell Biol, 2007. 19(5): p. 534-42.

116. Pasquale, E.B., Eph receptor signalling casts a wide net on cell behaviour. Nat Rev Mol

Cell Biol, 2005. 6(6): p. 462-75.

117. Klein, R., Excitatory Eph receptors and adhesive ephrin ligands. Curr Opin Cell Biol, 2001.

13(2): p. 196-203.

118. Egea, J. and R. Klein, Bidirectional Eph-ephrin signaling during axon guidance. Trends

Cell Biol, 2007. 17(5): p. 230-8.

119. Kullander, K. and R. Klein, Mechanisms and functions of Eph and ephrin signalling. Nat

Rev Mol Cell Biol, 2002. 3(7): p. 475-86.

120. Dodelet, V.C. and E.B. Pasquale, Eph receptors and ephrin ligands: embryogenesis to

tumorigenesis. Oncogene, 2000. 19(49): p. 5614-9.

121. Surawska, H., P.C. Ma, and R. Salgia, The role of ephrins and Eph receptors in cancer.

Cytokine Growth Factor Rev, 2004. 15(6): p. 419-33.

122. Worby, C.A. and J.E. Dixon, Sorting out the cellular functions of sorting nexins. Nat Rev

Mol Cell Biol, 2002. 3(12): p. 919-31.

123. Carlton, J., et al., Sorting nexins--unifying trends and new perspectives. Traffic, 2005. 6(2):

p. 75-82.

124. Kurten, R.C., D.L. Cadena, and G.N. Gill, Enhanced degradation of EGF receptors by a

sorting nexin, SNX1. Science, 1996. 272(5264): p. 1008-10.

125. Horazdovsky, B.F., et al., A sorting nexin-1 homologue, Vps5p, forms a complex with

Vps17p and is required for recycling the vacuolar protein-sorting receptor. Mol Biol Cell,

1997. 8(8): p. 1529-41.

126. Qin, B., et al., Sorting nexin 10 induces giant vacuoles in mammalian cells. J Biol Chem,

2006. 281(48): p. 36891-6.

127. Duffy, M.J., et al., Role of ADAMs in cancer formation and progression. Clin Cancer Res,

2009. 15(4): p. 1140-4.

91

128. Edwards, D.R., M.M. Handsley, and C.J. Pennington, The ADAM metalloproteinases. Mol

Aspects Med, 2008. 29(5): p. 258-89.

129. Mancia, F. and L. Shapiro, ADAM and Eph: how Ephrin-signaling cells become detached.

Cell, 2005. 123(2): p. 185-7.

130. Shintani, Y., et al., Overexpression of ADAM9 in non-small cell lung cancer correlates with

brain metastasis. Cancer Res, 2004. 64(12): p. 4190-6.

131. O'Shea, C., et al., Expression of ADAM-9 mRNA and protein in human breast cancer. Int J

Cancer, 2003. 105(6): p. 754-61.

132. Peduto, L., et al., Critical function for ADAM9 in mouse prostate cancer. Cancer Res, 2005.

65(20): p. 9312-9.

133. Grutzmann, R., et al., ADAM9 expression in pancreatic cancer is associated with tumour

type and is a prognostic factor in ductal adenocarcinoma. Br J Cancer, 2004. 90(5): p.

1053-8.

134. Rozenblatt-Rosen, O., et al., The parafibromin tumor suppressor protein is part of a human

Paf1 complex. Mol Cell Biol, 2005. 25(2): p. 612-20.

135. Lin, L., et al., Nuclear localization of the parafibromin tumor suppressor protein implicated

in the hyperparathyroidism-jaw tumor syndrome enhances its proapoptotic function. Mol

Cancer Res, 2007. 5(2): p. 183-93.

136. Hahn, M.A., et al., CDC73/HRPT2 CpG island hypermethylation and mutation of 5'-

untranslated sequence are uncommon mechanisms of silencing parafibromin in parathyroid

tumors. Endocr Relat Cancer. 17(1): p. 273-82.

137. Chaudhary, K., et al., Human RNA polymerase II-associated factor complex: dysregulation

in cancer. Oncogene, 2007. 26(54): p. 7499-507.

138. Ciesielski, M.J. and R.A. Fenstermaker, Oncogenic epidermal growth factor receptor

mutants with tandem duplication: gene structure and effects on receptor function.

Oncogene, 2000. 19(6): p. 810-20.

139. Fenstermaker, R.A. and M.J. Ciesielski, Deletion and tandem duplication of exons 2 - 7 in

the epidermal growth factor receptor gene of a human malignant glioma. Oncogene, 2000.

19(39): p. 4542-8.

140. Fenstermaker, R.A., M.J. Ciesielski, and G.J. Castiglia, Tandem duplication of the

epidermal growth factor receptor tyrosine kinase and calcium internalization domains in A-

172 glioma cells. Oncogene, 1998. 16(26): p. 3435-43.

92

141. Lewis, T.B., C.M. Coffin, and P.S. Bernard, Differentiating Ewing's sarcoma from other

round blue cell tumors using a RT-PCR translocation panel on formalin-fixed paraffin-

embedded tissues. Mod Pathol, 2007. 20(3): p. 397-404.

142. Downing, J.R., et al., Detection of the (11;22)(q24;q12) translocation of Ewing's sarcoma

and peripheral neuroectodermal tumor by reverse transcription polymerase chain reaction.

Am J Pathol, 1993. 143(5): p. 1294-300.

Documents

UNDIFFERENTIATED SARCOMAS: …...Figure 3.7 RT-PCR confirmation of the CIC-DUX4 fusion for t(4;19) pilot study 47 Figure 3.8 Identification of the CIC-DUX4 fusion by RT-PCR screening