Upload
vungoc
View
214
Download
0
Embed Size (px)
Citation preview
Genetic susceptibility to rheumatoid arthritis and its implications for novel drug
discovery.
Abstract
Introduction
Over 100 susceptibility loci have now been identified for rheumatoid arthritis (RA), several
of which are already the targets of approved RA therapies providing proof of concept for the
use of genetics in novel drug development for RA. Determining how these loci contribute to
disease will be key to elucidating the mechanisms driving disease development and has the
potential for major impact on therapeutic development.
Areas covered
Here we review the use of genetics in drug discovery, including the use of ‘omics’ data to
prioritise potential drug targets at susceptibility loci using RA as an exemplar. We discuss
the current state of RA genetics its impact on stratified medicine, and how the findings from
RA genetics studies can be used to inform drug discovery.
Expert opinion
It is anticipated that functional characterisation of disease variants will provide biological
validation of a gene as a drug target, providing safer targets, with an increased likelihood of
efficacy. In the future techniques such as genome editing may represent a plausible option
for RA therapy. Technologies such as genome wide chromatin conformation capture Hi-C
and CRISPR will be crucial to inform our understanding of how diseases develop and in
developing new treatments.
1
Key words
Genetics
Rheumatoid Arthritis
Drug Discovery
Functional genomics
Therapeutic target
2
1.0 Introduction
Rheumatoid arthritis (RA) is a complex common autoimmune disease which manifests in the
inflammation and destruction of synovial joints, affecting approximately 1% of the
population worldwide. RA can progress to disability, systemic complications and early death
all of which contribute to socioeconomic costs. Currently both conventional and biological
disease modifying anti-rheumatic drugs (DMARDs) are used in the treatment of RA. The
first line therapy for RA is a conventional DMARD such as methotrexate, sulfasalazine,
leflunomide, hydroxychloroquine and azathioprine. Methotrexate is usually the drug of
choice, however many patients fail to respond and only 55% of patients remain on the drug
after 2 years [1]. Patients failing to respond to MTX are subsequently prescribed biologic
DMARDs such as anti-tumour necrosis factor (TNF) agents which disrupt TNF-α in order to
reduce inflammation , indeed combinations of traditional and biologic DMARDs are often
prescribed. Several anti-TNF agents are used to treat RA including infliximab, adalimumab,
golimumab, etanercept and certolizumab, all of which inhibit the effects of TNF-α by
preventing it from binding to its receptor. Anti-TNF biologics are costly, approximately
£10,000 per patient per year, and also have a high rate of non-response (approximately 30-
40% of patients) [2, 3]. Other biologics, targeting different pathways are also now available,
for example rituximab which targets B-cells and Tocilizumab targeting the interleukin 6
pathway. Although there has been significant progress in the treatment of RA over the past 10
years a significant proportion of patients are refractory to current therapies meaning there is
still need for new therapies. The heritability of RA is estimated to be approximately 60%,
indicating that the most important factor in determining whether someone is likely to develop
RA is found in their genetic background. An individual’s genetic background is present
before the onset of the disease and therefore could potentially be used as both an early
3
biomarker of disease or as a target for therapeutic intervention. Indeed a recent report
estimated that a therapeutic target with existing genetic evidence is twice as likely to be
brought to market [4]. This is supported by a review of the AstraZeneca drug discovery
pipeline, which indicated that in their trials 73% of the compounds which made it to phase II
were supported by genetic evidence, compared to only 43% without genetic support [5].
Understanding the genetics of disease susceptibility will therefore be key to elucidating the
mechanisms driving disease development and will have a major impact on therapeutic
development. Here we review how genetics can be used in a translational approach to inform
drug discovery using RA as an exemplar.
2.0 Genetics of Rheumatoid arthritis, the current state of play
Since 2007, advances in genotyping technologies and the advent of genome wide association
studies (GWAS), over 100 genetic susceptibility loci have been identified for RA adding to
the well- established associations to the human leukocyte antigen (HLA) locus and protein
tyrosine non receptor 22 (PTPN22). Parallel studies in other autoimmune and inflammatory
diseases have revealed a picture of shared genetic susceptibility loci across many diseases.
The Immunochip project used this overlap to its advantage, with a consortium of 12
autoimmune diseases designing a custom illumina SNP genotyping array containing
~200,000 SNPs at 186 loci previously implicated in these diseases allowing the fine mapping
of these loci. The study included 11475 RA cases and 15870 controls of European descent,
which were combined in a meta-analysis with previous independent GWAS data [6]. The
study identified 14 new susceptibility loci and refined the association to a single gene for 19
loci. A follow up study testing variants which demonstrated suggestive evidence of
significance in the Immunochip also identified two novel loci [7]. The most recent large
4
meta-analysis used a trans-ethnic approach including >100,000 samples of European and
Asian ancestry, analysing over 10 million SNPs, increased the number of RA susceptibility to
101[8]. Indeed GWAS focus on common frequency variants, therefore little has been done to
investigate rare variants in RA. Several studies have suggested that the impact of rare variants
on the heritability of RA is minimal (refs), several large scale sequencing projects may help
to shed light on the contribution of these rare coding variants. Variants identified through
GWAS are not necessarily the true causal variants as they may be correlated with several
other variants through linkage disequilibrium. In addition although >100 susceptibility
variants have been identified the mechanism by which these variants contribute to disease is
largely unknown.
Given this wealth of genetic data emerging from recent studies the question remains as to
how this data can be used to impact on clinical practice. An understanding of how disease
associated genetics variants affect gene function has the potential to inform drug discovery,
through the identification of disease genes that are drugable targets or through the
identification of pathways that could be up or down regulated to modulate the disease
condition. In addition the elucidation of a genetic basis for the existence of disease subsets
will in turn inform a stratified medicine approach in which the different biological pathways
driving disease in patient subsets can be characterised and targeted. Before this can happen,
though, the next challenge is to determine precisely which of the associated variants have
functional consequences, on which genes and in which cell types these are effective in order
that the key biological pathways disrupted in disease can be identified.
3.0 Utilising biological pathways and multi “Omics” data to prioritize genetic loci
5
The abundance of biological “omics” data, such as gene expression data, chromatin histone
marks, protein-protein interactions, biological pathways and GWAS results which can be
used to develop hypotheses and advance the findings from GWA studies to facilitate insight
into the pathological mechanisms of disease associated variants (Figure 1).
This was demonstrated in the most recent meta-analysis in RA to date, in which publically
available data was used to develop an in-silico pipeline based on functional annotations [8].
The authors combined genetic risk loci with data including; functional annotation of SNPs,
cis acting eQTLs, risk variants for other traits, text mining, epigenetic motifs, protein-protein
interactions, knockout mouse phenotypes and annotated molecular pathways to identify the
candidate genes at the RA risk loci. Studying chromatin marks in 34 cell types, the authors
found that the RA loci were enriched for tri-methylation of histone H3 at lysine 4
(H3K4me3) in primary CD4+ regulatory T cells. H3K4me3 is a histone modification
associated with open and active chromatin as it has been correlated with increased gene
expression and nucleosome loss [9], suggesting RA loci, although mostly located in
intergenic or intronic regions, are located in open active sites. For 98 of the 101 loci,
candidate risk genes were identified. Forty four of these loci showed evidence of being
expression quantitative trait loci (eQTLs) acting in cis, in data obtained from peripheral blood
mononuclear cells, CD4+ T-cells and CD14+ and CD16+ monocytes. Pathway analysis
supported these findings, demonstrating enrichment of risk loci in T-cell related pathways, B-
cell pathways and cytokine signalling pathways.
4.0 Genetic studies reveal Disease Pathways
Using genetics to identify critical pathways involved in disease is one method by which
genetics can be utilised in drug discovery (Figure 1). Once a pathway has been identified it
6
would then be assumed that if a risk variant increased the expression of a gene that positively
regulates a particular pathway, inhibition of that gene or signalling pathway may be effective
in treating the disease, a high throughput drug screen of the pathway could then be
conducted. Okada et al used a library of drugs currently approved for use in experimental or
in clinical trial therapies for human diseases which target specific genes and screened them
for the presence of the identified RA candidate genes or any genes from their direct protein-
protein interaction networks[8]. Twenty seven of the approved drug target genes overlapped
the RA candidate genes, demonstrating the utility of this approach.
Genes mapping to RA loci have been shown to cluster into three pathways, NFKB, T cell
signalling and JAK-STAT pathways [10]. Candidate genes at several RA loci are involved in
NFKB signalling, including REL, TNFAIP3, PRKCQ, TRAF1, TRAF6, CCL19/CCL21,
IRAK1 and CD40. A study by Li et al investigated the CD40 locus as a potential therapeutic
target; fine mapping and exon sequencing of the locus, refined the association to a single
SNP, the authors then show that this SNP is an eQTL for CD40 expression in PBMCs, this
was supported by flow cytometry results which showed that individuals who were
homozygous for the risk allele had increased amounts of CD40 on the cell surface of CD19+
B-cells compared to those homozygous for the non-risk allele. Perturbation experiments
showed a direct correlation between the amount of CD40 on the cells surface and the
phosphorylation of RelA, a subunit of the NFKB transcription factor. A high throughput
drug screen identified two novel compounds as potential therapies [11].
Another interesting RA susceptibility locus is the PADI4 locus. PADI4 encodes an enzyme
which catalyses protein citrullination, converting arginine residues to citrulline which are the
target of anti citrullinated peptide antibodies (ACPA). Indeed the association at this locus is
specific to RA, where as many of the other RA loci are shared across other autoimmune
diseases. Cigarette smoking has been associated with an increased risk of RA and causes
7
protein citrullination, ncreased expression of PADI enzymes has been found in
bronchoalveolar lavage cells from smokers compared to non-smokers; suggesting that the
PADI4 locus may be important in the development of RA, representing an interesting drug
target.
5.0 Support for using genetics in drug discovery; proof of concept
The identification of variants associated with susceptibility to RA that are already the targets
of successful therapeutics (IL6, CTLA4, TYK2) provides a proof of concept that genetics
could be successful in identifying novel targets. A variant within the interleukin 6 receptor
(IL6R) gene has recently been associated with RA and IL6R is the target of the monoclonal
antibody tocilizumab, an approved RA therapy. IL6 is a pro-inflammatory cytokine,
overproduction of which causes systemic inflammation and elevated levels have been
observed in the serum and synovial fluid of RA patients [12]. IL6R exists as both soluble and
membrane bound forms, where the RA disease associated variant, leads to increased cleavage
of the membrane-bound protein, increasing the amount of the soluble form of the IL6
receptor. Tocilizumab is a humanised monoclonal antibody which binds to both the
membrane bound and soluble forms of the IL6 receptor prohibiting IL6 from exerting its pro-
inflammatory effects. A new therapy Sirukumab which blocks IL-6 as opposed to the IL6
receptor is now in phase II trials after promising results in Phase II, where low doses of the
drug showed significant improvements in symptoms [13]. Other approved RA therapies
targeting candidate genes include abatacept, which targets the CTLA4 gene and tofacitinib
targeting the JAK-STAT pathway. Tyrosine kinase 2 (TYK2) is a key molecule in the JAK-
STAT signalling pathway implicated in cytokine signalling responses associated with
8
immunological diseases, a variant in this gene has been associated with RA [6]. Mouse
models have also provided further supporting evidence that TYK2 is a genetic target for RA,
with Tyk2 deficient mice shown to be resistant to experimental arthritis [14]. These
examples provide compelling evidence that genetics can significantly contribute to the
identification of drug targets. Currently less than 5% of drug molecules that begin phase I
trials are approved as safe effective therapeutics [15]. It has been shown that most failures
occur during phase II trials mainly as a result of lack of efficacy or toxicity. The application
of human genetics to drug discovery has been demonstrated to reduce the number of
compounds which fail at phase II trials, potentially by increasing the probability that
therapeutically modulating the target will result in a safe and effective drug. The majority of
the RA susceptibility loci identified to date confer small effect sizes, raising the concern that
they may not be sufficient therapeutic targets, however the examples described above
demonstrate that pharmacological modulation of disease associated genes conferring small
effects on risk (OR<1.4) has an observable clinical impact.
GWAS finding could also highlight target genes where there is the possibility of
repositioning drugs already approved for other diseases, short-cutting the long process of
major clinical trials. This has proven successful in the past, for example both methotrexate
and rituximab were originally used to treat cancer, and early indications suggest GWAS may
provide more compelling targets. In addition, the emerging genetic sharing, known as
pleiotropy, between autoimmune diseases suggests that drug re-positioning between these
genetically related diseases may be a feasible option.
A recent study has estimated the cost of bringing a prescription drug to market to be $2.6
billion dollars [16], given the low success rate of molecules entering phase I trials that
successfully complete this journey, a huge financial loss is incurred by failing drugs. Not
only can genetics provide novel, or re-positioned targets for RA, but by fully understanding
9
the mechanistic action of genetic variants we can model the effects of modifying a target
within patients. This then provides both the magnitude and direction (agonist/antagonist) that
the therapy must mimic to be effective. As discussed earlier the variants associated with RA
in IL6R have the same effect as the approved drug tocilizumab, inhibiting the IL6 receptor.
The observation of the effects of the protective risk allele on the IL6 receptor provides
validation of IL6 as a suitable drug target. This could be referred to as a natural experiment,
and has been successful in the development of several approved therapies, for example statins
as a treatment for high cholesterol. Families with hypercholesterolemia carrying mutations in
the low density lipoprotein receptor gene (LDLR) demonstrated high LD cholesterol and
increased risk of heart disease, a relationship was seen between the number/type of mutations
and the blood levels of LDL cholesterol levels, corresponding to a range of individual
disease severity [17, 18]. This series of alleles resulting in different extremes of the
phenotype allowed the relationship between LDL and increased risk of heart attack to be
defined. 3-hydroxy-3-methylglutaryl-coenzyme A (HMG-CoA) was known to be critical to
the synthesis of LDL and this ultimately led to the development of HMG-CoA reductase
inhibitors (statins) to treat high cholesterol. The identification of a series of alleles associated
with a disease affecting a single gene or pathway ranging from complete inhibition of the
target to an increase in expression, would allow us to model the effects of these alleles on the
disease phenotype in a dose response manner. For example, loss of function variants would
allow us to observe the effects of target inhibition on the phenotype and could mimic the
effects of a potential therapeutic, including off-target and detrimental effects. This would
require in in depth knowledge of the causal alleles and their function. The identification of
such allelic series may require large scale sequencing studies in order to identify the full
range of variants.
10
PCSK9 is an example of the success of an allelic series leading to treatment; a rare gain of
function mutation in the PCSK9 gene was found in two families with an increased risk of
coronary artery disease (CAD) hypercholesterolemia [19]. Further studies found that a loss
of function variant at PCSK9 correlated with low LDL levels and reduced occurrence of CAD
[20-22]. In vivo studies then revealed a mechanistic link between PCSK9 and LDL levels
demonstrating that overexpression of PCSK9 in mice resulted in hypercholesterolemia. This
was determined to be due to a post-transcriptional modification causing a reduction in the
expression of the LDL receptor protein [23, 24]. Clinical trials of a PCSK9 monoclonal
antibody significantly reduced LDL cholesterol levels in both healthy individuals and
individuals with hypercholesterolemia [25]. Subsequently, two cholesterol lowering PCSK9
monoclonal antibodies, repatha and praluent, have been approved by the FDA.
A further indication of how genetic discoveries can aid therapeutic intervention can be
demonstrated with a SNP within TNFRSF1A a gene encoding tumour necrosis factor receptor
1 (TNFR1), that has been associated with multiple sclerosis but not with other AID, such as
RA. A study has shown that the MS risk allele regulates the expression of a novel soluble
form of TNFR1 that can block TNF [26]. Interestingly anti-TNF drugs have been shown to
aggravate or even promote the onset of MS, but conversely have been successful in the
treatment of other AID where there is no association, including RA. The TNFRSF1A variant
mimics the effect of anti-TNF therapies and suggests their mechanism of action in
ameliorating disease, showing how insight can be gained by comparing genetic loci across
related AID and determining the functional consequences.
6.0 Disease prediction and stratified medicine
11
One of the objectives of complex disease genetics is the ability to predict an individual’s risk
of developing disease with a view to development of interventions such as lifestyle
modifications, close monitoring or the use of therapies to delay or prevent onset. Statistical
models incorporating RA genetic loci into genetic risk scores (GRS) have been developed to
predict who is at risk of developing disease, with limited success (Area under the curve
(AUC) 0.72-0.79), although this is increased in males who smoke (AUC 0.86) [27, 28].
Given the low population prevalence of RA, it is unlikely that general population screening
would be of use; instead such models would be targeted to populations of individuals with
other risk factors such as first degree relatives of patients with RA.
In some areas such as cancer, knowledge gained from genetics has huge potential to inform
personalised medicine, where healthcare can be tailored to each individual patient based on
predictive markers, allowing the most effective drug to be selected for each individual. In the
case of a complex, polygenic disease such as RA, with hundreds of contributing genetic
factors, medicines targeted to the individual may be overly ambitious, however the
identification of genetically defined sub-groups of patients, the so-called stratified medicine
approach is more likely to be attainable.
Indeed we can already begin to sub-divide RA based on the presence of ACPA, and many
people consider ACPA positive and ACPA negative RA to be two distinct diseases [29-31].
Genetic studies to date have focused on ACPA positive individuals, although ACPA negative
individuals make up between 50 and 10% of the RA patients depending on their stage in
disease [32]. A recent study of ACPA negative patients identified associations in the HLA
region, accounted for by two amino acid positions, position 11 at HLA-DRB1 and position 9
at HLA-B, both of which have previously been associated with ACPA positive disease.
However, interestingly, the specific amino acid residues at each position conferring risk were
different, with a serine residue at position 11 HLA-DRB1 conferring risk of ACPA negative
12
disease, but being protective against ACPA positive disease [33]. The high heritability of
ACPA negative disease [34] and the low number of associated loci [35] highlights the need
for larger studies to identify seronegative specific loci.
Stratified medicine could also Identify groups of patients more or less likely to respond to
treatment or develop an adverse reaction, for example, testing for thiopurine
methyltransferase (TMPT) gene polymorphisms before beginning azathioprine therapy is
now included in the prescribing guidelines from the British Society of Rheumatology (BSR)
and the US Food and Drug Administration (FDA). Individuals with genetic polymorphisms
in TMPT exhibit reduced/deficiency of the enzyme which metabolises 6-mercaptopurine, a
product of azathioprine, which can lead to toxicity [36]. Knowing the genotype at this locus
can help clinicians to determine the most appropriate course of treatment for the patient.
It is well known that in order to reduce joint damage and prevent disability it is important to
treat patients as soon as possible, indeed it has been shown that the introduction of early
effective treatment can markedly improve long term outcomes [37-39]. However there are
still no true reliably validated biomarkers of treatment response and therefore treatments are
prescribed in a process of first line and second line therapies representing a trial and error
strategy.
The majority of genetic studies in RA have focused on susceptibility however some work on
treatment response has been carried out. To date only two genetic associations with treatment
response have been identified, and neither hold enough predictive value to be informative in
clinic; Firstly a variant in the CD84 gene (which encodes leukocyte differentiation antigen
CD84) has been found to be predictive of response to etanercept (an anti TNF inhibitor) [40].
Secondly, the PDE3A-SLCO1C1 locus has been associated with response to the TNF
inhibitors etanercept, infliximab and adalimumab [41].
13
7.0 Identifying RA target genes
Although over 100 susceptibility loci have been identified for RA the task remains to
determine their functional effects and how they contribute to disease before their potential
benefit as a therapeutic target can be fully realised.
For only a handful of RA loci have the functional variants of disease genes been fully
characterised, as has been found in many other complex autoimmune diseases, the majority
of susceptibility variants identified reside outside the coding regions of the genome, and
therefore their function is as yet unknown. Although variants may lie in intergenic or intronic
regions, they can still affect gene regulation, transcription or splicing events, and evidence
suggests that some regions of the genomes may be transcribed and not translated for example
regions encoding microRNA (miRNA) or long non coding RNAs (lncRNAs), which can in
turn affect gene regulation. An example of this is a non-coding variant associated with
plasma low density lipoprotein levels and myocardial infarction [42]. The SNP was found to
create a transcription factor binding site altering the expression of the SORT1 gene, which
directly altered plasma levels of LDL, demonstrating that non-coding variants can have a
direct impact on a disease phenotype.
Integration of GWAS results and gene expression data has shown that disease associated loci
are enriched for expression quantitative trait loci (eQTLS), suggesting they have a regulatory
role [43-46]. Many eQTLs act universally across tissues; however some are cell type specific
and require specific stimulatory conditions. It will be crucial to identify the cell type in
which disease associated loci are exerting their effects as many regulatory mechanisms are
cell type specific. The investigation of the innate immune cells, monocytes and dendritic
cells, in two separate studies, identified eQTLs that were only revealed after specific
14
stimulations [47, 48]. One study stimulated monocytes with lipopolysaccharide (LPS), or
interferon γ (IFNγ) [47] , while a second stimulated dendritic cells with E.coli, LPS, influenza
or interferon-β (IFNβ) [48], and measured the resulting gene expression. The patterns of
eQTLs were not only cell type specific but interestingly were dependent on the cell
stimulation and were termed response QTLs (reQTLs). Both studies showed an enrichment of
reQTLs at GWAS loci. These studies add a further level of complexity to understanding the
regulatory role of disease associated variants, illustrating that not only must we identify the
critical cell types, but some eQTLs may only be revealed after identification of the relevant
stimulus.
It is well known that in the past disease associated regions of the genome were assigned
candidate genes based on the gene which is closest to the associated variants, or the gene in
the region with the most biologically plausible role given the phenotype. Candidate genes are
now assigned to variants based on biological omics evidence such as the presence of
regulatory regions, binding sites and chromatin state. However this designation is constrained
to some extent by our knowledge of genomics. It has been demonstrated through functional
studies that the gene closest to the associated genetic variant is not always the target; indeed
variants can have effect in both cis and trans on genes many megabases away. A region on
chromosome 16p13 has been associated with several autoimmune diseases including T1D
and MS, the disease associated variants are located within the c-type lectin domain family 16
(CLEC16A) gene, and were assumed to exert their function on the gene in which they reside.
However functional studies found that an intron of the CLEC16A gene, containing disease
associated variants, was acting as a regulatory sequence for the neighbouring gene
dexamethasone induced protein (DEXI). The disease alleles that conferred protection against
T1D and MS were correlated with an increase in the expression of DEXI in monocytes.
Furthermore chromatin conformation capture (3C) experiments determined a physical
15
interaction between the promoter of DEXI and the intron of CLEC16A containing risk
variants, demonstrating that the DNA had looped out in order to allow these regions to be in
close proximity [49].
A recent study attempted to determine the target genes of disease associated variants using
chromatin conformation capture technology (Capture Hi-C) [50-52], linking disease SNPs
with the disease causing genes [53]. The 3d conformation of the genome means that it is
possible for sections of DNA many megabases apart, or even on different chromosomes, to
interact. Hi-C uses formaldehyde to fix the conformation of DNA in a specific cell at a
specific time point, the crosslinked DNA is then digested and re-ligated so that only
fragments of DNA which were covalently liked form ligation products. A biotin label is
incorporated into the ligation product allowing the products to be selected for and then
sequenced. Capture Hi-C, as used in the Martin et al study utilised this technique in two
complementary experiments, firstly using region capture targeting regions associated with
disease (rheumatoid arthritis (RA), juvenile idiopathic arthritis (JIA), type 1 diabetes (T1D)
and psoriatic arthritis (PsA)) and secondly promoter capture, capturing all gene promoters
within 500kb of each disease associated variant. This technique identifies physical
interactions between the selected disease associated fragments (region capture) or promoter
fragments (promoter capture) and the rest of the genome. The study investigated chromatin
interaction in both T and B cell lines and found that the majority of interactions were cell
type specific with only 20% found in both cell lines.
Several of the confirmed interactions further demonstrated the importance of functional
studies to determine where the SNP of interest is exerting its effect, as SNPs were shown to
interact with distant promoters, sometimes over a megabase away. For example SNPs
proximal to the EOMES gene associated with RA had a physical interaction spanning 640kb
with the promoter of AZI2 involved in NFKB activation. Additionally SNPs within the
16
COG6 gene associated with RA and JIA also demonstrated long range interaction with the
promoter of FOXO1 over 1Mb away.
Interestingly the authors also found that loci from different AID which would typically be
classed as separate loci may act on the same promoter. Using the traditional method of
assigning the nearest candidate gene, loci where the lead disease associated SNPs from two
AID map some distance apart would therefore be assigned to different candidate genes.
SNPs associated with PsA located within the DENND1B gene physically interacted with a
region in the PTPRC gene over 1Mb away, involved in T and B cell signalling, which is also
associated with RA. A further example showed interaction between an enhancer region in
RAD51B containing RA associated variants and the promoter of ZFP36L1 containing JIA
associated variants, this interaction was only observed in B cells which is appropriate as
ZFP36L1 encodes a transcription factor associated with B cell differentiation to plasma cells.
The study has further demonstrated the complex regulatory landscape of the genome where
promoters can interact with many different enhancers and vice versa. The way we think
about disease associated loci and how we often speculate on their mechanisms must be
broadened to incorporate the bigger picture. SNPs do not need to be in close proximity or in
LD to expend their effects on the same gene. Further investigating the key findings of this
study will be pivotal in our understanding of the role of genetics in RA.
8.0 Conclusion
We have presented evidence to support the use of genetics in the development of new
therapeutics for RA. Only when the true target of a disease associated variant has been
determined and characterised within the relevant cell type under the appropriate stimulatory
17
conditions can pathway based approaches to inform therapeutic target investigation be used
to best effect.
9.0 Expert opinion
Although huge advances have been made in RA genetics, a substantial amount of work is still
required in order to better understand how these disease associated loci contribute to
pathogenesis. Experiments investigating each disease associated variant in specific cell types
under different environmental conditions will be essential to determine the way in which such
variants influence the disease process. A thorough investigation of the regulatory role of
these loci should then be undertaken. This will be challenging as experiments in many cell
types will be required with data collected over time course experiments and under different
stimulatory conditions to reveal potential eQTLs and reQTLs. Identification of the relevant
cell type will allow further more complex functional experiments which can incur substantial
financial costs, including experiments such as RNA-seq, chromatin conformation capture
experiments, chromatin immunoprecipitation (CHIP) assays and luciferase reporter assays.
Once the causal genetic variant is known and we have determined the relevant cell type, the
regulated target gene and how the variant works mechanistically (e.g. up or down regulate),
the plausibility of the locus as a drug target should become apparent. We believe that our
knowledge of the genetic component of RA holds great potential for the development of new
therapeutics. Indeed we have discussed the advantages of having genetic evidence for a
target, and it is hoped that conducting these functional experiments will elucidate
mechanisms which will provide biological validation of a gene as a drug target, in so called
natural experiments. The aim will then be to increase collaborations with industrial partners
in order to take these candidates into clinical trials. An advantage of such natural
18
experiments is that it should provide safer targets with a reduced likelihood of toxicity and
adverse events, with an increased likelihood of efficacy, which in turn should reduce attrition
rates. Okada et al have demonstrated, with proof of concept, the potential which lies within
pathway analysis and combining genetic and bioinformatics data[8]
Perhaps one of the most exciting oppourtunities on the horizon is the use of genome editing
technologies where it is relevant to introduce “genetic corrections” into precursor stem cells.
Clustered Regulatory Interspaced Palindromic Repeats (CRISPR) type II system is a
modified bacterial immune system used for genome editing. Which can target specific
modifications to a specific site in the genome allowing single base changes or large insertions
or deletions. Another use of CRIPSR is to use a nuclease dead Cas9 molecule known as dead
Cas9 (dCas9) which cannot cleave DNA. The dCas9 molecule can bind to DNA and can be
tagged with transcriptional activators or repressors, therefore targeting this to a promoter or
enhancer sequence can act to enhance or repress gene transcription. CRISPR has the
capability to introduce single base changes into the human germline or act as a novel
therapeutic to increase or decrease gene expression. There is enormous potential for this
technology to revolutionise the treatment or even cure monogenic diseases. For example
CRISPR has been used in mice with a dominant mutation in the Crygc gene which causes
cataracts. [54]. In humans, CRISPR has been used to correct a mutation in the CFTR gene
causing cystic fibrosis (CF) [55]. These examples provide proof of concept for gene
correction in single gene disorders. Although RA is a complex disease with many genetic
loci, we have shown that perturbation of a single locus is significant for treatment of the
disease. CRISPR is currently in its infancy and more understanding is needed about the
potential for off target effects and the specificity of the DNA repair stage. Currently CRISPR
provides an exciting new tool for in vitro studies aimed at understanding how disease
associated variants or combinations of variants work at a cellular level, but excitingly for
19
many diseases it immediately offers revolutionary new opportunities for therapies the
possibilities of which only arise following the elucidation of the genetic mechanism
underpinning disease.
Article highlights box
Drug molecules with genetic support are more likely to progress to phase II clinical
trials.
Proof of concept for the utility of genetics in drug development lies within current
approved RA therapies targeting susceptibility loci; e.g. Tocilizumab and IL6R.
Determining the mechanistic action of GWAS hits and their contribution to disease
pathogenesis through functional experiments will be critical to identify potential
therapeutic targets.
Not only can genetics provide novel, or re-positioned targets for RA, by fully
understanding the mechanistic action of genetic variants we can model the effects of
modifying a target within patients. This then provides both the magnitude and
direction (agonist/antagonist) that the therapy must mimic to be effective.
In the future genome editing techniques such as CRISPR may represent an alternative
to therapeutic intervention.
Figure 1: Pipeline utilising genetics to identify therapeutic targets
Reference List
20
1. Barrera P, van der Maas A, van Ede AE, et al. Drug survival, efficacy and toxicity of monotherapy with a fully human anti-tumour necrosis factor-alpha antibody compared with methotrexate in long-standing rheumatoid arthritis. Rheumatology (Oxford) 2002;41(4):430-9
2. Dixon WG, Hyrich KL, Watson KD, et al. Drug-specific risk of tuberculosis in patients with rheumatoid arthritis treated with anti-TNF therapy: results from the British Society for Rheumatology Biologics Register (BSRBR). Ann Rheum Dis 2010;69(3):522-8
3. Coenen MJ, Toonen EJ, Scheffer H, et al. Pharmacogenetics of anti-TNF treatment in patients with rheumatoid arthritis. Pharmacogenomics 2007;8(7):761-73
4. Nelson MR, Tipney H, Painter JL, et al. The support of human genetic evidence for approved drug indications. Nat Genet 2015;47(8):856-60
*This paper provides support for the use of genetics in drug discovery
5. Cook D, Brown D, Alexander R, et al. Lessons learned from the fate of AstraZeneca's drug pipeline: a five-dimensional framework. Nat Rev Drug Discov 2014;13(6):419-31
6. Eyre S, Bowes J, Diogo D, et al. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat Genet 2012;44:1336-40
7. McAllister K, Yarwood A, Bowes J, et al. Identification of BACH2 and RAD51B as rheumatoid arthritis susceptibility loci in a meta-analysis of genome-wide data. Arthritis Rheum 2013;65:3058-62
8. Okada Y, Wu D, Trynka G, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 2014;%20;506(7488):376-81
** A large scale GWAS and meta analysis of RA, which utilises a bioinformatic pipeline
to combine GWAS results with biological and pathway data. The authors
conduct a high throughput drug screen to identify drug targets from GWAS
results providing proof of concept for the utility of genetics in drug discovery.
9. Barski A, Cuddapah S, Cui K, et al. High-resolution profiling of histone methylations in the human genome. Cell 2007;129(4):823-37
10. Messemaker TC, Huizinga TW, Kurreeman F. Immunogenetics of rheumatoid arthritis: Understanding functional implications. J Autoimmun 2015;64:74-81
21
11. Li G, Diogo D, Wu D, et al. Human genetics in rheumatoid arthritis guides a high-throughput drug screen of the CD40 signaling pathway. PLoS Genet 2013;9:e1003487
* A systematic study of the CD40 locus associated with RA. The authors use eQTL and
pertubation studies to show direct correlation between associated variants, gene
expression and cell surface protein expression. A high throughput drug screen
identifies potential therapeutic inhibitors.
12. Nishimoto N. Interleukin-6 in rheumatoid arthritis. Curr Opin Rheumatol 2006;18(3):277-81
13. Smolen JS, Weinblatt ME, Sheng S, et al. Sirukumab, a human anti-interleukin-6 monoclonal antibody: a randomised, 2-part (proof-of-concept and dose-finding), phase II study in patients with active rheumatoid arthritis despite methotrexate therapy. Ann Rheum Dis 2014;73(9):1616-25
14. Shaw MH, Boyartchuk V, Wong S, et al. A natural mutation in the Tyk2 pseudokinase domain underlies altered susceptibility of B10.Q/J mice to infection and autoimmunity. Proc Natl Acad Sci U S A 2003;100(20):11594-9
15. Plenge RM, Scolnick EM, Altshuler D. Validating therapeutic targets through human genetics. Nat Rev Drug Discov 2013;12(8):581-94
16. Avorn J. The $2.6 billion pill--methodologic and policy considerations. N Engl J Med 2015;372(20):1877-9
17. Brown MS, Goldstein JL. Expression of the familial hypercholesterolemia gene in heterozygotes: mechanism for a dominant disorder in man. Science 1974;185(4145):61-3
18. Rader DJ, Cohen J, Hobbs HH. Monogenic hypercholesterolemia: new insights in pathogenesis and treatment. J Clin Invest 2003;111(12):1795-803
19. Abifadel M, Varret M, Rabes JP, et al. Mutations in PCSK9 cause autosomal dominant hypercholesterolemia. Nat Genet 2003;34(2):154-6
20. Cohen J, Pertsemlidis A, Kotowski IK, et al. Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9. Nat Genet 2005;37(2):161-5
21. Kotowski IK, Pertsemlidis A, Luke A, et al. A spectrum of PCSK9 alleles contributes to plasma levels of low-density lipoprotein cholesterol. Am J Hum Genet 2006;78(3):410-22
22. Cohen JC, Boerwinkle E, Mosley TH, Jr., Hobbs HH. Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med 2006;354(12):1264-72
22
23. Maxwell KN, Breslow JL. Adenoviral-mediated expression of Pcsk9 in mice results in a low-density lipoprotein receptor knockout phenotype. Proc Natl Acad Sci U S A 2004;101(18):7100-5
24. Park SW, Moon YA, Horton JD. Post-transcriptional regulation of low density lipoprotein receptor protein by proprotein convertase subtilisin/kexin type 9a in mouse liver. J Biol Chem 2004;279(48):50630-8
25. Stein EA, Mellis S, Yancopoulos GD, et al. Effect of a monoclonal antibody to PCSK9 on LDL cholesterol. N Engl J Med 2012;366(12):1108-18
26. Gregory AP, Dendrou CA, Attfield KE, et al. TNF receptor 1 genetic risk mirrors outcome of anti-TNF therapy in multiple sclerosis. Nature 2012;488(7412):508-11
27. Scott IC, Seegobin SD, Steer S, et al. Predicting the risk of rheumatoid arthritis and its age of onset through modelling genetic risk variants with smoking. PLoS Genet 2013;9:e1003808
* uses genetics to attempt to predict RA
28. Yarwood A, Han B, Raychaudhuri S, et al. A weighted genetic risk score using all known susceptibility variants to estimate rheumatoid arthritis risk. Ann Rheum Dis 2013;
* uses genetics to attempt to predict RA
29. Daha NA, Toes RE. Rheumatoid arthritis: Are ACPA-positive and ACPA-negative RA the same disease? Nat Rev Rheumatol 2011;7:202-3
30. van der Helm-van Mil AH, Huizinga TW. Advances in the genetics of rheumatoid arthritis point to subclassification into distinct disease subsets. Arthritis Res Ther 2008;10:205
31. Barrera P, van der Maas A, van Ede AE, et al. Drug survival, efficacy and toxicity of monotherapy with a fully human anti-tumour necrosis factor-alpha antibody compared with methotrexate in long-standing rheumatoid arthritis. Rheumatology (Oxford) 2002;41(4):430-9
32. Viatte S, Plant D, Han B, et al. Association of HLA-DRB1 haplotypes with rheumatoid arthritis severity, mortality, and treatment response. JAMA 2015;313(16):1645-56
33. Han B, Diogo D, Eyre S, et al. Fine Mapping Seronegative and Seropositive Rheumatoid Arthritis to Shared and Distinct HLA Alleles by Adjusting for the Effects of Heterogeneity. Am J Hum Genet 2014;94:522-32
34. van der WD, Houwing-Duistermaat JJ, Toes RE, et al. Quantitative heritability of anti-citrullinated protein antibody-positive and anti-citrullinated protein antibody-negative rheumatoid arthritis. Arthritis Rheum 2009;60:916-23
23
35. Viatte S, Plant D, Bowes J, et al. Genetic markers of rheumatoid arthritis susceptibility in anti-citrullinated peptide antibody negative patients. Ann Rheum Dis 2012;71:1984-90
36. Reuther LO, Vainer B, Sonne J, Larsen NE. Thiopurine methyltransferase (TPMT) genotype distribution in azathioprine-tolerant and -intolerant patients with various disorders. The impact of TPMT genotyping in predicting toxicity. Eur J Clin Pharmacol 2004;59(11):797-801
* an example of where genetics is used in clinic to inform treatment
37. Finckh A, Liang MH, van Herckenrode CM, de PP. Long-term impact of early treatment on radiographic progression in rheumatoid arthritis: A meta-analysis. Arthritis Rheum 2006;55(6):864-72
38. Scire CA, Verstappen SM, Mirjafari H, et al. Reduction of long-term disability in inflammatory polyarthritis by early and persistent suppression of joint inflammation: results from the Norfolk Arthritis Register. Arthritis Care Res (Hoboken ) 2011;63(7):945-52
39. Scire CA, Lunt M, Marshall T, et al. Early remission is associated with improved survival in patients with inflammatory polyarthritis: results from the Norfolk Arthritis Register. Ann Rheum Dis 2014;73(9):1677-82
40. Cui J, Stahl EA, Saevarsdottir S, et al. Genome-wide association study and gene expression analysis identifies CD84 as a predictor of response to etanercept therapy in rheumatoid arthritis. PLoS Genet 2013;9(3):e1003394
41. Acosta-Colman I, Palau N, Tornero J, et al. GWAS replication study confirms the association of PDE3A-SLCO1C1 with anti-TNF therapy response in rheumatoid arthritis. Pharmacogenomics 2013;14(7):727-34
42. Musunuru K, Strong A, Frank-Kamenetsky M, et al. From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature 2010;466:714-9
* an example of a non-coding variant directly affecting disease phenotype
43. Fairfax BP, Makino S, Radhakrishnan J, et al. Genetics of gene expression in primary immune cells identifies cell type-specific master regulators and roles of HLA alleles. Nat Genet 2012;44:502-10
44. Lango AH, Estrada K, Lettre G, et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 2010;467(7317):832-8
45. Nica AC, Montgomery SB, Dimas AS, et al. Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations. PLoS Genet 2010;6:e1000895
24
46. Nicolae DL, Gamazon E, Zhang W, et al. Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS. PLoS Genet 2010;6:e1000888
47. Fairfax BP, Humburg P, Makino S, et al. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression. Science 2014;343:1246949
* This study highlights the importance of identifying the cell type in which your variant
of interest exerts its effects showing the presence of response QTLs in monocytes.
48. Lee MN, Ye C, Villani AC, et al. Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science 2014;343:1246980
* This study highlights the importance of identifying the cell type in which your variant
of interest exerts its effects showing the presence of response QTLs in dendritic
cells.
49. Davison LJ, Wallace C, Cooper JD, et al. Long-range DNA looping and gene expression analyses identify DEXI as an autoimmune disease candidate gene. Hum Mol Genet 2012;21:322-33
* an example of a long range interaction, where disease suceptibility variants in
CLEC16A act on a neighbouring gene, DEXI.
50. Dryden NH, Broome LR, Dudbridge F, et al. Unbiased analysis of potential targets of breast cancer susceptibility loci by Capture Hi-C. Genome Res 2014;24(11):1854-68
51. Jager R, Migliorini G, Henrion M, et al. Capture Hi-C identifies the chromatin interactome of colorectal cancer risk loci. Nat Commun 2015;19:6178
52. Mifsud B, Tavares-Cadete F, Young AN, et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat Genet 2015;47(6):598-606
53. Martin P, McGovern A, Orozco G, et al. Capture Hi-C reveals novel candidate genes and complex long-range interactions with related autoimmune risk loci. Nat Commun 2015;6:10069
** The first publication to systematically investigate the potential interaction between
the disease associated variants of four autoimmune diseases and their functional target
genes. The study has redefined the way candidate genes are allocated to genetic loci and
25
demonstrated that loci from different disease can interact with a common regulatory
element.
54. Wu Y, Liang D, Wang Y, et al. Correction of a genetic disease in mouse via use of CRISPR-Cas9. Cell Stem Cell 2013;13(6):659-62
** CRISPR is used to correct a mutation inmice which causes cataracts. The
corrected allele was shown to be passes on to offspring. Provides proof of
concept for correction for single gene disorders using CRISPR.
55. Schwank G, Koo BK, Sasselli V, et al. Functional repair of CFTR by CRISPR/Cas9 in intestinal stem cell organoids of cystic fibrosis patients. Cell Stem Cell 2013;13(6):653-8
26