258
DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure BSc (Molecular Genetics) A thesis submitted to the University of Western Australia for the degree of Doctor of Philosophy Centre for Molecular Immunology and Instrumentation 2005

DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

  • Upload
    others

  • View
    13

  • Download
    0

Embed Size (px)

Citation preview

Page 1: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 1 -

DUPLICATION AND POLYMORPHISM WITH

PARTICULAR REFERENCE TO REGULATORS

OF COMPLEMENT ACTIVATION

By

Craig Anthony McLure

BSc (Molecular Genetics)

A thesis submitted to the University of Western Australia for the

degree of Doctor of Philosophy

Centre for Molecular Immunology and Instrumentation

2005

Page 2: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 2 -

Page 3: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 3 -

DEDICATION

This thesis is dedicated to my parents John and Helen.

Page 4: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 4 -

Page 5: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 5 -

SUMMARY

For the convenience of the reader, detailed figures and tables have been enlarged and

compiled in Appendix 2, at the end of this thesis.

This thesis is presented as an approach to identify, annotate and detect genomic

duplication and polymorphism within large genomic regions. To demonstrate this, I

have used as a model, the genomic region known as the Regulators of Complement

Activation (RCA).

The RCA complex is located on the long arm of chromosome 1 at position 1q32 and is a

reservoir of complement regulatory proteins. The genes of the RCA share many

similarities implying that all have arisen through multiple complex duplication events.

My analysis of this region in the following chapters demonstrates the complexity of this

duplication and identifies the many functional units within the RCA.

It was my aim at the beginning of these studies to demonstrate an approach that could

define the Ancestral Haplotypes (AHs) of the RCA gene cluster. To do this, extensive

genomic analysis was required and the ever-increasing availability of genomic sequence

has made this thesis possible. Each of the chapters serves to address the following aims

set out at the beginning of this thesis:

1. Further characterise the relationship between the genes (Complement Control

proteins-CCPs) and domains of the Regulators of Complement Activation

(RCA).

2. Identify and examine the duplicated elements within the RCA.

Page 6: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 6 -

3. Examine the effects of retroviruses and other insertions and deletions (indels) in

generating the divergence of duplicated genes.

4. Investigate the applicability of the Genomic Matching Technique (GMT) to

define AH within the region.

5. Examine association of AHs with CCP implicated diseases.

6. Determine the GMT applicability in non-human species.

In addressing the aims listed above, the following observations were made:

1. Short Consensus Repeats (SCRs) from all human and non-human CCPs can be

broadly classified as belonging to one of eleven subfamilies “a,b,c,d,e,f,g,h,i,j &

k” (Chapter 1 & 3).

2. Subfamilies patterns are maintained throughout CCPs. (Chapter 1, 2 & 3).

3. The association of particular subfamilies forms the functional domains. (Chapter

1, 2 & 3).

a. “ajef”- recognition, “bkd” - function & “ch”- membrane attachment.

4. Complement Receptor 1-like (CR1L) genomic sequence contains at least 5

additional, unreported SCRs (e, f, b, k, d). Chapter 1.

5. Primate Complement Receptor 1 (CR1) and CR1L contain a duplication of eight

rather than seven SCRs (Chapter 2).

a. Discovery of the genomic remnants from twelve previously unidentified

SCRs in human and chimpanzee CR1 and CR1L.

6. CR1 and Complement Receptor 2 (CR2) share an additional set. (Chapter 2).

a. “dg” or “dg-like”

7. The differences between CR1 and CR1L are explained by many complex

duplications, retroviral insertions and SCR deletion. (Chapter 1 and 3).

8. Species have evolved by duplicating critical functional domains (Chapter 3).

Page 7: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 7 -

9. Divergence of duplicated domains adds complexity and functionality (Chapter

3).

10. The GMT can identify genomic polymorphism in the RCA alpha block

(CR1/MCP duplicons), (Chapter 4).

11. RCA alpha block profiles segregate through 3-generations. (Chapter 4).

12. CR1 and CR1L polymorphic amplicons confirmed by sequencing. (Chapter 4).

13. There are at least 20 distinct ancestral haplotypes for the RCA alpha block.

(Chapter 4).

14. Frequency distribution of AHs CR1.02 and CR1.08 differ in Recurrent

Spontaneous Abortion (RSA) and Psoriasis Vulgaris (PV).

15. The GMT is applicable to the DQ alpha and beta genes of the canine MHC and

appear promising in determining parentage, AHs and disease association

(Chapter 5).

This thesis consists of six chapters, each containing a manuscript, which has been

published in, submitted to, or in preparation for an international, peer-reviewed journal.

Each chapter is preceded by an introduction, which serves to link the chapters together

and acknowledge the specific contributions by co-authors and colleagues.

Page 8: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 8 -

Page 9: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 9 -

TABLE OF CONTENTS

TITLE 1

DEDICATION 2

SUMMARY 3

TABLE OF CONTENTS 9

ACKNOWLEDGEMENTS 15

PUBLICATIONS ARISING FROM THIS THESIS 17

DECLARATION 19

ABBREVIATIONS 21

GLOSSARY 25

INTRODUCTION 31

1. Evolution and Diversity

2. The Human Major Histocompatability Complex (MHC),

Polymorphic Frozen Blocks (PFBs) and Ancestral

Haplotypes (AHs).

2.1. MHC Polymorphism and PFBs

2.2. MHC AHs

3. Identifying MHC AHs using Haplospecific Features

3.1. The Genomic Matching Technique (GMT)

3.2. Haplospecific Profiles through Electrophoretic Separation of

Amplicons

3.3. Applications of the Block Profiles and Advantages over

Conventional Sequence Based Typing (SBT)

31

32

33

Page 10: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 10 -

4. The Regulators of Complement Activation (RCA)

4.1. Complement Control Proteins (CCPs)

4.2. Short Consensus Repeats (SCRs)

4.3. SCR combinations of CCP functional domains and their

regulatory mechanisms

4.4. Evolution of the CCPs and SCRs

4.5. PFBs within the RCA

4.5.1. Frozen Blocks

4.5.2. RCA Polymorphism

4.6. RCA disease associations and pathogen interactions

4.6.1. Complement related disease

4.6.2. Pathogen Interactions

5. Major Discoveries

5.1. Evolution of the SCRs and CCPs

5.2. Diversity of duplication – Monomers, Tetramers, Octamers and

Segments

5.2.1. SCR Subfamilies – The monomers

5.2.2. Subfamily relationships – Dimers, Tetramers and Octomers

5.2.3. Segmental Duplications

5.3. Degeneration

5.3.1. Genomic analysis reveals five highly conserved, apparently

untranscribed SCRs of CR1L (Chapter 1)

5.3.2. Genomic analysis reveals a duplication of eight rather than

seven SCRs in primate CR1 and CR1L: evidence for an

additional degenerate subfamily. (Chapter 2)

35

46

Page 11: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 11 -

5.4. Human CR1: A model for CCP evolution

5.5. Divergence – RCA Polymorphism

5.5.1. Single Nucleotide Polymorphisms

5.5.2. Conserved subfamily motifs define the critical SNPs of CR1

5.5.3. SNP Haplotypes

5.5.4. Genomic Matching Technique (GMT) haplotypes

5.5.5. Sequencing Results

6. Other Observations

6.1. DLA DQ and DR Typing in the Canine

6.2. Other Projects

53

CHAPTERS 57

PRELUDE TO CHAPTER 1 59

CHAPTER 1

MS 0014 – McLure, C., R. Dawkins, J.

Williamson, R. Davies, J. Berry, N. Longman-

Jacobsen, R. Laird and S. Gaudieri (2004). "Amino

acid patterns within short consensus repeats define

conserved duplicons shared by genes of the RCA

complex." Journal of Molecular Evolution 59(2):

143-157.

61

PRELUDE TO CHAPTER 2 79

CHAPTER 2 MS 0406 – McLure, C., J. Williamson, B. Stewart,

P. Keating and R. Dawkins (2004). "Genomic

analysis reveals a duplication of eight rather than

seven SCRs in Primate CR1 and CR1L: Evidence

81

Page 12: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 12 -

for an additional set shared between CR1 and CR2."

Immunogenetics 56(9): 631-638.

PRELUDE TO CHAPTER 3 91

CHAPTER 3 MS 0402 – McLure, C., J. Williamson, B. Stewart,

P. Keating and R. Dawkins (2005). "Indels and

imperfect duplication have driven the evolution of

human CR1 and CR1-like from their precursor CR1

alpha: Importance of functional sets." Hum

Immunol 66(3): 258-273.

93

PRELUDE TO CHAPTER 4 111

CHAPTER 4 MS 0408 - McLure, C., J. Williamson, L. Smyth,

S. Agrawal, S. Lester, J. Millman, P. Keating, B.

Stewart, and R. Dawkins (In Press). "Extensive

genomic polymorphism of the CR1 region: RCA

ancestral haplotypes, function and disease"

Immunogenetics

113

PRELUDE TO CHAPTER 5 145

CHAPTER 5 MS 0504 - McLure, C., P. Kesners, S. Lester,

D.Male, C. Amadou, J. Dawkins, B. Stewart, J.

Williamson and R. Dawkins. (In Press).

"Haplotyping of the canine MHC without the need

for DLA typing". Intl J. Immunogenetics

147

PRELUDE TO CHAPTER 6 167

CHAPTER 6 MS 0507 - McLure, C., J. Dawkins, B. Stewart,

and R. Dawkins. (In preparation). "Identification of

169

Page 13: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 13 -

traits and function in livestock by genomic

matching: Genetically determining homozygous and

heterozygous polled cattle".

CONCLUSIONS AND FUTURE DIRECTIONS 183

APPENDICES 189

APPENDIX 1 Sequences submitted to Genbank 191

APPENDIX 2 Enlarged Figures And Tables 217 Chapter 1 – MS0014 Figures And Tables 219

Figure 1- Characteristic amino acid patterns of

eleven SCR groups

Chapter 2 – MS406 Figures And Tables 223 Figure 1 - Relationship of additional SCRs.

Chapter 3 – MS0402 Figures And Tables 227 Figure 1a - Sets of SCR subfamilies reveal

relationships between Complement

Control proteins.

Figure 2a - Multiple duplications and divergence

following segmental duplication of

CR1 and MCP precursors.

Figure 2b –Domain and segmental duplications

within primate CR1 explain the

genomic expansion of CR1 and

CR1L not seen in the rodent.

Figure 3a - Analysis of genomic sequence of

CR1L against CR1.

Figure 4 - Phylogenetic and genomic analyses

define a model for the evolution of

Page 14: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 14 -

Hosa CR1 and CR1L.

Table 1 - SCR alignments from nine SCR

groups found in Crry, CR1 and

CR1L.

Chapter 4 – MS0408 Figures And Tables 241 Figure 1 - Multiple binding and amplification

by primer pairs.

Figure 4 - Segregation of ancestral haplotypes Figure 6 - Sequencing reveals the complexity

of the haplospecific element and

differences between CR1 and CR1-

like.

Figure 7 – Polymorphism within SCR

submfamilies

Chapter 5 – MS0504 Figures And Tables 251 Figure 2 – Canine MHC genotyping report

Chapter 6 – MS 0507 Figures And Tables 255 Figure 1 - Pedigree of 920 Cattle. Figure 2 - Beef Breeder database for

phenotypic and genotypic data management.

BIBLIOGRAPHY 261

EXAMINERS REPORTS 283

RESPONSE TO EXAMINERS REPORTS 297

Page 15: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 15 -

ACKNOWLEDGEMENTS

Special thanks must go to my supervisors; (i) Professor Roger Dawkins for his

expertise, inspirational guidance and continual encouragement, (ii) Dr Brent (Charlie)

Stewart for his veterinarian, computational and statistical expertise and (iii) Dr Silvana

Gaudieri for her tuition in bioinformatics and genetics.

To my once fellow student, good friend and colleague, Dr Joseph Williamson, thanks

for your many contributions to this thesis, the company during those late nights in the

lab and of course the many good laughs. Here’s to many more home brews! To all the

past and present laboratory and administrative staff, fellow students, board members

and associates of the C.Y. O’Connor ERADE Village Foundation and the Centre for

Molecular Immunology and Instrumentation, I thank you all very much.

Special thanks must go to the C.Y. O’Connor ERADE Village Foundation, the

Australian Research Council, Equitec, Genetic Technologies and the University of

Western Australia for providing scholarships, research funding and travel awards.

Finally I would like to acknowledege the love and support of my family and friends. To

my parents, John and Helen, my sister Sherrin, my brother-in-law Brendan and my

girlfriend Emily, I sincerlely thank you all for the many years of support and

encouragement you have provided.

Page 16: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 16 -

Page 17: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 17 -

PUBLICATIONS ARISING FROM THIS THESIS

McLure, C., R. Dawkins, J. Williamson, R. Davies, J. Berry, N. Longman-Jacobsen, R.

Laird and S. Gaudieri (2004). "Amino acid patterns within short consensus repeats

define conserved duplicons shared by genes of the RCA complex." Journal of

Molecular Evolution 59(2): 143-157.

McLure, C., J. Williamson, B. Stewart, P. Keating and R. Dawkins (2004). "Genomic

analysis reveals a duplication of eight rather than seven SCRs in Primate CR1 and

CR1L: Evidence for an additional set shared between CR1 and CR2."

Immunogenetics 56(9): 631-638.

McLure, C., J. Williamson, B. Stewart, P. Keating and R. Dawkins (2005). "Indels and

imperfect duplication have driven the evolution of human CR1 and CR1-like from

their precursor CR1 alpha: Importance of functional sets." Hum Immunol 66(3):

258-273.

McLure, C., J. Williamson, L. Smyth, S. Agrawal, S. Lester, J. Millman, P. Keating, B.

Stewart, and R. Dawkins (In Press). "Extensive genomic polymorphism of the CR1

region: RCA ancestral haplotypes, function and disease". Immunogenetics

McLure, C., P. Kesners, S. Lester, D.Male, C. Amadou, J. Dawkins, B. Stewart,

J.Williamson and R. Dawkins. (In Press). "Haplotyping of the Canine MHC without

the need for DLA typing". Intl J. Immunogenetics

McLure, C., J. Dawkins, B. Stewart, and R. Dawkins. (In preparation). " Identification of

traits and function in livestock by genomic matching: Genetically determining

homozygous and heterozygous polled cattle".

Page 18: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 18 -

Page 19: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 19 -

DECLARATION

I certify that this thesis does not contain material which has been accepted for the award

of any degree or diploma in any university or other institution and, to the best of my

knowledge and belief, contains no material previously published or written by another

person, except where due reference is made in the text.

Craig A McLure BSc (Molecular Genetics) Professor Roger L Dawkins MD, DSc, FRACP, FRCPA, FRCP

Page 20: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 20 -

Page 21: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 21 -

ABBREVIATIONS

AH Ancestral haplotypes

AP Alternative Pathway

APO Apolipoproteins

BAC Bacterial artificial chromosome

Bf Properdin factor B

BLAST Basic local alignment search tool

BMT Bone marrow transplantation

bp Base pair

C2 Complement component 2

C3 Complement component 3

C4 Complement component 4

C4BP Complement component 4 binding protein

CCP Complement control protein

cM Centimorgan

CP Classical pathway

CR1 Complement receptor 1

CR1α Complement Receptor 1 alpha

CR1β Complement Receptor 1 beta

CR1L Complement receptor 1-like

CR2 Complement Receptor 2

Crry Complement regulatory related protein

DAA Decay accelerating activity

DAF Decay accelerating factor

Page 22: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 22 -

DDBJ DNA databank of Japan

DLA Dog leukocyte antigen

DNA Deoxyribonucleic acid

EBV Epstein-Barr virus

e-CR1 Erythrocyte bound complement receptor 1

EH Extended haplotype

EMBL European molecular biology laboratory

GMT Genomic matching technique

HCMV Human cytomegalovirus HCT Haemochromotosis

HERV Human endogenous retrovirus

HES haplospecific electrophoretic signatures

HF Factor H

HFE Haemochromatosis gene

HGE Haplospecific geometric element

HHV-6 Human herpesvirus 6

HIV Human immunodeficiency virus

HLA Human leucocyte antigen

Hosa Homo sapiens (human)

HVS Herpesvirus saimiri

HVSCCPH Herpesvirus saimiri complement control protein homolog

IC Immune complex

IMP Inflammation modulatory protein

Indel Insertion or deletion

kb Kilobase

LD Linkage disequilibrium

Page 23: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 23 -

LHR Long homologous repeat

LINE Long interspersed nuclear element

MA Membrane attached

MAC Membrane attack complex

Mb Megabase

MCP Membrane cofactor protein

MCPα Membrane cofactor protein alpha

MCPL Membrane cofactor protein like

MG Myasthenia gravis

MHC Major histocompatibility complex

mRNA Messenger ribonucleic acid

Mumu Mus musculus

MV Measles Virus

NCBI National centre for biotechnology information

NHP Non human primate

NS Non-synonomous

nt Nucleotide

Pacy Papio Cynocephalus

PAGE Polyacrylamide gel electrophoresis

Paha Papio hamadryas

Patr Pan troglodytes (chimpanzee)

PCR Polymerase chain reaction

PFB Polymorphic frozen block

PH Population haplotype

Popy Pongo pygmaeus (orangutan)

PV Psoriasis vulgaris

Page 24: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 24 -

Rano Rattus norvegicus

RCA Regulators of complement activation

RFLP Restriction fragment length polymorphism

RNA Ribonucleic acid

RSA Recurrent spontaneous abortion

S Soluble

SBT Sequence based typing

SCR Short consensus repeat

SINE Short interspersed nuclear element

SIV Simian immunodeficiency virus SLE Systemic lupus erythematosus

SNP Single nucleotide polymorphism

SPICE Smallpox inhibitor of complement enzymes

SS Sjögren’s Syndrome

Tm Primer melting temperature (0C)

VCP Vaccinia virus complement control protein

VV Vaccinia Virus

WGS Whole genome shotgun

Page 25: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 25 -

GLOSSARY

Allele An alternative form of the gene at a given locus.

Amplicon The product of two primers in a polymerase chain reaction.

Ancestral Haplotype

Highly conserved chromosomal segments carrying specific

combinations of alleles at multiple loci. These are identical in

unrelated individuals indicating their derivation from a

common ancestor. Individual ancestral haplotypes have specific

genomic structures and may carry unique deletions and

duplications.

Centimorgan The unit of measurement for distance and recombinate

frequency on a genetic map

Complement A group of proteins in normal blood serum and plasma that

together form a cascade that results in the destruction of foreign

cells

Complement Control Protein

Genes that regulate and control the activation of complement

and the formation of the complement cascade.

Complotype A combination of alleles of C2, Bf and C4 genes, which are

inherited as a unit.

Electrophoresis A technique that separates charged molecules - such as DNA,

RNA or protein - on the basis of relative migration in an

Page 26: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 26 -

appropriate matrix (such as agarose gel or polyacrylamide gel)

subjected to an electric field

Epistasis/ Epigenetic

The interaction between loci irrespective of whether linked (see

below).

Gene The DNA segment which encodes a protein product.

Genome The complete genetic material contained within a cell nucleus.

Genomic Matching Technique

Technique that generates a haplospecific electrophoretic

signature by amplifying, with a single primer pair, the genetic

differences at multiple loci.

Genotype The genetic makeup of an individual.

Haplospecific Characteristically specific for a haplotype.

Haplotype Combination of alleles at several linked loci, which are

inherited en bloc from a parent.

Haplotypic Characteristic, but not specific for a haplotype.

Interspersed repetitive DNA sequences

Accounts for > 20% of the human genome, Principal families

include, Alus, L1s, LTRs, MERs, and HERV elements.

L1 Long interspersed nuclear elements (~9kb) that evolved before

the emergence of primates. Although these constitute 15% of

the human genome, most are highly truncated.

Page 27: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 27 -

Linkage The coexistence of genes inherited together as a result of their

location on the same chromosome. It has been measured by the

percent recombination between loci.

Linkage disequilibrium

Statistical calculation of observed combinations of genetic

markers which occur more, or less frequently in the population

than would be expected from their individual gene frequencies.

It can result from a founder effect, in which there has been

insufficient time to reach equilibrium since one of the markers

was introduced into the population, or from reduced

recombination in the region between the loci.

Locus The specific physical location of a gene on a chromosome

Long Homologous Repeat

A reiterated unit that contains multiple internal domains. In

CR1 the LHR contains at least seven SCRs.

Mastitis An inflammation of the mammary gland (or glands), usually

caused by bacteria.

MHC Region of approximately 4Mb on human chromosome 6p21.3

which includes many immunoregulatory genes.

MHC beta block PFB that contains HLA-A, B and MIC

MHC delta block PFB that contains DQ, DR and DP

Microsatellites Simple, internal repetitive sequence of 1-5bp. Mutations that

vary the number of repetitive elements are more frequent than

Page 28: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 28 -

in other DNA and hence microsatellites are highly

polymorphic. They are abundantly distributed over the

euchromatic part of the genome.

Multiplex The amplification of multiple unlinked loci in a single PCR

reaction by combining several primer pairs.

Mutation Heritable changes in the genetic material.

Pseudogene A DNA sequence which resembles a gene but which has been

inactivated by mutation so that it cannot produce a functional

product.

Phenotype Expressed characteristic, which is genetically determined.

Phylogeny Classification of organisms designed to reflect the sequence in

which they evolved, and their genetic relationships

Polling The phenotypic absence of horns

Polyacrylamide Gel matrix used for the electrophoretic separation of DNA.

Polymerase Chain Reaction

The exponential amplification of specific genomic elements.

Polymorphism Classically, phenotypic variation occurring with a frequency

>1% in the population and not lost by recombination.

Polymorphic Frozen Blocks

Polymorphic genomic segments faithfully inherited through

multiple generations as a complete unit and not split during

chromosomal rearrangements

Page 29: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 29 -

Regulators of Complement Activation

Cluster of complement regulatory proteins located in humans at

1q32. Includes HF, C4BP, DAF, CR2, CR1, MCP and their

respective gene copies

Retrosequence Retroelement

Definable nucleic acid sequence inserted and amplified by

reverse transcriptase

Retrovirus An RNA virus that propagates by conversion into duplex DNA

by reverse transcriptase (a RNA-dependent DNA polymerase)

an example is HIV.

RFLP Restriction fragment length polymorphism - differences in the

pattern of DNA fragmentation from two haplotypes after

digestion with the same restriction enzyme and visualisation by

DNA-DNA hybridisation.

Sequence Based Typing

The detection of alleles by DNA sequencing

Short Consensus Repeat

Short reiterated protein domain, approximately 60 amino acids

in length and containing the defining amino acid motif

C…C…C…W…C.

Subfamily Sub classification that furhter defines the ancestral relationship

between SCRs

Page 30: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 30 -

Page 31: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 31 -

INTRODUCTION

Preface

The following introduction provides background information and develops the

concepts, which form the basis of this thesis. The data, analyses and conclusions

outlined in the preceding summary are addressed in detail in the following chapters.

Introduction

1. Evolution and Diversity

Of all the mechanisms implicated in evolution and the generation of diversity,

duplication, insertions and deletions (indels) are now recognised to be of major

importance. For example, there is a clear relationship between copy number and

polymorphism when genes within the Major Histocompatability Complex (MHC) are

compared (Dawkins et al. 1999) and the same may be true generally. The role of indels

in generating diversity is less well known but many examples have been demonstrated

(Deininger et al. 2003; Gaudieri et al. 1997a; Hughes and Coffin 2001; Kulski et al.

1997). The thoughts and interpretations presented in this thesis are derived from the

many concepts established during the extensive studies of the Human MHC by this

department and others.

Page 32: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 32 -

2. The Human Major Histocompatability Complex, Polymorphic

Frozen Blocks and Ancestral Haplotypes

The MHC is the most characterised region of the human genome and is a reservoir of

clinically important genes (Dawkins et al. 1999). Analysis of MHC has revealed

extensive polymorphism occurring within localised clusters of several hundred

kilobases (kb). We refer to these regions as Polymorphic Frozen Blocks (PFBs).

2.1 MHC Polymorphism and PFBs

Previous analysis of the MHC has shown it to be the most polymorphic region of the

entire genome, with approximately 10% nucleotide difference observed at certain

regions (Gaudieri et al. 2000; Guillaudeux et al. 1998; Horton et al. 1998; Satta et al.

1998). This is a significant increase when compared to the 0.08 to 0.2% average

observed in the remainder of the genome (Li and Sadler 1991; Horton et al. 1998; Lai et

al. 1998; Rowen et al. 1996; Satta et al. 1998). Most of this polymorphism is

concentrated within localised regions or blocks, usually several hundred kb long

(Campbell and Trowsdale 1993; Klein and Takahata 1990; Tokunaga et al. 1988; Zhang

et al. 1990). The blocks are apparently frozen as recombination is not seen within them,

hence the reason for the term “PFBs” (Dawkins et al. 1991; Klein et al. 1991; Marshall

et al. 1993). The polymorphism observed within these PFBs is varied and complex.

Examples of the complexity include combinations of gene copy number, indel content

and single nucleotide differences at each of the many tightly linked loci (Campbell and

Trowsdale 1993; Leelayuwat et al. 1994; Tokunaga et al. 1988; Zhang et al. 1990). The

possible number of haplotypes for each PFB is therefore potentially quite large

Page 33: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 33 -

(multiplication of the alleles from each loci). This group (Dawkins et al. 1983; Degli-

Esposti et al. 1992c) along with others (Awdeh et al. 1983; Yunis et al. 2003) have

shown that this is not the case and that specific haplotypes (combinations of alleles)

occur more frequently than expected within a population, while others are not seen at

all. Interestingly, many groups have now shown that most of the MHC associated

diseases can be explained by a few of these specific haplotypes (Alper 1998; Dawkins

et al. 1999; Dawkins et al. 1983; Degli-Esposti et al. 1992a; Degli-Esposti et al. 1992b;

Yunis et al. 2003).

2.2 MHC Ancestral Haplotypes

Recombination in the MHC is therefore limited to the regions between the PFBs, but

this too is at a significantly decreased level to that observed throughout the rest of the

genome (Dawkins et al. 1991; Klein et al. 1991). The low rate of recombination has

seen many specific combinations or haplotypes of PFBs being conserved through many

generations. These combinations are referred to as Ancestral (AH) (Dawkins et al.

1983; Degli-Esposti et al. 1992c) or Extended Haplotypes (EH) (Awdeh et al. 1983).

Recombination occurring between the PFBs has resulted in shuffling. This has caused

the AHs to split, thereby creating further specific Population Haplotypes (PHs)

(Gaudieri et al. 1997b).

3 Identifying MHC AHs using haplospecific features

As previously mentioned, each MHC AH contains a unique genomic structure

(Dawkins et al. 1989; Tokunaga et al. 1988; Zhang et al. 1990) and sequence (Abraham

Page 34: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 34 -

et al. 1991; Abraham et al. 1992; Wu et al. 1992). Thus by detecting the allele

combinations at highly conserved, polymorphic loci within each PFB, we can identify

the AH. This department has invented the Genomic Matching Technique (GMT) to do

so.

3.1 The Genomic Matching Technique (GMT)

The GMT is a patented diagnostic technique that was designed and developed within

this department (Dawkins & Abraham, US# 6,383,747). The novelty of the GMT is that

it utilises polymorphic haplospecific elements, present in at least 2 duplicons, to

generate multiple amplicons with a single primer pair. The primers are designed to bind

and amplify within the conserved flanking region of each polymorphic haplospecific

element and generate multiple products. The reaction effectively works as a multiplex

Polymerase Chain Reaction (PCR) by generating multiple amplicons of variable

lengths. The results however are more informative than a standard multiplex; this is

because the amplicon combinations represent alleles of tightly linked loci that have

arisen through duplication of a common ancestral segment and since diverged in line

with the specific coding and non-coding differences within each AH. The differences

between each locus are therefore comparable. In the MHC, the term Haplospecific

Geometric Elements (HGE) was introduced to describe those haplospecific elements

that contained unique geometric patterns of repeat combinations (Abraham et al. 1992).

The AHs are defined through the electrophoretic separation of amplified HGEs.

3.2 Haplospecific Profiles through Electrophoretic Separation of Amplicons

Page 35: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 35 -

The combination of amplicons produces an electrophoretic profile (Ketheesan et al.

1999; Leelayuwat et al. 1994). The profiles mark haplotypes of coding and non-coding

sequences spanning hundreds of kb (Gaudieri et al. 2000). Segregation of the profiles

through a 3-generation family reveals linkage associations of the individual components

of the profile. The combinations of components that segregate together are referred to as

haplospecific electrophoretic signatures (HES) or block profiles.

3.3 Applications of the Block Profiles and Advantages Over Conventional Sequence

Based Typing (SBT)

It has been shown by this department (Tay et al. 1995a) and others (Witt et al. 2000)

that matching of the MHC beta and delta block GMT profiles significantly increases the

likelihood of successful Bone Marrow Transplantation (BMT) in unrelated individuals.

The reason for this is that as previously mentioned, the profile acts as a marker for the

entire PFB. Therefore, matching profiles indicate that the unrelated individuals share the

same AH for that block (Tay et al. 1995b). The GMT is an easier, more accurate and

cheaper alternative to determine AHs than the SBT diagnostic methods currently used.

With the intention of establishing whether the MHC is representative of the genome

generally, other regions should be studied and compared. To do this I have used the

Regulators of Complement Activation (RCA) gene cluster.

4. The Regulators of Complement Activation (RCA)

Page 36: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 36 -

Located on the long arm of human chromosome 1 (1q32) is the cluster of genes known

as the Regulators of Complement Activation (RCA) (Heine-Suner et al. 1997; Reid

1986; Rodriguez de Cordoba et al. 1985; Weis et al. 1987). Rodriguez de Cordoba and

colleagues (Rodriguez de Cordoba et al. 1999) have shown that the human RCA gene

cluster, spanning a total of 21.45 cM, contains 61 genes of which 15 are complement

related. The genes of the RCA share many similarities implying that all have arisen

through multiple complex duplication events (Hourcade et al. 1989; Krushkal et al.

2000; McLure et al. 2004a).

4.1 Complement Control Proteins (CCPs)

The RCA genes belong to the CCP family (Reid 1986). The role of CCPs is to prevent

the uncontrolled activation of complement and to assure that damage to autologous cells

is avoided (Krushkal et al. 2000). CCPs regulate complement activation by either (i)

acting as a cofactor for the factor I mediated cleavage of C3b and C4b therefore

inhibiting the binding of Comploment Component 2 (C2) and Properdin Factor (Bf) and

the formation of C3b/C2 and C4b-fB activated complexes or (ii) by accelerating the

decay of those activated complexes that have already formed. The RCA CCPs can be

divided into two groups, the soluble (S) plasma proteins and the membrane-attached

(MA) proteins. The S proteins include C4-Binding Protein (C4BPA and C4BPB) and

Factor H (HF1) while the MA proteins are Membrane Cofactor Protein (MCP),

Complement Receptor 1 (CR1), Complement Receptor 2 (CR2) and Decay Accelerating

Factor (DAF) (Bora et al. 1989; Carroll et al. 1988; Pardo-Manuel et al. 1990; Rey-

Campos et al. 1988). Duplicate copies of these genes have been identified throughout

the cluster but the majority of these are either pseudogenes or genes of unknown

Page 37: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 37 -

function. (Note: The term CCP is used by some researchers to define what I define as

the SCRs or sushi domains.)

4.2 Short Consensus Repeats (SCRs)

The CCP family is defined by the presence of reiterated protein domains known as

SCRs. SCRs typically contain 56-70 amino acids, including four Cysteines and a

Tryptophan (C…C…C…W…C) (Hourcade et al. 1989; Reid 1986). These are essential

for holding the domain in its rigid triple loop structure (Hourcade et al. 1989;

Schwarzenbacher et al. 1999). The SCRs found in CCPs carry a variety of functions,

including protein binding and cofactor activity in complement regulation (Reviewed by

(Hourcade et al. 1989; Reid and Day 1989). Interestingly, several groups have now

demonstrated that it is the involvement of multiple domains, rather than individual

SCRs that determine these functions (Krushkal et al. 2000; McLure et al. 2005). The

modification or loss of these domains results in a modification to (or loss of) that

particular function.

4.3 SCR combinations of CCP functional domains and their regulatory mechanisms

Complement activation is regulated at several steps to prevent uncontrolled activation

and to ensure that damage to autologous cells is avoided. The table below summarises

the reviews by Kirkitadze (Kirkitadze and Barlow 2001) and Krushkal (Krushkal et al.

2000), which discuss the mechanisms by which individual CCPs regulate complement

activation. The data include the following:

1. Name of the gene (as commonly abbreviated)

2. Physical state – soluble (S) or membrane–attached (MA)

3. The SCR combinations within CCPs that form the functional domains

Page 38: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 38 -

4. The subfamilies of the SCRs that form the functional domain (McLure et al.

2004a)-Chapter 1

5. The domains regulatory mechanism and/or complement interaction

Table 1. Functional Domains within the Genes of the RCA

RCA Gene

Phys State%

SCRs involved #

Sub-families^

Function Ref-erences*

FH S 1-20 1-4 or 1-5 1-4, 6-10 & 16-20

Competes with factor B for binding to C3b, Cofactor for factor I-catalysed proteolysis of C3b and DAA of C3bBb Cofactor and DAA C3b binding

1-3

C4BP S 1-8 1-3

Competes with C2 for binding to C4b, Cofactor for factor I-catalysed proteolysis of C4b and DAA of C4bBb C4b binding

4-8

DAF MA 1-4 2 &3 2-4

aaje je aje

DAA of the C3 & C5 convertases Classical pathway convertase regulation Alternative pathway convertase regulation

9-17

CR2 MA 1- 15 or 1-16 1-2

Receptor for C3d, interface between innate and adaptive immunity, iC3b/C3dg binding

18-23

CR1 MA 1-30 (F- allele) 1-3 8-10 & 15-18

(ajefbkd)7ch aje aje & aje

Binds C4a & C4b, DAA of classical and alternative pathways convertases & acts as a cofactor for the factor I mediated proteolysis of iC3b/C4b C4b binding, weak C3b binding & DAA of classical and alternative pathways convertases High C3b and low C4b affinity and acts as a cofactor for the factor I mediated proteolysis of iC3b/C4b

24-26

MCP MA 1-4 1 & 2-4 3 & 4 2 + 3&4

aje f/g a & jef/g ef/g j + e&f/g

Cofactor activity for factor I-catalysed proteolysis of C3b & C4b molecules C4b binding and cofactor activity C3b binding C3b Co-factor activity

27-31

Footnotes

* References

1. (Gordon et al. 1995); 2. (Kuhn and Zipfel 1996); 3. (Sharma and Pangburn 1996); 4. (Scharfstein et al. 1978); 5.

(Dahlback et al. 1983); 6. (Gigli et al. 1979); 7. (Blom et al. 2000); 8. (Blom et al. 1999); 9. (Nicholson-Weller et al.

1981); 10. (Nicholson-Weller et al. 1982); 11. (Caras et al. 1987); 12. (Medof et al. 1987); 13. (Medof et al. 1984);

14. (Coyne et al. 1992); 15. (Brodbeck et al. 1996); 16. (Brodbeck et al. 2000); 17. (Kuttner-Kondo et al. 1996); 18.

Page 39: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 39 -

(Moore et al. 1987); 19. (Matsumoto et al. 1991); 20. (Changelian and Fearon 1986); 21. (Nagar et al. 1998); 22.

(Lowell et al. 1989); 23. (Molina et al. 1995); 24. (Klickstein et al. 1988); 25. (Krych et al. 1991); 26. (Krych-

Goldberg et al. 1999); 27. (Lublin et al. 1988); 28. (Adams et al. 1991); 29. (Iwata et al. 1995); 30. (Liszewski et al.

1991); 31 (Liszewski et al. 2000).

%Physical State

S = Soluble plasma protein MA = Membrane attached protein

# SCRs Involved

SCRs in bold are the critical units that are involved with the specific function. Each SCR is numbered according to its

relative position in the gene

^ Subfamilies Involved

The respective subfamily classification of the functional SCRs (McLure et al 2004a). Note the ajef pattern

4.4 Evolution of the CCPs and SCRs

The CCP family has been conserved from invertebrates to provide regulation of

complement and self-nonself discrimination. The CCPs have evolved in parallel with

the complement cascade and can be regarded as the major agents of regulation. During

vertebrate evolution the family has expanded such that there are numerous copies

clustered throughout the vertebrate and indeed human genome. Functions have diverged

and now include viral receptors (Kotwal and Moss 1988; Miller et al. 1997) regulation

of adaptive immunity (Wei et al. 2001) and maintenance of foetomaternal tolerance

(Bell 2000; Xu et al. 2000).

The SCR domains appear to have an ancient origin that predates their involvement in

CCPs. Examples of these have been identified within insects (Drosphila) (Hoshino et al.

1993), nematodes (Caenorhabtidis elegans) (Consortium 1998), viruses (Spiller et al.

2003) and marine sponges (Blumbach et al. 1998; Pahler et al. 1998).

Page 40: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 40 -

4.5 Polymorphic Frozen Blocks within the Regulators of Complement Activation

4.5.1 Frozen Blocks

Analysis of the RCA cluster has revealed two unlinked genomic blocks, each containing

their own unique set of tightly linked genes (Campbell 1988; Carroll et al. 1988; Heine-

Suner et al. 1997; Heine-Suner et al. 1998; Hourcade et al. 1990; Rey-Campos et al.

1987; Rey-Campos et al. 1988; Rodriguez de Cordoba and Rubinstein 1987). The first

block is ~900 kb-long DNA segment containing the genes 5’- C4BPB-C4BPA-

C4BPAL1-C4BPAL2-DAF-CR2-CR1-MCPL-CR1L-MCP -3’ while the second is 650

kb-long and contains 5’-HF1-FHR1-FHR2-FHR3-FHR4-F13B- 3’ (Pardo-Manuel et al.

1990; Rey-Campos et al. 1990; Rodriguez de Cordoba et al. 1999).

4.5.2 RCA Polymorphism

Several groups have identified polymorphism within the RCA cluster. The majority of

these are coding Single Nucleotide Polymorphisms (SNPs) (Birmingham et al. 2003;

Moulds et al. 2004; Xiang et al. 1999) but examples of structural (Moulds et al. 1996;

Smith et al. 2002; Wong 1990), gene and domain copy (Dykman et al. 1985) number

polymorphisms have also been described. There are numerous SNPs (approximately 1

per 100bp) in keeping with the conclusion that there is extensive polymorphism within

the region. This suggests that, like the MHC, the RCA is split into well-defined PFBs,

each containing conserved AHs of allele combinations. The following table summarises

some of the SNPs identified within the RCA genes (taken from NCBI SNP database and

references above). For this purpose, only non-synonomous (NS) are shown since the

Page 41: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 41 -

ultimate purpose is to determine the difference at the protein level. The data include the

following:

1. Name of the gene (as commonly abbreviated)

2. The PFB where the gene is located. Either RCA alpha (α) or RCA beta (β)

3. The contig, mRNA and protein sequences to which each position refers.

4. The SCR where the polymorphism occurs

5. The resulting change to the mRNA and translated protein

Table 2. Single Nucleotide Polymorphisms within the genes of the RCA

RCA Gene

PFB contig mRNA protein SCR Nuc change Protein change

FH β NT_004487 NM_000186 NP_000177 G-257-A V-62-I

C-1277-T H-402-Y

G-2412-T R-780-I

G-2742-T S-890-I

G-2881-T E-936-D

A-3063-C N-997-T

G-3092-T V-1007-L

G-3101-A A-1010-T

G-3251-C V-1060-L

C-3500-G Q-1143-E

C-3645-T S-1191-L

T-3663-C V-1197-A

C4BPA α NT_021877 NM_000715 NP_000706 C-204-G P-4-A

C-373-T A-60-V

T-1093-C I-300-T

DAF α NT_021877 NM_000574 NP_000565 C-1247-T S-334-F

CR2 α NT_021877 NM_001877 NP_001868

CR1 α NT_021877 NM_000573 NP_000564 C-424-T R-105-C

T-455-C V-115-A

A-1360-G T445A

T-2078-C I-684-T

G-3093-T Q-981-H

A-3650-G H-1208-R

Page 42: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 42 -

C-4334-T T-1408-M

A-4730-G N-1540-S

A-4879-G K-1590-E

A-4912-G R-1601-G

A-4939-T T-1610S

A-4954-G I-1615-V

G-5575-C D-1850-H

C-5591-G P-1827-R

C-5654-T T-1876-I

MCP α NT_021877 NM_172353 NP_758863 G-952A D-266-N

Included in chapter 4 (Fig7) is an alignment of all the Rodent Crry and Primate CR1

and CR1L SCRs. Each of these SCRs has been grouped into subfamilies using their

conserved residues and amino acid motifs. (McLure et al. 2004a; McLure et al. 2005).

The human CR1 SNPs listed in the table above have been annotated on this alignment.

The SNPs that are likely to have a significant effect are those that occur in conserved

residues within each subfamily. Haplotypes of these critical SNPs would therefore be

informative in associating phenotypic variation.

4.6 RCA disease associations and pathogen interactions

4.6.1 Complement related disease

There have been many conditions implicated with the genes of the RCA. These are

generally multi-factorial and involve a number of linked and unlinked genes. The

complement genes on human chromosome 6 play a pivotal role in the development of

these diseases. As previously mentioned, the haplotypic variations in gene expression

have been linked with the development of many autoimmune conditions. The

regulatory effect of the CCPs can provide a protective mechanism for the development

Page 43: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 43 -

of these conditions and is a candidate region for explaining the incomplete penetrance

of the complotypes (Alper et al. 1983). The mechanisms by which the CCPs regulate

complement have already been discussed; the inability of any of these to function

efficiently may result in Autoimmune Disease (AD). Examples of these include

Systemic Lupus Erythemetosus (SLE), Sjogren’s Syndrome (SS), Recurrent

Spontaneous Abortion (RSA), Haemolytic Uraemic Syndrome (HUS) and Age-related

Macular Degeneration (AMD).

With respect to SLE and CR1, there has been a long debate over whether the observed

decrease in CR1 expression on erythrocytes of individuals with SLE is an acquired or

genetic trait (Dawkins et al. 1983; Walport et al. 1985). Walport and colleagues have

suggested that the decreased number of erythrocyte-bound CR1 (e-CR1) molecules is a

direct result of an increased number of circulating immune complexes (IC) and

therefore an acquired phenotype. Wilson and colleagues on the other hand have

provided evidence favouring the genetic hypothesis and demonstrated an association

between SNPs and decreased e-CR1 expression (Herrera et al. 1998; Xiang et al. 1999).

4.6.2 Pathogen Interactions

Many pathogens have devised efficient mechanisms that are designed to evade the

hosts’ immune system. These strategies include direct inactivation of complement either

by encoding complement regulatory proteins or by capturing membrane regulatory

proteins from the host and by using membrane complement receptors to gain cellular

entry (Bernet et al. 2003). The following tables summarise the findings from a number

of recent reviews on RCA genes and their pathogen interactions (Bernet et al. 2003;

Favoreel et al. 2003; Lindahl et al. 2000).

Page 44: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 44 -

Table 3. Microorganisms that acquire RCA proteins or use them for

cellular entry

RCA Gene

Micro-organism

Ligand/ protein

Ligand/protein Interaction SCRs &

Ref*

S. pyogenes Some M Proteins

Bacteria-bound C4BP inhinits C3 convertase activity ?

1 & 2 1-7

N. gonorrhoeae Porin Serum resistance of strains ? ? 8

C4BP

B. pertussis FHA? FHA may bind C4BP directly or stimulates its binding to another bacterial surface ?

? 9

S. pyogenes Some M proteins

Hypervariable regions bind to FH and FHL-1 7 4, 10-14

N. gonorrhoeae LOS Bind to factor H and exploit the complement

inhibitory function 16-20 15

&16 Porin ? S. pneumoniae ? May contribute to phagocytosis resistance

through FH binding ? 17

FH

HIV-1 gp41, gp 120

FH may bind to the envelope proteins? ? 18-22

Vaccinia VCP Incorporation of cellular complement regulators 23 HCMV ? Incorporation of cellular complement regulators 24 HTLV-1 ? Incorporation of cellular complement regulators 25 HIV-1 ? Incorporation of cellular complement regulators 20-22 SIV ? Incorporation of cellular complement regulators 26 E. coli Dr-like

antigens Bind via surface proteins present either in fimbria or in an afimbrail sheath

2&3, or 3

27-31

X adhesin 3 or 4 Echovirus 7 Capsid Binds to DAF which facilitates the intereaction of

a second effector receptor/molecule ? 2,3 &4

32-34

DAF

Coxsackievirus Capsid Binds to DAF which facilitates the intereaction of a second effector receptor/molecule ?

1 or 2 32,34, 56,57

CR2 EBV Gp350/220

Accelerates decay of AP C3-convertase. Factor I cofactor for C3b, iC3b, C4b & iC4b

1 & 2 35-42

P. falciparum PfEMP1 Expressed on the surface of infected erythrocytes, CR1 from uninfected erythrocytes binds and forms rosettes

25? 43

CR1

HIV C3 Bound C3 fragments on surface of HIV interact with complement receptors

44-46

MV Haemagglutinin

Uses MCP as a receptor to initiate infection 1 & 2 47-50

S. pyogenes M6

Protein MCP acts as a keratinocyte receptor ? ? 51-52

N. gonorrhoeae Pili Surface pili promote binding to MCP expressing

cells ? 53

MCP

Page 45: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 45 -

N. meningitidis Pili Surface pili promote binding to MCP expressing cells

? 53

H. pylori BabA

protein ? ? 54

HHV-6 ? Uses MCP as a receptor to initiate infection 1? 55 Vaccinia VCP Incorporation of cellular complement regulators 23 HCMV ? Incorporation of cellular complement regulators 24

SIV ? Incorporation of cellular complement regulators 26 Footnotes

* References

1. (Accardo et al. 1996); 2. (Thern et al. 1995); 3. (Thern et al. 1998); 4. (Johnsson et al. 1996); 5.

(Johnsson et al. 1998); 6. (Liszewski et al. 1996); 7. (Mikata et al. 1998); 8. (Ram et al. 1999); 9.

(Berggard et al. 1997); 10. (Horstmann et al. 1988); 11. (Sharma and Pangburn 1997), 1997; 12.

(Kotarsky et al. 1998); 13. (Blackmore et al. 1998); 14. (Hellwage et al. 1997); 15. (Ram et al. 1998b);

16. (Ram et al. 1998a); 17. (Neeleman et al. 1999); 18. (Stoiber et al. 1995); 19. (Stoiber et al. 1997); 20.

(Marschang et al. 1995); 21. (Stoiber et al. 1996); 22. (Schmitz et al. 1995); 23. (Vanderplasschen et al.

1998); 24. (Cooper 1998); 25. (Spear et al. 1995); 26. (Montefiori et al. 1994); 27. (Nowicki et al. 1993);

28. (Pham et al. 1995); 29. (Jouve et al. 1997); 30. (Peiffer et al. 1998); 31. (Sargiacomo et al. 1993); 32.

(Evans and Almond 1998); 33. (Clarkson et al. 1995); 34. (Shafren et al. 1998); 35. (Mold et al. 1988);

36. (Ahearn and Rosengard 1998); 37. (Moore et al. 1989); 38. (Molina et al. 1991); 39. (Molina et al.

1995); 40. (Fingeroth et al. 1984); 41. (Nemerow et al. 1987); 42. (Tanner et al. 1987); 43. (Rowe et al.

1997); 44. (Robinson et al. 1988); 45. (Stoiber et al. 2001); 46. (Spear et al. 2001); 47. (Naniche et al.

1993); 48. (Dorig et al. 1993); 49. (Manchester et al. 1995); 50. (Casasnovas et al. 1999); 51. (Okada et

al. 1995); 52. (Berkower et al. 1999); 53. (Kallstrom et al. 1997); 54. (Lindahl et al. 2000); 55. (Santoro et

al. 1999). 56. (Bergelson et al. 1995), 57. (Shafren et al. 1997).

As mentioned previously, some viruses have evolved to encode RCA homologous

genes within their own genome. These observations have been reviewed by Bernet

(2003) and Favoreel (2003) and are summarised in the table below.

Page 46: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 46 -

Table 4. Viruses that encode CCPs homologous to the RCA genes.

Virus Family

Virus Viral Protein

Key Feature Refs*

Vaccinia VCP Binds C3b and C4b. Accelerates decay of CP and AP C3-convertases Factor I cofactor for C3b and C4b

1-3

Cowpox IMP Modulates in-vivo complement-mediated

inflammatory responses 4

Pox-viridae

Variola SPICE Inhibits human complement 5 Herpes-viridae

HVS HVSCCPH Blocks C3b deposition 6 &7

Footnotes

CP –Classical Pathway, AP- Alternative Pathway, VCP- Vaccinia Control Protein, HVSCCPH – Herpesvirus

complement control protein homologue, IMP- Inflamation Modulatory Protein, SPICE – Small Pox Inhibitor of

Complement Enzymes

*References

1. (Kotwal et al. 1990), 2. (McKenzie et al. 1992), 3. (Sahu et al. 1998) 4. (Miller et al. 1997), 5.

(Rosengard et al. 2002), 6. (Albrecht and Fleckenstein 1992), 7. (Fodor et al. 1995)

5. Major Discoveries

5.1 Evolution of the SCRs and CCPs

In chapter 1 to 3 we show that RCA genes have evolved through a series of processes

involving duplication and divergence of individual SCRs and more particularly, sets of

SCR domains. It is also clear that individual SCRs and particularly sets of SCRs have

become specialized in terms of position, proximity, and undoubtedly, function. Thus,

for example, the subfamilies ajef, bkdg/g-like and ch (Chapters 1-3) occur on different

proteins but in the same relative position. Duplications, deletions, and other processes of

divergence have contributed to this process of specialisation or selection.

Page 47: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 47 -

5.2 Diversity of duplication – Monomers, Tetramers, Octamers and Segments

5.2.1 SCR Subfamilies – The monomers

Phylogenetic studies have revealed that, with rare exceptions, all SCRs can be classified

into only 11 subfamilies designated a, b, c, d, e, f, g, h, i, j & k. Having demonstrated

that there are specific subfamilies of SCRs, it becomes clear that there must have been

many diverse units of duplication and deletion.

5.2.2 Subfamily relationships –Dimers, Tetramers and Octomers

Analysis of the SCR repertoires from each gene has revealed many conserved sets of

subfamily combinations. The sets we have identified consist of two (c & h), four (ajef or

bkdg) or eight (ajefbkdg) SCRs. The Fearon (Klickstein et al. 1988) and Atkinson

groups (Hourcade et al. 2000; Krych et al. 1994) have demonstrated that it is the sets,

rather than the individual SCRs, that form the functional domains within each gene.

Their analyses revealed that within human CR1, SCRs 1-3, 8-10 and 15-17 (the sets we

define as aje) are the components that form the C3b and C4b binding domains. We have

shown through our classification of SCRs, that the combination of subfamilies is critical

for recognition and binding. The aje pattern is consistent with all SCR combinations

that form binding domains in all human and non-human CCPs (Chapter 1). This

suggests the conservation of a critical subunit throughout the species. Evolution has

resulted in additional domains through duplication of these critical sets (Chapter 3).

Page 48: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 48 -

5.2.3 Segmental Duplications

The 340kb genomic region containing Hosa CR1, MCPL, CR1L and MCP is compared

against itself in Chapter 3 (Fig 2a) and demonstrates the complexity of duplication

within the region. We have demonstrated the segmental duplication of the CR1/MCP

ancestral genes, which resulted in two segments, segment A (CR1 and MCPL) and

segment B (CR1L and MCP). Annotation of the sequence with the location of segments,

genes, Long Homologous Repeats (LHRs) and indels demonstrates that post separation

duplications and deletions of LHRs have created the expansion of segment A (165kb)

and the contraction of segment B (110kb).

5.3 Degeneration

Within the RCA we have revealed several examples of SCR degeneration. This

degeneracy ranges from individual SCRs up to complete LHRs.

5.3.1 Genomic analysis reveals five highly conserved, apparently untranscribed SCRs

of CR1L (Chapter 1)

We first identified SCR degeneracy through genomic analysis of the human CR1L gene

(Chapter 1). Previous studies by Birmingham and colleagues had described seven SCRs

within the gene. This analysis also suggested that these were not expressed in humans

(Logar et al. 2004). The Birmingham group has described the expression of CR1L in

non-human primates and demonstrated its function as the non-human primate immune

adherance molecule (Birmingham and Hebert 2001; Chen et al. 2000).

Page 49: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 49 -

The genomic analysis of CR1L presented in chapter 1 clearly identifies 12 SCRs in

CR1L (we have since found there to be 13 – see below). Subfamily classification of the

additional SCRs was possible, as in each the defining motifs remained relatively

conserved. The analysis clearly showed that the SCRs 8-12 were from the efbkd

subfamilies respectively. Detailed examination of the additional SCRs revealed a

number of nonsense mutations, thereby confirming degeneracy and the unlikely

expression in humans. This may vary between individuals and degeneracy may not

always be the case.

5.3.2 Genomic analysis reveals a duplication of eight rather than seven SCRs in

primate CR1 and CR1L: evidence for an additional set shared between CR1 and

CR2. (Chapter 2)

Chapter 2 reports the discovery of previously unrecognised short consensus repeats

(SCRs) within human and chimpanzee CR1 and CR1L. Analysis of available genomic,

protein and expression databases suggests that these are actually genomic remnants of

the g subfamily SCRs, which occur in other CCPs (McLure et al. 2004a). Comparison

with the nucleotide motifs of the 11 defined subfamilies of SCRs justifies the

designation g-like because of the close similarity to the g subfamily found in CR2 and

MCP. To date, we have identified five such SCRs in human and chimpanzee CR1, one

in human and chimpanzee CR1L, but none in either rat or mouse Crry in keeping with

the number of internal duplications of the long homologous repeat (LHR) found in CR1

and CR1L. At the genomic level, the ancestral LHR must have contained eight SCRs

rather than seven as previously thought. Since g-like SCRs are found immediately

downstream of d SCRs, we suggest that there must have been a functional dg set which

has been retained by CR2 and MCP but which is degenerate in CR1 and CR1L.

Page 50: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 50 -

Interestingly, dg is also present in the CR2 component of murine CR1 (MCR1). The

degeneration of the g SCR must have occurred prior to the formation of primate CR1L

and prior to the duplication events, which resulted in primate CR1. In this context, the

apparent conservation of g-like SCRs may be surprising and may suggest the existence of

mechanisms unrelated to protein coding.

5.4 Human CR1: A model for CCP evolution

Genomic and protein analyses suggest that Primate CR1 and CR1L may have evolved

from a common hypothetical ancestor, CR1 alpha (CR1 α) (Chapter 3). We show that

duplication and deletion, both small and large, appear to be the major contributing

factors in the divergence of the two genes. We postulate duplication events involving

“a, j, e, f, b, k, d, g-like” of CR1 α which have resulted in three copies. This has been

presented as a triplication although there may have been two independent duplication

events. Either way, we envisage three copies prior to the segmental duplication of

PreCR1/PreCR1L (CR1β apparently including MCPα). This model resulted in the

formation of 4 genes, Pre-CR1, MCP, Pre-CR1L and MCPL. Since the formation of

Pre-CR1 and Pre-CR1L, indels of retroviral and genomic sequence have caused each to

diverge. In CR1L, deletion of thirteen Hosa CR1 SCRs 21, 21-22, 22, 26, 27, 28, 28-29,

29, 30, 31, 32, 36 & 37 has occurred through three independent events. In CR1,

insertion of Hosa CR1 SCRs 5 to 18 has occurred through a triplication of “bkdg-like

ajef”(Hosa CR1 SCRs 19 to 25). As with CR1α, it is unclear whether the replication

occurred as two separate duplications. However, it is clear that the region has been

highly conserved throughout CR1 and CR1L evolution. The indels that occurred

throughout the evolution of these genes have resulted in 42 SCRs in Hosa CR1

[“(ajefbkdg-like)5 ch”] and 13 in Hosa CR1L [“a(jefbk)2dg-like”].

Page 51: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 51 -

5.5 Divergence – RCA Polymorphism

5.5.1 Single Nucleotide Polymorphisms

Prior to the commencement and throughout the course of my studies, a considerable

amount of effort has been directed at identifying polymorphism within the RCA region.

These studies have nearly always targeted SNPs and many are now known throughout

the region. With respect to CR1, the effects of non synonomous coding SNPs have been

studied extensively (Birmingham et al. 2003; Moulds et al. 2004; Xiang et al. 1999).

These studies have attempted to relate polymorphisms of a single base and ultimately an

amino acid, with the observed changes to the amount of eCR1 and / or the efficiency of

these to bind C3b/C4b (Birmingham et al. 2003). A reliable diagnostic marker that

could accurately predict either of these would have major implications in disease

prediction. Several studies have shown strong association between individual SNPs and

the amount of CR1 expression but these associations are only found in specific ethnic

groups; analysis of the same SNP within different ethnic groups shows little or no

significance (Herrera et al. 1998; Xiang et al. 1999).

5.5.2 Conserved subfamily motifs define the critical SNPs of CR1

In Chapters 1 and 3, we identified the critical residues and motifs that define each SCR

subfamily by displaying their conservation through rodent and primate evolution

(McLure et al. 2004a; McLure et al. 2005). Using the SNPs described by Moulds

(Moulds et al. 2004) and Birmingham (Birmingham et al. 2003) as an example, and

determining their location within each SCR, we can predict the likely impact that each

Page 52: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 52 -

will have based on whether they occur within a highly conserved residue. The results of

this are presented in Chapter 4 – Fig7.

5.5.3 SNP Haplotypes

This limited success in defining clinically relevant polymorphism is likely to be a

function of the region`s complexity (McLure et al. 2004a; McLure et al. 2004b; McLure

et al. 2005) and suggests that there are many factors within the region that influence

CR1 phenotypes. Defining haplotypes of the SNPs and relating these to phenotypes

should be more informative than associating the individual SNPs alone.

5.5.4 Genomic Matching Technique haplotypes

With this in mind and because of the proven ability of the GMT to identify complex

haplotypic differences within large genomic regions (Gaudieri et al, 2001), we designed

an extension to the technique to identify haplotypes of the RCA alpha block (CR1,

MCPL and CR1, a region spanning 150kb). These haplotypes mark specific

combinations of the many simple and complex polymorphisms (SNPs, duplications and

indels) within the region, as well as the SNP haplotypes that were previously reported

(as demonstrated in Chapter 4 – Fig7). Through this analysis, we identified extensive

polymorphism with at least 20 distinct haplotypes defined so far. These haplotypes will

undoubtedly contain undescribed ancestral polymorphisms (duplications, indels etc) that

will relate to susceptibility and / or protection to disease. These polymorphisms should

be far more informative than SNPs in marking functional variation between individuals.

Page 53: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 53 -

The limit of the RCA alpha block remains unclear although earlier analysis using

microsatellite markers has demonstrated that the region between CR1 and DAF is also

free from recombination (Heine-Suner et al. 1997; Rodriguez de Cordoba et al. 1999). If

this is correct, it implies that the GMT method is defining haplospecific profiles for a

genomic region spanning well over 0.5Mb and containing the coding and non coding

polymorphisms of at least 5 genes (DAF, CR2, CR1, MCPL & CR1) and possibly more

(MCP and CD34).

Further studies will be needed to define the extent of this block but regardless of the

eventual size, the GMT provides a simple, efficient and highly informative way to

characterise ancestral haplotypes of the RCA. These ancestral haplotypes will without

doubt solve many of the issues previously encountered in understanding epigenetic

interaction of RCA and MHC genes and their association with disease.

5.5.4.1 Sequencing results

The amplified elements of CR1 and CR1L have been sequenced and the data submitted

to Genbank. Analysis of these sequences shows a complex pattern of repeat

combinations in addition to the complex CR1L defining region (Chapter 4- Fig 6). From

our experience of the HGEs used in the MHC, we would suggest that these are highly

informative markers for defining RCA AHs.

6. Other Observations

In addition to the RCA GMT extension presented in chapter 4, I have also investigated

the applicability of the GMT in defining PFBs within other species. The idea is that

AHs can produce a signature for identifying complex phenotypic traits.

Page 54: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 54 -

6.1 DLA DQ and DR typing in the Canine.

The work presented in Chapter 5 reports an alternative method for the identification DQ

and DR alleles within the canine MHC (Dog Leukocyte Antigen - DLA). The test

utilises the GMT to amplify polymorphic elements within each copy of the duplicated

DLA DQ genes, DQ α and DQ β.

The results of this were iniatially used to determine parentage within a family of 18

Blue Heeler purebreds. This revealed extensive polymorphism and the results were

more discriminatory than the routine DLA SBT (as described in

http://www.ebi.ac.uk/ipd/mhc/dla/align/html) and microsatellite profiling methods

currently used (DeNise et al. 2004; Francisco et al. 1996).

Segregation of the profiles reveals that each haplotype is faithfully transmitted through

multiple generations and therefore each will undoubtedly have relevance to the

numerous DLA implicated canine diseases (Garlepp et al. 1984; Hedrick et al. 2003;

Ollier et al. 2001; Wagner et al. 1996).

6.2 Other Projects

The following projects have not been reported in this thesis, but preliminary data has

been obtained in relation to GMT diagnostics of the following gene clusters and their

association with particular traits.

1. Keratin genes

a. Polling/Horns

b. Fine wool in sheep

c. Hereditary hair loss in human

Page 55: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 55 -

2. Beta Defensins

a. Mastitis in cattle

b. Bacterial resistance of fish in intensive breeding environments

(Aquaculture)

3. Apolipoproteins (APO) (Dr Peter Keating is the chief investigator on this

project, I have been assisting him with bionformatics, genomic analysis,

primer design and the segregation of profiles)

a. Alzheimers

b. Vascular Disease

Page 56: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 56 -

Page 57: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 57 -

CHAPTERS

Page 58: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 58 -

Page 59: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 59 -

PRELUDE TO CHAPTER 1

CONSERVED MOTIFS DEFINE SCR SUBFAMILES

Chapter 1 consists of a paper published in The Journal of Molecular Evolution. The

paper describes the genes and proteins within the region known as the Regulators of

Complement Activation (RCA), a reservoir of clinically important, complement

regulatory genes known as CCPs.

The study analysed the genomic and amino acid sequences of CCPs from a number of

species, including primates (Human – Homo sapiens, Chimpanzee - Pan troglodytes &

Hamadryas baboon - Papio Hamadryas) and rodents (Norway rat - Rattus norvegicus &

House mouse - Mus musculus). This analysis revealed extensive duplication ranging

from segmental duplications of >100 kb and containing multiple genes to 200

nucleotides (nt) and containing only single domains.

To define relationships between each of the genes and their domains, all SCRs were

extracted and added to a SCR database. This allowed the individual sequences to be

analysed and compared without regard to their presence within a particular CCP. The

analysis revealed further conserved residues, other than the SCR defining

C…C…C…W…C motif. The additional conserved residues are specific to SCR

subgroups, or “SCR subfamilies” as we refer to them. Phylogenetic studies suggest that

there are 11 of these subgroups or subfamilies and that all SCRs can be classed as

belonging to only 1 of these. Each subfamily has been labelled alphabetically from “a”

to “k”.

Page 60: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 60 -

This strategy allowed us to view the functional units of each CCP, according to which

ancestor it originated from. Analysis of the genes shows that similar combinations or

sets of subfamilies are conserved between numerous CCPs, for example “ajef” “bkd”

and “ch”. This suggests that the specific combinations are forming functional domains.

Changes in non-conserved residues within subfamilies has created the variation so that

each gene has evolved to have a unique function.

I am grateful to (i) Professor Roger Dawkins for his ideas, interpretations, comments

and assistance in the compilation of this study, (ii) Dr Silvana Gaudieri for sequence

alignments, synonomous and non-synonomous comparisons and phylogenetic analyses,

(iii) Dr Joseph Williamson for sequence alignments, identifying conserved motifs and

contributions to the text, (iv) Mr Richard Davies for assisting in the compilation of SCR

sequences, (v) Dr Jemma Berry for her assistance with the sequence alignments and

contributions to the text, (vi) Dr Natalie Jacobsen and Miss Rebecca Laird for their

helpful comments and contributions

Page 61: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 61 -

CHAPTER 1

AMINO ACID PATTERNS WITHIN SHORT CONSENSUS

REPEATS DEFINE CONSERVED DUPLICONS SHARED BY

GENES OF THE RCA COMPLEX

The work described in this chapter has been published in :

McLure, C., R. Dawkins, J. Williamson, R. Davies, J. Berry, N. Longman-Jacobsen, R.

Laird and S. Gaudieri (2004). "Amino acid patterns within short consensus repeats

define conserved duplicons shared by genes of the RCA complex." Journal of

Molecular Evolution 59(2): 143-157.

Page 62: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 62 -

Page 63: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

The following papers could not be included in this digital thesis for copyright reasons. Please refer to the physical copy of the thesis, held in the University Library.

Page 64: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 63 -

Page 65: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 79 -

PRELUDE TO CHAPTER 2

ANCESTRAL LHRs CONTAINED EIGHT SCRs

Chapter 2 reports the discovery of a novel SCR subfamily within CR1 and CR1-like.

Previous analysis by us (Chapter 1) and others, has reported that CR1 is composed of

multiple copies of a seven SCR, LHR. The work presented in chapter 2 identifies an

additional, apparently redundant SCR within each LHR and suggests that the ancestral

LHR contained eight, not seven SCRs.

We performed several analyses, in order to demonstrate that the sequences identified,

were in fact additional SCRs. These analyses included (i) relationship to neighbouring

SCRs, (ii) comparison to subfamily sets identified in other CCPs and (iii) sequence

conservation between other defined SCR subfamilies.

Relationship to neighbouring SCRs

The location of the novel SCRs was shown to be between the “d” and “a” subfamilies

of each LHR. Closer analysis revealed the novel SCR to be within 30 nt 3’ of the “d”

subfamily and suggests the “d” and the novel SCR are likely to have formed a

functional set.

Comparison to subfamily sets identified in other CCPs.

In order to prove the association with the “d” subfamily, I analysed the subfamily sets

identified in the other CCPs. In CR2 and mCR1, the “d” and “g” subfamilies occur as a

set, in fact both are within the same exon, which suggests the novel SCRs are likely to

have been part of the “g” subfamily.

Page 66: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 80 -

Sequence conservation between other defined SCR subfamilies

Using the nucleotide sequence to define subfamilies, as opposed to the protein sequence

in chapter 1, I compared the conserved nucleotides of the suspected SCR against the

known subfamilies. The results of this were not convincing but did suggest a close

association with the “f” and “g” subfamilies.

In order to prove the theory that the novel SCRs are in fact “g” we analysed the d&g

encoding exons from CR2 against the “d” & novel SCR sequence from CR1 using

dotter (Sonnhammer et al). This confirms our classification as “g”, but we refer to these

additional SCRs as “g–like” because there is no evidence to suggest that these are

translated.

I am grateful to (i) Professor Roger Dawkins for his ideas, interpretations, comments

and assistance in the compilation of this work (ii) Dr Joseph Williamson for his

assistance in constructing the alignments and suggestions for the presentation of

figures, (iii) Dr Brent Stewart and Dr Peter Keating for their helpful comments in

presentation of the figures and additions to the text.

Page 67: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 81 -

CHAPTER 2

GENOMIC ANALYSIS REVEALS A DUPLICATION OF EIGHT

RATHER THAN SEVEN SHORT CONSENSUS REPEATS IN

PRIMATE CR1 AND CR1L: EVIDENCE FOR AN ADDITIONAL

SET SHARED BETWEEN CR1 AND CR2

The work described in this chapter has been published in:

McLure, C., J. Williamson, B. Stewart, P. Keating and R. Dawkins (2004). "Genomic

analysis reveals a duplication of eight rather than seven SCRs in Primate CR1 and

CR1L: Evidence for an additional set shared between CR1 and CR2." Immunogenetics

56(9): 631-638.

Page 68: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 82 -

Page 69: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

The following papers could not be included in this digital thesis for copyright reasons. Please refer to the physical copy of the thesis, held in the University Library.

Page 70: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 91 -

PRELUDE TO CHAPTER 3

CR1 EVOLUTION

The work presented in Chapter 3 focusses on the region containing the CR1/MCP

segments and more specifically the CR1 and CR1L genes. The analyses presented in this

chapter combines our previous findings presented in Chapters 1 and 2, with our more

recent genomic analyses of rodents and non-human primates.

The aim of this study was to demonstrate that indels and imperfect duplication have

been the driving forces in the evolution of the RCA gene cluster.

Using rodent Crry and primate CR1 and CR1L as our model, we demonstrate how a

critical unit (subfamily combination ajef) is maintained throughout all species. Gene

complexity has accumulated during evolution through (i) the addition of extra SCRs, (ii)

the duplication of critical sets, (iii) the deletion of individual and sets of SCRs and (iv)

the insertion of retroviral sequence. These have resulted in the differences currently

observed.

Figure 4 represents a model for the evolution of the human CR1 and CR1L genes. This

summarises our observations from the current paper, as well as those from Chapters 1

and 2.

I am grateful to (i) Professor Roger Dawkins for his ideas, interpretations, comments

and assistance in the compilation of this study (ii) Dr Joseph Williamson, Dr Brent

Page 71: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 92 -

Stewart, Dr Peter Keating, Mr John Millman and Dr Louise Smyth for their helpful

discussions for the presentation of the figures and for their contributions to the text.

Page 72: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 93 -

CHAPTER 3

INDELS AND IMPERFECT DUPLICATION HAVE DRIVEN THE

EVOLUTION OF HUMAN CR1 AND CR1-LIKE FROM THEIR

PRECURSOR CR1 ALPHA. IMPORTANCE OF FUNCTIONAL

SETS.

The work described in this chapter has published in:

McLure, C., J. Williamson, B. Stewart, P. Keating and R. Dawkins (2005). "Indels and

imperfect duplication have driven the evolution of human CR1 and CR1-like from their

precursor CR1 alpha: Importance of functional sets." Hum Immunol 66(3): 258-273.

Page 73: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 94 -

Page 74: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

The following papers could not be included in this digital thesis for copyright reasons. Please refer to the physical copy of the thesis, held in the University Library.

Page 75: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 111 -

PRELUDE TO CHAPTER 4

RCA POLYMORPHISM

The following chapter reports the discovery of extensive RCA polymorphism. The

study utilised previous genomic analysis, presented in chapters 1 to 3, to identify

complex polymorphic elements within each of the duplicons. Using the GMT, these

products have been amplified and separated by the Polyacrylamide Gel Electrophoresis

(PAGE). The amplicons represent alleles of duplicated loci. Segregation of the

haplospecific patterns through 3 generations supports the previous findings by others

that the region between CR1 and CR1L is part of the same genomic block.

We foresee this example of the GMT will assist in defining the RCA haplotypes that

exhibit susceptibility and resistance to disease. This paper for the first time shows a

relationship between the genes of the Human RCA and RSA.

The results presented here, relate to the application of the GMT in the RCA, but are

presented not just as a paper describing haplotypes of the RCA region but also as a

strategy for identifying genomic polymorphism, AHs and PFBs in any region, from

within any genome.

Post submission addition to Chapter 4

Statistical analyses of the data in Table 2 have been ongoing. In this instance we have

used a contingency table analysis to compare the data and interpreted the differences

observed using odds ratios. The odds ratios were then estimated simultaneously by

logistic regression performed using WinBugs (V 1.4.1 http://www.mrc-

bsu.cam.ac.uk/bugs/winbugs/contents.shtml). WinBugs uses Bayesian MCMC methods

to estimate empirical 95% credible intervals (CI), which are less biased for small

sample sizes. The latest results, confirm that observed differences in haplotype

Page 76: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 112 -

frequencies between different ethnic groups and disease and controls such as Recurrent

Spontaneous Abortion (RSA) are significant.

A comparison between Indians and Caucasians revealed the frequencies of haplotypes

1-3, 6-7, 9 and 11 were all significantly decreased within the Indian group (RSA

samples pooled). This perhaps is not surprising as the Ancestral haplotype list was

derived on a frequency basis from predominantly Caucasian samples.

The following manuscript reveals associations between the CR1.02 and CR1.08

Ancestral Haplotypes and Recurrent Spontaneous Abortion (RSA). Statistical analyses

of these haplotypes in RSA controls (RSA-C) and patients (RSA-P) confirm that

CR1.02 AH is significantly decreased within the RSA-P group with an odds ratio of

0.08 (range 0.01, 0.47). The increase in CR1.08 AH, which appears to be significant

when compared to other groups, does not appear significant using this analysis. This is

most likely due to very low power of this analysis to detect significance of low

frequency haplotypes.

I am grateful to (i) Professor Roger Dawkins for his ideas, interpretations, comments

and assistance in the compilation of this study, (ii) Dr Joseph Williamson for primer 11

and 12 assay, haplotype assignment, statistical analysis and contributions to the text,

(iii) Dr Louise Smyth for the donation of three generation families, (iv) Professor

Suraksha Agrawal for clinical samples and contributions to the text, (v) Mrs Susan

Lester for clinical samples, statistical analyses and contributions to the text, (v) Mr

John Millman for statistical analysis and (vi) Dr Peter Keating and Dr Brent Stewart

for their suggestions and contributions to the figures and text and (vii) Mr Ryan

Southall, Mr Oscar Kolai and Dr Peter Kesners for the laboratory testing of samples.

The contributions of visitors to the CYO quarterly meetings to this manuscript must be

acknowledged. To Professor Stephen Oppenheimer, Professor Michael Chorney and

Professor WenJie Zhang, thankyou for your questions, comments, discussions and

contributions.

Page 77: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 113 -

CHAPTER 4

EXTENSIVE GENOMIC POLYMORPHISM OF THE CR1

REGION: RCA ANCESTRAL HAPLOTYPES, FUNCTION AND

DISEASE

The work described in this chapter has been submitted for publication:

McLure, C., J. Williamson, L. Smyth, S. Agrawal, S. Lester, J. Millman, P. Keating, B.

Stewart, and R. Dawkins (In Press). "Extensive genomic polymorphism of the CR1

region: RCA ancestral haplotypes, function and disease". Immunogenetics

Page 78: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 114 -

Page 79: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 1

Extensive genomic polymorphism of the CR1 region: RCA

ancestral haplotypes, function and disease.

Craig A McLure1,2, Joseph F Williamson

1,2, Louise A Smyth

1,2, Suraksha Agrawal

3

, Susan Lester2,4, John A Millman

2,5,Peter J Keating

2, Brent J Stewart

1,2& Roger

L Dawkins1,2,6

1Centre for Molecular Immunology and Instrumentation, University of Western Australia,

Nedlands, Western Australia 6907

2C.Y. O’Connor ERADE Village, PO Box 5100, Canning Vale, Western Australia 6155

3Department of Medical Genetics, Sanjay Gandhi Institute of Medical Sciences, Lucknow India

226014

4Arthritis Research Laboratory, Hanson Institute, Frome Road Adelaide, South Australia 5000

5TAFEWA Swan Campus, Hayman Road, Bentley, Western Australia 6102

Manuscript number 0408 of the Centre for Molecular Immunology and Instrumentation of the

University of Western Australia and the C Y O’Connor ERADE Village.

Sequences submitted to Genbank: Accession No’s DQ007054-DQ007076

6Correspondence should be addressed to RLD:

Professor Roger Dawkins

CY O’Connor ERADE Village

PO Box 5100

Canning Vale South

Western Australia 6155

Facsimile: +618 9397 1559

Email: [email protected]

Page 80: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 2

Using a combination of genomic markers we show that there are more

than 20 distinct Ancestral Haplotypes (AH)1 of Complement Control

Proteins (CCPs) at 1q32. This extensive polymorphism is important

because CCPs are involved in the Regulation of Complement Activation

(RCA) whilst also serving as viral receptors 2.

To screen for haplotypes, we have developed the Genomic Matching

Technique (GMT) based on the pragmatic observation that extreme

nucleotide polymorphism is packaged with duplicated sequences.

Within the Major Histocompatibility Complex (MHC) at 6p21.3, these

regions or quanta are well characterised and are designated

Polymorphic Frozen Blocks (PFB). At each PFB, there are many

alternative sequences (haplotypes) which have become frozen in that

they are inherited faithfully from very remote ancestors. Such AH are

critical in transplantation and disease 1.

We have compared RCA haplotypes in diverse diseases and report that

the frequencies differ in conditions such Recurrent Spontaneous Abortion

(RSA) and Psoriasis Vulgaris (PV)

The many and diverse duplications of CCPs have been described

elsewhere 3-5. Several categories of duplicons were evaluated using

GMT. For example, as shown in Fig. 1, there is an extensive segmental

Page 81: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 3

duplication involving Complement Receptor 1 (CR1) and Membrane

Cofactor Protein (MCP). With primers P5+6 designed to amplify at

duplicated sites separated by hundreds of kilobases, we observed

multiple diverse products in a screening panel of 60 human subjects

selected to include the major ethnic groups and some relevant diseases.

As shown in Fig. 2, there are 1, 2 or 3 products in the range around 300bp

and 0, 1, 2 or 3 products in the range around 350 bp. Each of the 11

subjects has a unique composite profile. As shown in Fig. 3, these are

highly reproducible with only minor differences under different conditions

of amplification.

We then studied 3 generation families in order to determine whether

combinations of products define transmissible haplotypes. The families

had already undergone MHC typing which was consistent with stated

parentage. In all cases, the RCA haplotypes were unequivocal and

faithfully transmitted. For example, as shown in Fig. 4, each product can

be numbered according to length such that I 1 in family 1 has the 4,5

and 16 profile which resolves through segregation analysis into two

haplotypes (a=4 with null and b=5 with 16) and therefore the genotype

4,0;5,16. Note also that in II 2 (ac), the intensity of product 4 is increased

in keeping with the genotype 4,0;4,14 and homozygosity of 4. Similarly, in

Family 2, I1 (ab) is homozygous for 5.

Page 82: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 4

In spite of some homozygosity, there is extreme polymorphism as

illustrated by the fact that there are 11 different profiles and genotypes in

the 12 subjects. In each family there are 3 unrelated individuals

(ab,cd,ef). In these 6 subjects there are 9 different haplotypes. In the

case of the 4,0 and 5,0 haplotypes the frequencies were 2/12 and 3/12

respectively suggesting that these may be relatively common and

functionally important ancestral haplotypes. We therefore reviewed the

profiles of the panel of 60 subjects and found that most haplotypes

could be assigned using the iterative strategies described in the

methods.

Confirmation of these assignments was obtained by amplifying other

duplicated sequences with primers 11 and 12 shown in Fig. 1 and by

determining the presence or absence of the BstN1 (G3093T) cutting site 6

on different haplotypes (Table 1). These results demonstrated that the

haplotypes contain haplospecific features at multiple sites. For example

02 contains 4,0 with P5+6, and 1, 13 with P11 +12 and is G3093 whereas

08 is P5+6=6,13 and P11+12=5,11 and is G3093T.

We then tested a separate panel of 322 subjects. The frequencies of

haplotypes in this dataset are as expected from the 2 smaller panels and

are shown in Table 1 which also proposes designations for the more

common ancestral haplotypes.

Page 83: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 5

To characterise the haplotypes in more detail we sequenced

representative P5+6 products. Based on the available genomic

sequences, we expected that the products of less than 331 bp would be

from CR1 and those above 331 would be from CR1-like (see Fig. 2). We

therefore established operational criteria for assignment using the

patterns shown in Fig. 6. All sequences were as expected.

As shown in Table 1 and Fig 1, some haplotypes fail to generate a CR1-

like product when amplified with P5+6. Since P11+12 yield 2 products per

haplotype we conclude that there is a further polymorphism, probably

an indel, which negates amplification with P5+6 on the CR1-like null

haplotypes. Of further interest, the data suggest that some haplotypes

contain more than 2 duplicons. In fact, on longer gels there are

additional products which have not been shown here.

CCPs are found in clusters around the genome including the MHC where

they are within the early complement components C2 and Bf. The major

cluster at 1q32, designated the RCA complex, contains critical genes

such as MCP, CR1 and CR2 and has many important functions including

viral receptors. CR1 is a receptor for C4 which is itself a highly

polymorphic component of the MHC gamma PFB. It is not surprising that

CR1 polymorphism has been sought for many decades especially in

relation to the possibility that deficient or defective CR1 might be

expected to be associated with poor control of C4 activation and

Page 84: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 6

therefore disease. Although most results have been disappointing and

controversial there is now evidence that some point mutations or single

nucleotide polymorphisms, such as that revealed by BstN1, may be

important in function and diseases such as in malaria and Systemic Lupus

Erythematosus (SLE) 6,7. These studies illustrate the need for haplotyping

as a means of identifying linked polymorphisms including those of

functional significance.

In the case of CR1-like, some interesting possibilities have been raised by

the finding that there can be an expressed and functional product in

humans as well as Non Human Primates 8. Our demonstration of

extensive polymorphism in both CR1 and CR1-like suggests that there

may be quantitative differences in function and susceptibility to disease.

Meaningful clinical studies using the haplotypes described here are now

possible.

In Table 2 we show the frequencies in the panel of 322 arranged by

clinical subset. The distribution of CR1-01 is similar in all groups but

CR1-02 is rare in patients with RSA and frequent in those with Psoriasis

Vulgaris(PV) (Fig 6). The reverse is seen with CR1-04 and - 08. Indeed

when haplotypes are compared in terms of RSA-P v PV the ratios vary

tenfold. Note also that more than 50% of haplotypes are yet to be

defined in RSA-P whereas the corresponding figure in PV is 10-19%.

Page 85: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 7

These results provide the first evidence for a role of the RCA complex in

RSA. This finding is entirely consistent with the important role of RCA

products in the outcome of pregnancy in mice. 9,10

The present study provides another example of the utility of the GMT

approach. This simple procedure has demonstrated linked

polymorphisms including at least one of functional significance 6. Short of

sequencing and somehow assembling hundreds of kilobases in at least

30 subjects, we know of no other approach which could reveal more

than 20 different haplotypes with such extensive polymorphism. The

rationale for the assay is that sequence polymorphism is concentrated in

some regions or quanta, which, in our experience, are also rich in

duplications. It remains to be determined which particular category of

duplicon is most useful but, in the meanwhile, we recommend the use of

larger segments with major indels and therefore differences in length

when the 2 or more copies are compared. Such features apply to the

MHC as well as to the current study. It is important to emphasise that the

relationship between polymorphism and duplication is complex 5 and

unrelated to whether one copy is a pseudogene.

Insertions and deletions (indels) are also associated with concentrations

of polymorphism 11. These indels are often complex and degenerate

suggesting a mechanism for divergence between the different

duplicons. As described in Fig. 1, the sequence amplified includes an L1

Page 86: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 8

(L1M5 or L1P4) which must have anteceded the duplication but which is

different when the 2 copies are compared. There are also differences in

the 5’ sequence but most of the variations in length are due to the very

complex TC rich region which we refer to as a Polymorphic or

Haplospecific Geometric Element (HGE). This contrasts with a

microsatellite in that there are diverse units of different lengths and yet

the sequences have a geometric pattern (Fig. 6). Other features we

associate with such HGEs are stability, complementary sequences,

uniqueness within the genome and extreme polymorphism. A study using

microsatellites in the vicinity of CR1 revealed little polymorphism but did

suggest that there is limited recombination as predicted by the PFB

hypothesis .12

PFBs are remarkable since, although they contain extreme

polymorphism, duplicons and indels, they behave as though they

become frozen after which they appear to be resistant to recombination

and mutation. In terms of calculations of linkage disequilibrium, higher

values are found within, rather than between PFB, but cannot be

expected when haplotypes share common alleles in different

combinations.

The alternative sequences within a PFB (ancestral haplotypes) are

inherited faithfully over many generations. In the MHC, ancestral

haplotypes which are now found in tens of millions of the population

Page 87: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 9

have proven, when sampled, to be identical at the sequence level. We

expect that the same will be true of CCP region and that these

conserved polymorphisms will be critical in explaining differences in

function and disease (see Fig.7). Included in the possibilities are

inflammatory diseases such as RSA, SLE and SS and differences in

susceptibility to viruses, such as measles, which exploit CCPs, such as

MCP, as receptors. 2.

Methods

To investigate polymorphism within the RCA region we needed an approach

which could interrogate extensive sequences so as to reveal the presence of

haplotypes. The GMT approach is designed to show sequence differences

between the maternal and paternal chromosomes by generating a sequence-

specific profile for each. For example, in the MHC, we have shown that a

particular amplification product may mark a sequence of 200Kb 1,13. Within this

sequence there are many expressed genes, each with a haplotypic allele.

There are also multiple noncoding differences at least some of which appear to

be important in regulation.

Thus, the GMT gives a simple signature for the entire haplotype of the PFB and is

therefore the best approach to matching for bone marrow grafting 14 and to

the identification of disease associations 1.

The GMT procedure used on this occasion involved the following steps:

1. Identification of duplicons.

The genomic region containing CR1, MCP-like, CR1-like and MCP at

1q32, was taken from the NCBI database

Page 88: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 10

(http://www.ncbi.nlm.nih.gov/) (position 1124945–1449694 on contig

NT_021877.16 (gi:37539616); accession numbers AL691452.10,

AL137789.11, AL365178.10 and AL035209.1). This sequence was

compared against itself using Dotter 15 to identify evidence of

duplication 3-5.

2. Selection of primer sites present in all duplicons.

Segment A, containing CR1 and MCP-like was compared to Segment B,

containing CR1-like and MCP. Regions within these two segments which

shared a complex geometric element were identified as targets 3. The

geometric element must vary in size between the duplicates (see Figs. 1

& 6) but also contain enough homology either side of the element so as

to enable the design of primers that will bind and amplify within each

segment. The resulting mix of products has the potential to define

extensive haplotypes.

Duplicons at position 1150081-1150372 (CR1) and 1322386-1322768 (CR1-

like) of NT_021877.16 were aligned using Clustalw

(http://www.es.embnet.org/cgi-bin/clustalw.cgi). Using Primer3

(http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi), primers were

designed so that a single primer pair will bind and amplify both

duplicates or even more if, as expected, there are more than two

duplicated segments on some haplotypes.

Primer sequences were compared to the NCBI databases using BLASTN

(http://www.ncbi.nlm.nih.gov/BLAST/) at low stringency. Sequence

Page 89: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 11

identities which matched the primers in both the forward and reverse

directions were identified. The only significant matches for primers in

question were in close proximity and it could therefore be assumed the

primer pair would amplify within a PFB. Analysis of the amplified

elements with matches from the Celera database (NT_086601 position

1267344-1267734) suggests the duplicated elements are polymorphic

between individuals (Fig. 1). The intention is to amplify as many

duplicated sites as possible so long as there is no amplification of

unlinked sequences. In the case of the RCA complex, there is a risk of

interference from unlinked priming because CCPs are widely distributed.

Accordingly, we used a three generation nuclear family to test the

selected primers. If the primers are valid, segregation through

generations should be apparent.

3. Comparison of products within 3 generation families

Families with disputed paternity were avoided. Individuals were

compared as blind pairs. Amplicon peaks were numbered successively.

4. Assignment of haplotypes

Once the profiles of individual subjects were defined and compared, the

data were interpreted within the context of the family structure. For

example, the grandfather is designated ab and the grandmother cd.

Next, the second generation, designated II, is inspected to determine

which part of the parental profiles were transmitted. In this way a,b,c,

and d haplotypes can be deduced. As a test of the validity of these

assignments, the next generation (III) is examined. Haplotypic profiles

Page 90: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 12

from generation I should be retained even when they are associated

with haplotypes not present in the previous two generations.

5. Determination of population frequencies with comparison of functions

and diseases.

Haplotypic profiles verified by family studies were given a number here

referred to as 01,02…99 (see Table 1). These profiles can then be

recognised in other families and in other homozygotes. Having defined

common ancesteral haplotypes, we then examine heterozygotes to

determine if 2 assigned haplotypes are present. Product intensity is also

considered as illustrated in Fig. 4. We use the Hardy Weinberg test as an

indication of the validity of assignments. Population and disease studies

are then justified.

We also generated all theoretically possible haplotypes from the alleles

found in each subject. Those occurring in more than 3 subjects were

considered further. In some cases, the frequencies were similar to those

shown in Table 1 but there were major differences. Some of the

common theoretically possible haplotypes were not observed as

homozygotes and were not assigned.

Amplification and analysis of CR1 and CR1-like Haplospecific Geometric

Elements (HGEs)

Genomic DNA was prepared using the standard salting-out method.

Primer Sequences:

P5+6

Page 91: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 13

CR1MCP5 5`AAT TCC AAA TTG GCC TGG TTG A 3` and

CR1MCP6 5`CCT TCC CTT TGA GAT GTG GAA CA 3`.

P11+12

CR1MCP11 5` GTC AGC TTG GAT TGC CCT TGG TTC TA

CR1MCP12 5` CCT GGG CAA CAA AGC AAG ACA TTG T

Polymerase Chain Reaction:

PCR reactions were performed in a 96-well Palm Cycler (Corbett Research) in

20µl volumes using 100 ng of template DNA, 1.3 U Taq Polymerase (Fisher

Biotec), 10 pmol of the forward and reverse CR1MCP primers, 200µM of each

dNTP, 2 mM MgCl2 and 1X PCR buffer (Fisher Biotec). The samples were

denatured at 940C for 5 min, followed by 30 cycles each comprising 30 seconds

at 940C, 45 seconds at 580C and 45 seconds at 720C. The last cycle was

followed by an additional extension for 5 minutes at 720C.

Detection of amplicons and haplotypes:

The separation and detection of the allelic variants of CR1 and CR1-like was

done with the Corbett Research GS-3000 automated gel analysis system. One

microlitre of PCR product was mixed with 1 µl of loading buffer containing

Puc19 molecular weight ladder. One microlitre of the PCR sample and loading

buffer mixture was then added to a 32 cm long, 48 well, 4% polyacrylamide,

ultra-thin gel and pulsed for 10 seconds. Excess sample was then flushed and

the gel was run at 2000 V for 180 minutes.

Page 92: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 14

Gel analysis and profile generation.

The gel image was analysed using BioRad Quantity One gel analysis software.

Lanes were defined, amplicons detected and standards assigned. Densimetric

profiles were generated and lanes were aligned using the internal pUC19/Hpa II

(Fisher Biotec) standards.

CR1 and CR1-like Sequencing

Primer Sequences:

CR1 specific primers

CR1-F1 5’AAT TCC AAA TTG GCC TGG TT 3’ and

CR1-R1 5`AAA CTTT AAC TTT GAG ATG TGG AAC A 3’

CR1-like specific primers

CR1MCP5 5`AAT TCC AAA TTG GCC TGG TTG A 3` and

CR1MCP6 5`CCT TCC CTT TGA GAT GTG GAA CA 3`.

Band purification and sequencing

PCR products were analysed using a 2% agarose gel. Individual bands were cut

from the gel and purified using Amersham Biosciences GFX PCR Gel Band

Purification Kit. The purified products were amplified as above and sequenced.

BstN1 digestion

Page 93: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 15

Polymorphism at nucleotide 3093 was detected using PCR amplification

and BstN1 digestion. This was performed using primers and methods

detailed by Birmingham6. PCR conditions were as above, except the

annealing step was at 60°C for 45 seconds. Sequence analysis suggest

that the primers amplify the site telomeric of CR1 j1 (repeated in CR1 as

shown in Fig. 1) but not CR1-like because of differences in the primer

sites.

ACKNOWLEDGMENTS

We are grateful to Dr D Sayer and Royal Perth Hospital for the

sequencing of the products, Dr Peter Kesners for advice and Ms Wendy

Ford for administration. The samples designated RSA-C and RSA-P are

from the Sanjay Gandhi institute of Medical Sciences (SA) Those

designated ARL-C, SLE-P and SS are from the Arthritis Research

Laboratory, Hanson Institute (SL) with acknowledgements to Dr Maureen

Rischmueller of Rheumatology, The Queen Elizabeth Hospital, Adelaide .

The HCT and PV samples were described previously 16. Supported by

Australian Research Council, C Y O’Connor Village Foundation and

Genetic Technologies Ltd., Fitzroy, Victoria 3065, Australia.

CONFLICTS

As stated in the Form.

Page 94: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 16

REFERENCES

1. Dawkins, R. et al. Genomics of the major histocompatibility complex:

haplotypes, duplication, retroviruses and disease. Immunological

Reviews 167, 275-304 (1999).

2. Dhiman, N., Jacobson, R. & Poland, G. Measles virus receptors: SLAM

and CD46. Reviews in Medical Virology 14, 217-229 (2004).

3. McLure, C. et al. Amino acid patterns within short consensus repeats

define conserved duplicons shared by genes of the RCA complex.

Journal of Molecular Evolution 59, 143-157 (2004).

4. McLure, C., Williamson, J., Stewart, B., Keating, P. & Dawkins, R. Genomic

analysis reveals a duplication of eight rather than seven SCRs in Primate

CR1 and CR1L: Evidence for an additional set shared between CR1 and

CR2. Immunogenetics 56, 631-638 (2004).

5. McLure, C., Williamson, J., Stewart, B., Keating, P. & Dawkins, R. Indels

and imperfect duplication have driven the evolution of human CR1 and

CR1-like from their precursor CR1 alpha: Importance of functional sets.

Hum Immunol 66, 258-273 (2005).

6. Birmingham, D.J. et al. A CR1 polymorphism associated with constitutive

erythrocyte CR1 levels affects binding to C4b but not C3b. Immunology

108, 531-538 (2003).

7. Moulds, J.M. et al. Molecular identification of Knops blood group

polymorphisms found in long homologous region D of complement

receptor 1. Blood 97, 2879-2885 (2001).

8. Logar, C.M., Chen, W., Schmitt, H., Yu, C.Y. & Birmingham, D.J. A human

CR1-like transcript containing sequence for a binding protein for iC4 is

expressed in hematopoietic and fetal lymphoid tissue. Molecular

Immunology 40, 831-40 (2004).

Page 95: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 17

9. Xu, C. et al. A Critical Role for Murine Complement Regulator Crry in

Fetomaternal Tolerance. Science 287, 498-501 (2000).

10. Bell, E. Murine embryonic survival depends on regulation of complement.

Immunology Today 21, 109 (2000).

11. Longman-Jacobsen, N., Williamson, J.F., Dawkins, R.L. & Gaudieri, S. In

Polymorphic Genomic Regions Indels Cluster with Nucleotide

Polymorphism: Quantum Genomics. Gene 312, 257-261 (2003).

12. Heine-Suner, D. et al. A high-resolution map of the regulator of the

complement activation gene cluster on 1q32 that integrates new genes

and markers. Immunogenetics 45, 422-427 (1997).

13. Gaudieri, S., Longman-Jacobsen, N., Tay, G.K. & Dawkins, R.L. Sequence

Analysis of the MHC Class I Region Reveals the Basis of the Genomic

Matching Technique. Human Immunology 62, 279-285 (2001).

14. Witt, C. et al. Unrelated Donors Selected Prospectively by Block-

matching Have Superior Bone Marrow Transplant Outcome. Human

Immunology 61, 85-91 (2000).

15. Sonnhammer, E. & Durbin, R. A dot-matrix program with dynamic

threshold control suited for genomic DNA and protein sequence analysis.

Gene 167, GC1-10 (1995).

16. Korendowych, E. et al. Haplotypes associated with psoriasis, psoriatic

arthritis and haemochromatosis. in IHWG Press (in press, Seattle, 2002).

17. Hourcade, D., Holers, V.M. & Atkinson, J.P. The regulators of complement

activation (RCA) gene cluster. Advances in Immunology 45, 381-416

(1989).

18. Xiang, L., Rundles, J.R., Hamilton, D.R. & Wilson, J.G. Quantitative alleles

of CR1: coding sequence analysis and comparison of haplotypes in two

ethnic groups. J Immunol 163, 4939-45 (1999)

Page 96: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 18

.Figure Legends

Figure 1. Multiple binding and amplification by primer pairs. Schematic

representation of the genomic region on 1q32 showing the duplicated

segments (purple and blue bars) containing the CR1 and MCP genes. The red

and green lines indicate the positions of the forward (CR1MCP 5) and reverse

primers (CR1MCP 6) designated P5+6. The amplified sequences of CR1 (purple)

and CR1-like (blue), including Celera, have been aligned to show conserved

regions flanking a polymorphic geometric element containing multiple complex

components which distinguish CR1 and CR1-like sequences. Black shading and

white text indicates conserved sequence. Numbers above and below the

alignment represent nucleotide positions of CR1-like (Celera - NT_086601) and

CR1 (NCBI – NT_021877.16) respectively. Also shown are locations of primers P

11+12 and BstN1 cutting sites (see Table 1). Conserved nucleotides at CR1-like

positions 289-391 are part of a L1 element.

Figure 2. Genomic polymorphism within the CR1/MCP duplicons. GMT P5+6

profiles following polyacrylamide gel separation were overlayed using internal

molecular weight markers of 242, 331 and 404bp (solid vertical lines) .

Amplicons differ between individuals (broken vertical lines). Bands have been

assigned numbers from the smallest (1) to the largest (19). Some such as 8 are

rare in Caucasians

Page 97: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 19

Figure 3. Reproducibility of the GMT profiles. GMT P5+6 profiles using different

PCR conditions demonstrate the reproducibility of the method. The internal

markers are as in Fig. 2.

Figure 4. Segregation of ancestral haplotypes. GMT P5+6 profiles from 3-

generation families confirm unequivocal segregation of haplotypes. In each

case the profile overlay has been restricted to 2 generations. Individual profiles

are coloured as shown in the family tree and the laboratory specimen codes.

The number assigned to each band is derived from Fig 2.

Figure 5. CR1.02 and CR1.08 haplotype frequencies differ in different clinical

groups.

RCA-C Recurrent Spontaneous Abortion control group; RCA-P - Recurrent

Spontaneous Abortion; HCT - Haemochromatosis; PV - Psoriasiss Vulgaris; ARL-C

– Adelaide Research Laboratory control group; SLE-P - Systemic Lupus

Erythematosus; SS - Sjögren's Syndrome; AH 02 = Ancestral Haplotype 02 -

P5+6=4,0;P11+12=1,13;BstN1-G is rare in RSA but common in PV whereas AH 08 -

Ancestral Haplotype 08 - P5+6=6,13;P11+12=5,11;BstN1-T shows the opposite.

Although less dramatic, the binomial probability mass function (EXCEL) shows a

decrease in 02 (p=0007) and an increase in 08 (p=002) when RSA-S is compared

to RSA-C.

Page 98: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 20

Figure 6. Sequencing reveals the complexity of the haplospecific element and

differences between CR1 and CR1-like. Sequence alignment identifies

potential indels and polymorphic elements. The TC rich region is highly

polymorphic in keeping with other haplospecific elements. Black shading and

white text indicates consensus sequence on either side of the indel

polymorphic region. The differences between CR1-like and CR1 are (i) G at 101,

105, 109, 113, 126 and 130 (*); (ii) length differences between 102 and 281bp;

(iii) other indels. For the purposes of classifying the sequences of products we

used (i) with or without the remainder. Numbers above and below the

alignment represent nucleotide position of CR1-like (Celera - NT_086601) and

CR1 (NCBI - NT _021877.16) respectively. Note “Y” indicates nucleotide C/T.

Figure 7. Polymorphisms within SCR subfamilies

CCPs such as CR1, CR1-like and Crry contain Short Consensus Repeats 17 which

we have classified into subfamilies as a, b, c etc.3-5 Each CCP has its particular

order such as (ajefbkd)5 ch in the case of CR15 but the subfamilies are

remarkably conserved as indicated by the degree of shading. Some of the

known SNPs 6,7,18 have been mapped to the subfamilies since those changing

conserved residues are likely to have profound functional effects. SNPs within

a, j or e are likely to alter ligand binding.6 The BstN1 site is within j.

^ Translated from the mRNA sequence but absent in respective protein

sequence.

Note Hosa is Homo sapien, Mumu is Mus musculus, Rano is Rattus norvegicus,

Patr is Pan troglodytes, Paha is Papio hamadryas and Pacy is Papio

Cynocephalus.

Page 99: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 21

Table 1. RCA haplotypes in an ethnically diverse DNA panel. The P5+6

haplotypes identified in the segregation studies and homozygotes were used to

deduce the haplotypes of additional unrelated individuals. A similar approach

was taken with P11+12 and the combination of P5+6 and P11+12 used to assign

the Ancestral Haplotype number. No deviation from Hardy-Weinberg

equilibrium was observed confirming that heterozygotes can be assigned. Only

the 15 most common are shown here. These account for approximately 70% of

the population studied. After assignment of these, BstN1 typing revealed that

each had either G or T at the cutting site on CR1. At least 15 rarer haplotypes

were identified but at a frequency of less than 1%. Some of these may be

ethnic specific. Some haplotypes also differ in minor bands not illustrated here.

Table 1

Ancestral GMT typing BstN1 Frequency

Haplotype P5+6 P11+12 typing n %

01 5,0 1,13 G 156-165 24-25

02 4,0 1,13 G 80-83 12-13

03 5,16 1,15 G 75-77 12

04 5,13 1,11 G 18-24 3-4

05 6,0 4,13 T 24-29 4

06 5,14 1,15 G 18 3

07 5,17 1,15 G 16 2

08 6,13 5,11 T 11-16 2

09 5,15 1,15 G 15 2

10 6,0 1,13 T 12-13 2

11 6,9 4,17 G 7-9 1

12 4,0 1,12 G 8 1

13 5,0 1,19 G 6-8 1

14 5,0 1,18 G 7-8 1

15 4,14 1,11 G 8 1

Sub-Total 461-497 71-77

Other 108-152 17-23

Total 569-649 88-100

Page 100: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

McLure et al Page 22

Table 2. Percentage frequencies of ancestral haplotypes in different clinical

groups. Abbreviations as in Fig. 5. The n value refers to the number of

chromosomes and adds to 644. Because of some ambiguities, ranges of

frequency are shown in some instances and the total number of possible

haplotypes is 682. The percent frequencies are similar in the two control groups

and in HCT, SLE and SS but some haplotypes are strikingly different when RSA-P

and PV are compared.

Table 2

Diseasegroup

RSA-C RSA-P HCT PV ARL-C SLE-P SS

Number of chromosomes in sample

Ancestral n=74 n=92 n=48 n=132 n=84 n=58 n=156

Haplotype Haplotype Frequencies (%)

01 15 17-18 25 23-25 22-24 21-24 32

02 12 2-3 10 20 9-10 13-15 13

03 11 6-7 17 7 11-12 8 18

04 3 3-5 2 2-3 3 3 3-4

05 8 2 6 4-7 2-3 3 2

06 2 1 3 5 6

07 6 3 3 2 3

08 1 6-9 2 3 1 2-3 1

09 1 2 4 2 5 1

10 1-3 2 2 4 1 1

11 2 1-2 2 2 1

12 4 1 3 1

13 3 1 2 1-2

14 1 2-3 2 3 1

15 3 1 1 1 2

Other 42-43 51-59 17 10-19 29-34 26-32 15-17

Number of possible haplotypes

n=76 n=102 n=48 n=142 n=92 n=62 n=160

Page 101: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

BstN

1 R

FLP locatio

ns

Po

lym

orp

hic

Geo

metr

ic E

lem

en

t

5' C

on

served

Reg

ion

1

1 0

2 0

2 5

2 6

2 9

3 0

3 1

4 0

4 1

5 0

6 0

7 0

8 0

8 1

8 2

9 0

9 5

CR1-like (NT_086601)

AATTCCAAATTGGCCTGGTTGACACTGTACAAAACCACCAGATAATTATAATTTTATTTAACTCTTTGTCTTCTTTTCTTT----CCTTCCCTCCTTCC

CR1-like (NT_021877.16)AATTCCAAATTGGCCTGGTTGACACTGTACAAAACCACCAGATAATTATAATTTTATTTAACTCTTTGTCTTCTTTTCTTT----CCTTCCCTCCTTCC

CR1 (NT_021877.16)

AATTCCAAATTGGCCTGGTTGACATGGTGCCAAACCACCAAATAATTATAATTTTATTTAACTCTTTGTCTTCTTTTCTTTCTTTCCTTCCCTCCCTCC

11 0

2 0

2 5

2 6

2 9

3 0

3 1

4 0

4 1

5 0

6 0

7 0

8 0

8 2

8 3

8 4

8 5

9 0

9 9

9 6

9 8

1 0 0

1 1 0

1 2 0

1 3 0

1 4 0

1 5 0

1 6 0

1 7 0

1 8 0

1 9 0

1 9 4

CR1-like (NT_086601)

CTTCTGCCTGCCTGCTTGCCTTCCTTCTTTGCTTGCTTCCTTCCTTCCTCCCTCCCTCCATCCCTCCCTTCCTCCCTCCCTCCCTTCCTTCCTTCCTTC

CR1-like (NT_021877.16)CTTCTGCCTGCCTGCTTGCCTTCCTTCTTTGCTTGCTTCCTTCCTTTCTCCCTCCCTTCCTCCCTCCTTTCCTTCCTTCCTCCCTCCCTCCCTCCCTTC

CR1 (NT_021877.16)

CTCCT-----------------------------------------------------------------------TTCCTCCCTCCCTCCCTCCCTCC

1 0 0

1 0 2

1 0 4

1 0 5

1 1 0

1 2 0

1 2 7

Po

lym

orp

hic

Geo

metr

ic E

lem

en

t3

' C

on

served

Reg

ion

1 9 5

2 0 0

2 1 0

2 2 0

2 3 0

2 4 0

2 5 0

2 6 0

2 7 0

2 8 0

2 9 0

2 9 3

CR1-like (NT_086601)

CTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCCTCCTTCCCTCCTTCCCTCCTTCCCTCCTTCCCTCCTTCCCTCCTTTCCTTCTCCTTATTTT

CR1-like (NT_021877.16)CTCCCTCCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCCTCCTTCCCTCCTTCCCT--------CCTTTCCTTCTCCTTATTTT

CR1 (NT_021877.16)

CTCCCTCCCTTCCTTCCTTCCTTCCTTCTTTCCTTCCTTCCTTCCTTCCTTCCCT-------------------------CTTTCCTT------ATTTT

1 2 8

1 3 0

1 4 0

1 5 0

1 6 0

1 7 0

1 8 0

1 8 2

1 8 3

1 9 0

1 9 1

1 9 5

2 9 4

3 0 0

3 1 0

3 1 1

3 1 2

3 2 0

3 3 0

3 4 0

3 5 0

3 6 0

3 7 0

3 8 0

3 8 6

3 8 7

3 9 0

3 9 1

CR1-like (NT_086601)

CTTTCTTCTTTACCACAC-GGCTAGGACCACCAGTATAACATTGAACATTGGTAGCAATAGATGTCATCCTTGTCTTGTTCCACATCTCAAAGGGAAGG

CR1-like (NT_021877.16)CTTTCTTCTTTACCACAC-GGCTAGGACCACCAGTATAACATTGAACATTGGTAGCAATAGATGTCATCCTTGTCTTGTTCCACATCTCAAAGGGAAGG

CR1 (NT_021877.16)

CTTTCTTCTTTACCACGCTGGCTAGGACCACCAGTATAACATTGAACATTGGTAGCAATAGATGTCATCCTTGTCTTGTTCCACATCTCAAAGTTAAAG

1 9 6

2 0 0

2 1 0

2 1 2

2 1 4

2 2 0

2 3 0

2 4 0

2 5 0

2 6 0

2 7 0

2 8 0

2 8 9

2 9 0

2 9 3

2 9 4

3' C

on

served

Reg

ion

a

10

0 k

b3

00

kb

20

0 k

b

CR

1M

CP

LC

R1L

MC

P

Segm

entA

Segm

ent B

CR

2C

D34

40

0 k

b5

00

kb

Figu

re 1

- M

ultip

le b

indi

ng a

nd a

mpl

ifica

tion

by p

rim

er p

airs

CR

1MC

P11

&C

R1M

CP

12

CR1MCP6

CR1MCP5

CR

1MC

P11

&C

R1M

CP

12

CR

1MC

P5

& C

R1M

CP

6C

R1M

CP

5&

CR

1MC

P6

Page 102: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0 R 8 8 / 1 4 0 6 2

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0 C 0 4 / 5 4 9 K

0

5 0

1 0 0

1 5 0

2 0 0

2 5 0 R 8 6 / 1 2 2 9 3

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0 R 8 / 5 2 5 1 9

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0 U 9 8 / 4 2 3 9 K

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0 C 0 4 / 5 5 9 C

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0 C 0 4 / 15 6 F

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0 C 0 4 / 1 5 7 M

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0 C 0 4 / 5 6 7 G

0

5 0

1 0 0

1 5 0

2 0 0 R 9 2 / 2 2 8 4 3 W

0

2 0

4 0

6 0

8 0

1 0 0 C 9 7 / 10 3 9 V

0 1 20 39 58 77 96 115

134

153

172

191

210

229

248

267

286

305

324

343

362

381

400

419

438

457

476

495

514

533

552

571

404bp

191817161514131211109

331bp

87654321

242bp

Run Duration

Ban

d In

tens

ity

Figure 2. Genomic polymorphism within the CR1/MCP duplicons

R88/14062

C04/549K

R86/12293

R88/52519

U98/4239K

C04/559C

C04/156F

C04/157M

C04/567G

R92/22843W

C97/1039V

Page 103: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0

1 6 0C 0 4 / 15 6 F " L R "

Ban

dIn

tens

ity

C04/156F

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0C 0 4 / 15 6 F " L M "

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0C 0 4 / 15 6 F " K Y "

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0C 0 4 / 15 6 F " L P "

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0C 0 4 / 15 6 F " L Q "

0 1 16 31 46 61 76 91 106

121

136

151

166

181

196

211

226

241

256

271

286

301

316

331

346

361

376

391

406

421

436

451

466

481

496

511

526

541

PCR Annealingtemp = 56oC

PCR Annealingtemp = 56oC

PCR Annealingtemp = 59oC

PCR Annealingtemp = 56oC

PCR Annealingtemp = 58oC

Run Duration

Ban

dIn

tens

ity

Run Duration

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0C 0 4 / 15 7 M " L R "

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0C 0 4 / 1 5 7 M " K Y "

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0C 0 4 / 15 7 M " L P "

0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

1 4 0

1 6 0C 0 4 / 15 7 M " L Q "

0 1 16 31 46 61 76 91 106

121

136

151

166

181

196

211

226

241

256

271

286

301

316

331

346

361

376

391

406

421

436

451

466

481

496

511

526

541

PCR Annealingtemp = 58oC

PCR Annealingtemp = 56oC

PCR Annealingtemp = 56oC

PCR Annealingtemp = 56oC

C04/157M

Figure 3 – Reproducibility of the GMT profiles

Page 104: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

020406080100

120

140 0

20

40

60

80

100

120

140

I 1 C04/0162X

5I 1a C04/0163D

51617

II 1 C04/0175J

516

II 1a C04/0176Q

56

20

III 1 C04/0174C

516

20

III 2 C04/0159A

56

III 3 C04/0177X

56

Obs

erve

dS

egre

gati

on

242bp

331bp

Ded

uced

Gen

otyp

es

404bp

BandIntensity BandIntensity

489bp

501bp

5

6

1617

16

5

20

IIIIII

1a1

41

21a

3

1

IIIIII

1a1

41

21a

3

1

ab

= 5, 0; 5, 0

cd

= 5,16; 5,17

ac = 5, 0; 5,16

ef

= 5,20; 6, 0

ce

= 5,16; 5,20

af

= 5, 0; 6, 0

af

= 5, 0; 6, 0

4

I

1a

1

41

21a

3

1

IIIII

4

I

1a

1

41

21a

3

1

IIIII

020406080100

120

140 020406080100

120

140

I 1

C04/0157M

45

16

I 1a

C04/0156F

46

914

II 2

C04/0172P

414

II 3

C04/0164K

56

916

II 3a

C04/0220C

45

III 1

C04/0165R

56

9III 2

C04/0166Y

46

9

BandIntensity BandIntensity

Obs

erve

dSe

greg

atio

nD

educ

edG

enot

ypes

16

9

645

242bp

331bp

404bp

489bp

501bp

I

IIIII

1

12

3

12

33a

211a

2

I

IIIII

1

12

3

12

33a

211a

2

4

56

91416

I

IIIII

1a

1

12

33a

12

1aI

IIIII

1a

1

12

33a

12

1a1

a1

12

33a

12

1a

ab= 4, 0; 5,16

cd= 4,14; 6, 9

ac = 4, 0; 4,14

bd= 5,16; 6, 9

ef= 4, 0; 5, 0

fd= 5, 0; 6, 9

ed = 4, 0; 6, 9

Family1

Family2

Figure4-Segregationofancestralhaplotypes

Page 105: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

RSA-C RSA-P HCT PV ARL-C SLE-P SS Total

Clinical Status

0

10

20

2

4

AH

02

(%

Fre

qu

en

cy)

AH

08

(%

Fre

qu

en

cy)

6

Figure 5. CR1.02 and CR1.08 haplotype frequencies differ in different clinical groups.

8

Page 106: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

5' Conserved R

egio

nPoly

morphic

Geom

etric

Ele

ment

3' Conserved R

egio

n

2 5

2 6

2 9

3 1

4 1

5 2

8 1

9 0

9 7

9 8*

**

*

1 4 1

1 5 0

1 5 1

1 5 2

1 5 3

1 5 5

1 5 7

1 6 5

1 6 6

1 7 3

1 7 7

1 7 8

2 0 5

2 0 7

2 2 6

2 7 3

2 7 4

2 7 8

2 7 9

2 8 4

2 8 9

3 1 0

3 2 7

3 8 6

3 8 7

3 9 0

CR1-like

(N

T086601)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)7

C(

CC

TT

CC

CT

)6

CT

CC

TT

CT

AA

-T

GG

G

CR1L_09001

(D

Q007064)

CT

AA

GA

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)1

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-C

GG

G

CR1L_09501

(D

Q007065)

CT

AA

GA

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)2

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-C

GG

G

CR1L_09501

(D

Q007066)

CT

AA

GA

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)2

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-C

GG

G

CR1L_09501

(D

Q007067)

CT

AA

GA

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)2

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-C

GG

G

CR1L_10001

(D

Q007068)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)3

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-T

GG

G

CR1L_15001

(D

Q007069)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)6

C(

CC

TT

CC

CT

)6

CT

CC

TT

CT

AA

-T

GG

G

CR1L_13001

(D

Q007070)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)6

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-T

GG

G

CR1L_13001

(D

Q007071)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)6

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-T

GG

G

CR1L_17001

(D

Q007072)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)8

C(

CC

TT

CC

CT

)6

CT

CC

TT

CT

AA

-T

GG

G

CR1L_11001

(D

Q007073)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)6

C(

CC

TT

CC

CT

)4

CT

CC

TT

CT

AA

-T

GG

G

CR1L_14001

(D

Q007074)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)7

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-T

GG

G

CR1L_14001

(D

Q007075)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)7

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-T

GG

G

CR1L_14001

(D

Q007076)

CT

AA

GT

T-

--

-C

CT

TC

CC

TT

CT

G(

CC

TG

)2

CT

TG

(C

CT

T)2

CT

TT

(G

CT

T)2

(C

CT

T)2

-C

CT

CA

CT

(C

CT

C)2

C(

CC

TT

)7

C(

CC

TT

CC

CT

)5

CT

CC

TT

CT

AA

-T

GG

G

CR1_05001

(D

Q007054)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)5

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_05001

(D

Q007055)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)5

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_05001

(D

Q007056)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)5

Y(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_05001

(D

Q007057)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)5

Y(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_04001

(D

Q007058)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)4

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_04001

(D

Q007059)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)4

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_06001

(D

Q007060)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)4

C(

CC

TT

)7

C(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_07001

(D

Q007061)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)4

C(

CC

TT

)8

C(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_03001

(D

Q007062)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)3

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1_04002

(D

Q007063)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

T(

CC

TC

)4

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

CR1

(N

T_021877.1

6)

TG

GC

AT

TC

TT

T-

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

--

-C

--

--

TT

C(

CC

TC

)4

C(

CC

TT

)5

T(

CC

TT

CC

CT

)1

-T

--

--

--

AG

TT

TT

A

2 5

2 6

2 9

3 1

4 1

5 2

8 1

8 2

8 3

8 4

8 5

9 4

1 0 4

1 0 6

1 1 4

1 1 5

1 3 0

1 3 4

1 3 5

1 5 4

1 5 6

1 7 5

1 8 2

1 8 6

1 9 1

2 1 2

2 1 4

2 3 0

2 8 9

2 9 0

2 9 3

CR1-like D

efinin

g R

egio

nPoly

morphic

Ele

ments

Fig

ure

6.

Se

qu

en

cin

g r

eve

als

th

e c

om

ple

xity o

f th

e h

ap

losp

ec

ific

ele

me

nt

an

d d

iffe

ren

ce

s b

etw

ee

n CR1

an

d CR1-like

.

Page 107: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

§ No residues

a C x x P x x x x x A x x x x x x x x x x F P x G T x L x Y E C x P x Y x x x x F S I x C x x x x x W x x x x D x C 57

b C Q P P P x x L H x E x x x x x x x x F x x G x E V x Y x C x P x Y D L R G x x x x x C x P Q G D W x P x x P x C 57

Hosa_CR1_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T C E V K S

Hosa_CR1_12 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T C E V K S

Hosa_CR1_19 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T C E V K S

Patr_CR1_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A T P T C E V K S

Patr_CR1_14 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A A P T C E V K S

Paha_CR1_5 C Q P P P D V L H G E R T Q R D K D I F Q T G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R C E V K S

Paha_CR1_12 C Q P P P D V L H G E R T Q R D K D I F Q P G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R C E V K S

Hosa_CR1-like_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G S T Y L H C T P Q G D W S P A A P R C

Patr_CR1-like_5 C Q P P P D V L H G E R T Q R D K D N F S P G E E V Y Y S C E P G Y D L R G S T Y L H C T P Q G D W S P E A P R C

Hosa_CR1_26 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C A V K S

Patr_CR1_21 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C A V K S

Patr_CR1_28 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C T V K S

Paha_CR1_26 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P I C T V K S

Pacy_CR1-like_5 C Q P P P E I L H G E H T P S H Q D F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W N P E A P I C

Paha_CR1_19 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C A V K S

Hosa_CR1_33 C Q P P P E I L H G E H T L S H Q D N F S P G Q E V F Y S C E P S Y D L R G A A S L H C T P Q G D W S P E A P R C T V K S

Patr CR1-like 10 C Q P P P E I L H G E H T L S H Q D N F S P G Q D V F Y S C E P G Y D L R G A A S L H C T P Q G D W T P E A P R C

Hosa_CR1-like_10 C Q P P P E I L H G E H T L S H Q D N F L P G Q E V F Y S C E P S Y D L R G A A S L H C M P Q G D W T P E A P R C

c C P x P P K I Q N G H x I G G H V S L Y L P G M T I x Y I C D P G Y L L V G K G x I F C T D Q G I W S Q L D H Y C 57

Hosa_CR1_36 C P D P P K I Q N G H Y I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y C K E V N

Patr_CR1_31 C P H P P K I Q N G H D I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y C K E V N

Paha_CR1_29 C P H P P K I Q N G H Y I G G H V S L Y L P G M T I G Y I C D P G Y L L V G K G I I F C T D Q G I W S Q L D H Y C K E V N

d C P x P P x I x N G R H x G x x x x x x P x G K x x x Y x C D P H x D R G x x x x L I G E S x I R x T S x x x G N G V W S S x A P R C 67

Patr_CR1_8 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Patr_CR1_16 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Patr_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Hosa_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Hosa_CR1_21 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Hosa_CR1_14 C P S P P V I P N G R H T G K P L E V F P F G K T V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Paha_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G H

Paha_CR1_14 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S R A P R C G I L G H

Hosa_CR1_28 C P N P P A I L N G R H T G T P S G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R C E L S V R A G H

Patr_CR1_23 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R C E L P V H A G H

Hosa_CR1_35 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D R Q G N G V W S S P A P R C E L S V P A A

Patr_CR1_30 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D P Q G N G V W S S P A P R C E L S V P A A

Hosa CR1-like 12 C P N P P A I L N G R H T G T P P G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R R T S E P H G N G V W S S P A P R C

Patr CR1-like 12 C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R C

Paha_CR1_21 C P N P P A I L N G R H T G A L L G D I P Y G K E I S Y T C D P H R D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R C E L S V R A G H

Paha_CR1_28 C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R C E L S V P A A

Pacy_CR1-like_7 C P N P P A I L N G R H I G A P L G D I P Y G K E V S Y I C D P H P D R G M T V N L I G E S T I R C T S D P Q G N G V W S S P A P R C

e C x x P P x I x N G D F x S x x R x x F x x x x V V T Y x C x x x x x x x x x F x L V G E x S x x C T S x x x x x G x W S x P x P x C 67

Hosa_CR1_10 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N K

Hosa_CR1_17 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N K

Hosa_CR1_3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N K

Patr_CR1_3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N K

Hosa_CR1_24 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N K

Patr_CR1_12 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N K

Patr_CR1_19 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W A P Q C I I P N K

Paha_CR1_10 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C I I P N K

Pacy_CR1-like_3 C G L P P T I A N G D F I S T S R E Y F P Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C

Paha_CR1_17 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C

Hosa_CR1-like_3 C G L P P T I A N G D F T S I S R E Y F H Y G S V V T Y H C N L G S R G K K V F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C

Patr_CR1-like_3 C G L P P T I A N G D F T S I S R E Y F H Y A S V V T Y H C N L G S G G K K V F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C

Paha_CR1_3 C G L P P T I D N G D F F S A N K E Y F H Y G S V V T Y R C N L G S G G R K L F E L V G E P S I Y C T S N E D Q V G I W S G P A P Q C I I P N K

Hosa_CR1-like_8 C G L P P N I T N G Y F I S T D R E Y F H Y G S V V T Y H C N L G S R G R K V F E L V G E P S I Y C T S K D D Q V G V W S G P V P Q C

Patr_CR1-like_8 C G L P P N I T N G Y F I S T D R E Y F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S K G D Q V G V W Q C

Rano_Crry_3 C E I P P S I P N G D F F S P N R E D F H Y G M V V T Y Q C N T D A R G K K L F N L V G E P S I H C T S I D G Q V G V W S G P P P Q C I E L N K

Mumu_Crry_3 C E I P P G I P N G D F F S S T R E D F H Y G M V V T Y R C N T D A R G K A L F N L V G E P S L Y C T S N D G E I G V W S G P P P Q C I E L N K

Hosa_CR1_31 C E P P P T I S N G D F Y S N N R T S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R C I S T N K

Patr_CR1_26 C E P P P T I S N G D F Y S N N R A S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R C I S T N K

Paha_CR1_24 C K P P P T I S N G D F Y S N N R T S F H S G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G A W S S P P P R C I S T N K

f C x x P x V x x x x x x x x N x S x F S L x x x V x F R C x x G F x M x G x x x V x C x x x x x W x P x L P x C 56

Hosa_CR1_32 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H C S R V

Patr_CR1_27 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H C S R V

Paha_CR1_25 C T A P E V K N G I R V P G N R S F F S L N E I V R F R C Q P G F V M V G S H T V Q C Q T N N R W G P K L P H C S R V

Rano_Crry_4 C T P P H V E N A V I V S K N K S L F S L R D M V E F R C Q D G F M M K G D S S V Y C R S L N R W E P Q L P S C F K V K S

Mumu_Crry_4 C T P P P Y V E N A V M L S E N R S L F S L R D I V E F R C H P G F I M K G A S S V H C Q S L N K W E P E L P S C F K G V I

Hosa_CR1_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R V

Hosa_CR1_11 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R V

Hosa_CR1_18 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R V

Hosa_CR1_25 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R V

Patr_CR1_13 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F A M K G P R R V K C Q A L N K W E P E L P S C S R V

Patr_CR1_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P P R V K C Q A L N K W E P E L P S C S R V

Patr_CR1_20 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V K C Q A L N K W E P E L P S C S R V

Patr_CR1-like_9 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C L P G F V M K R P P P R V Q C Q A L N K W E T E L P S C

Hosa_CR1-like_9 C T P P N V E G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V Q C Q A L N K W E T E L P S C

Pacy_CR1-like_4 C M P P N V E N G V L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R H V Q C Q A L N K W E P E L P S C

Patr_CR1-like_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R H V H C Q A L N K W E P E L P S C

Paha_CR1_11 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C S R V

Paha_CR1_18 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C S R V

Paha_CR1_4 C T P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C S R V

Hosa_CR1-like_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F G M K G P S H V K C Q A L N K W E P E L P S C

g-like x x x P H I x N G F R I x x x x P x x F x x x x x x x x x x x x x x x x x x x x x x x x x x

h C x x P x x M x G x x K x L x M K K x Y x Y G x x V x L x C E D G Y x L E G S x x S Q C Q x D x x W x P x L x x C 57

j C x x P x x P x N G x V H x x x x x x x G S x x x Y x C x x G x R L x G x x x x x C x x x x x x x x W x x x x P x C 58

Pacy_CR1-like_2 C R N P K D P V N G M V H V I K D I Q F G S Q I N Y S C N K G Y R L I G S S S A T C I I S G N T V I W D N E T P I C

^ Paha_CR1_2 C R N P R D P V N G M V H V I K D I Q F G S Q I N Y S C T E G H R L I G S S S A T C I I S G N T V I W D N E T P I C E K I S

Patr_CR1-like_2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G N T V I W D N K T P V C

Hosa_CR1-like_2 C R N P P D P V N G M A H V I K D I Q F G S Q I K Y S C P K G Y R L I G S S S A T C I I S G N T V I W D N K T P V C

Patr_CR1_2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I C D R I P

Hosa_CR1_2 C R N P P D P V N G M V H V I K G I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I C D R I P

Rano_Crry_2 C E T P L D P Q N G I V H V N T D I R F G S S I T Y T C N E G Y R L I G S S S A M C I I S D Q S V A W D A E A P I C E S I P

Mumu_Crry_2 C K T P S D P E N G L V H V H T G I Q F G S R I N Y T C N Q G Y R L I G S S S A V C V I T D Q S V D W D T E A P I C E W I P

Hosa_CR1_9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C Q R I P

Hosa_CR1_16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C Q R I P

Patr_CR1_11 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C Q R I P

Patr_CR1_18 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N S A H W S T K P P I C Q R I P

Hosa_CR1_23 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S T K P P I C Q R I P

Paha_CR1_16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I I S G N T A H W S T K P P I C Q R I P

Paha_CR1_9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C V T S G N T A H W S T K P P I C Q R I P

Hosa_CR1-like_7 C E T P P V P V N G M V H V I T D I H V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S M K P P I C

Patr_CR1-like_7 C E T P P V P V N G M V H V I T D I H V G S R I N Y S C I T G H R L I G H S S A E C I L S G N T A H W S M K P P I C

Hosa_CR1_30 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I C E I I S

Patr_CR1_25 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I C E I I S

Paha_CR1_23 C G T P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K E A P I C E I I S

k C x x x x G x L x x G x V x x P x x L Q L G A K V x F V C x x G x x L K G x x x S x C V L x G x x x x W N x S V P V C 59

Figure 7. Polymorphism within SCR submfamilies

A3650G

H1208R

G5575C

D1850H

A4041C

SILENTC5507G

P1827R

A4828G

R1601G

T4855A

S1610T

A4870G

I1615V

G3093T

Q981H

C5654T

T1876I

T455C

V115A

A1360G

T445A

T2078C

I684T

A4795G

K1590E

Page 108: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 115 -

Page 109: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 145 -

PRELUDE TO CHAPTER 5

GMT APPLICATIONS IN THE CANINE

The following chapter extends the application of the GMT to the canine genome by

utilising the duplication of DQ alpha and beta genes in the Dog Leukocyte Antigen

(DLA).

To do this, I used a similar approach as that described in chapter 4, by firstly identifying

the duplication, examining the duplicates for polymorphic loci and designing primers

that would amplify within each duplicon.

Initial results revealed extensive polymorphism within a family of 15 Blue Heelers.

Segregation of the profiles was faithful and in line with the results obtained through

conventional typing methods. As we would have predicted, the GMT results are in fact

more discriminatory than the current SBT and microsatellite methods used.

The discrimination by the GMT, in addition to its reduced cost, makes it a more feasible

alternative than those methods currently employed to determine MHC alleles and

parentage.

I am grateful to (i) Professor Roger Dawkins for his ideas, interpretations, comments

and assistance in the compilation of this study (ii) Dr Peter Kesners for the compilation

of data and figures, (iii) Ms Susan Lester for the DLA DQ and DR SBT of the Blue

Heeler family (iv) Dr Claire Amadou for annotation of the DLA class II genes and the

MHC class II analyses, (v) Dr Brent Stewart for the collection of specimens, (vi) Dr

Page 110: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 146 -

Dean Male and Genetic Technologies Ltd for the Microsattelite typing of the Blue

Heeler Family and for providing other canine sspecimens, (vii) Dr Joseph Williamson

for his comments and suggestion to the text. Mr Ryan Southall and Mr Oscar Kalai for

for the laboratory testing of samples.

Page 111: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 147 -

CHAPTER 5

HAPLOTYPING OF THE CANINE MHC WITHOUT THE NEED

FOR DLA TYPING

The work described in this chapter is in preparation for publication:

McLure, C., P. Kesners, S. Lester, D.Male, C. Amadou, J. Dawkins, B. Stewart, J.

Williamson and R. Dawkins. (In Press). "Haplotyping of the Canine MHC without the

need for DLA typing". Intl J. Immunogenetics

Page 112: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 148 -

Page 113: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 149 -

Haplotyping of the Canine MHC without the need for DLA

typing

Craig A McLure1,2, Peter W Kesners1,2, Susan Lester2,3, Dean Male4, Claire

Amadou2, John R Dawkins2, Brent J Stewart1,2, Joseph F Williamson1,2 & Roger L

Dawkins1,2,6

1Faculty of Medicine and Dentistry, University of Western Australia, Nedlands, Western

Australia 6907

2C.Y. O’Connor ERADE Village, PO Box 5100, Canning Vale, Western Australia 6155

3Arthritis Research Laboratory, Hanson Institute, Frome Road Adelaide, South Australia 5000

4 Genetic Technologies Ltd, 66 Hanover St, Fitzroy, Victoria 3065

Manuscript number 0504 of the C Y O’Connor ERADE Village.

6Correspondence should be addressed to RLD:

Professor Roger Dawkins

CY O’Connor ERADE Village

PO Box 5100

Canning Vale South

Western Australia 6155

Facsimile: +618 9397 1559

Email: [email protected]

Page 114: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 150 -

Abstract

The Genomic Matching Technique has proven useful in MHC haplotyping in man. We

have adopted a similar approach in Australian Cattle Dogs and report that genotyping

can be achieved with a single assay.

Keywords

Genotyping, Haplotypes, MHC, Dog.

Introduction

Recently, there has been widespread demand for genotyping of dogs. The purposes can

be classified into three major categories:

1. PROOF OF IDENTITY AND PARENTAGE

Identification through DNA profiling has become necessary for purebred dogs to be

registered with the American Kennel Clubs (DeNise et al. 2004) and is already

established as the gold standard in other species. Electronic methods such as

microchips do not, in themselves, provide identity, parentage or genetic background.

Microchip identification has been subject to abuse.

2. BREED CHARACTERISATION AND EVOLUTION

Many dog breeds are being characterised with the intention of defining DNA based

criteria. There is increasing interest in the evolution of breeds and in addressing the

paradox whereby genetically distinct breeds have evolved very recently from a

Page 115: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 151 -

common ancestral pool.(Hedrick et al. 2003; Kennedy et al. 2002a) As in other

species, the first target is to be able to confirm or deny outcrossing but the ultimate

hope is to identify the genes which are responsible for the characteristics of each

breed whilst allowing selection for desirable traits.

3. ELIMINATION OF DISEASE AND IDENTIFICATION OF GENES IN

CANINE MODELS OF HUMAN DISEASE

Although, as expected with inbreeding, there are many recessive monogenic

diseases, it is now appreciated that there are many others which are polygenic and

therefore difficult to address using existing methods. It is known that dogs have

similar genomic architecture to man and develop many diseases which closely

resemble their human counterparts suggesting that the genetics and the

pathophysiology may be similar.(Debenham et al. 2005; Ollier et al. 2001; Wagner

2003) In some instances, such as myasthenia gravis, the diseases appear to be the

same (Garlepp et al. 1984). Identification of disease susceptibility genes in either

species would benefit both.

Existing methods have shortcomings with respect to one or more of these purposes.

1. DNA PROFILING

Although microsatellites have proven useful in terms of identity checking,

evolutionary studies and disease linkage, they are of limited value in distinguishing

between members of a highly inbreed group. Further, some of the selected

microsatellites are relatively uninformative whether due to instability or limited

Page 116: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 152 -

polymorphism. Improvements can be obtained by testing more sites and by

selecting more stable and more complex sequences (Williamson et al. 2003)but

advantages in terms of identification come at cost in terms of breed characterisation

and other applications. Even with high volumes and multiplexing, the capital cost

can be high.

2. MONOGENIC TESTING

Although useful in relation to the detection of known mutations, such as those

responsible for blindness, (Acland et al. 1998; Kijas et al. 2003) these tests have

limited utility in other respects. As the number of mutation-specific tests increases,

so the battery of tests required becomes impractical.

3. GENOTYPING OF POLYMORPHIC LOCI

Molecular typing of the DR and DQ genes is now well established (Kennedy et al.

2002b) and serves as an example of the power of typing polymorphic loci. Based

on the experience in man, this approach will be useful in all respects, including

disease association, simply because of the extraordinary degree of stable

polymorphism. (Sayer et al. 2001) However, the cost of sequence based typing is

significant and, as other loci are added, the problem compounds.

With the above purposes and shortcomings in mind, we have evaluated an approach,

which has proven useful in other species.(Gaudieri et al. 2001; Martinez et al. 1994) At

issue was whether it would be possible to apply a single cost effective approach to

contribute to all major issues requiring solutions.

Page 117: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 153 -

The principle of the assay is to target regions of the genome with maximal information

content. We have found that the genome is actually quite uneven in the distribution of

critical polymorphic regions. Polymorphic frozen blocks are rich in nucleotide diversity,

indels, duplications and disease genes and can be located using appropriate

bioinformatic tools(Dawkins et al. 1999).

Methods

The 3 generation family

The test panel consisted of 21 Blue heelers (Australian Cattle Dogs) including, 1

grandmother designated “cd”, 2 parents designated “ac” and “ef” and 15 puppies from 2

matings designated “ae”, “af”, “ce” or “cf”.

DLA Sequence Based Typing (SBT)

DLA DRB1, DQA1, DQB1 typing was performed by sequence based typing using 14th

IHWC DLA Workshop protocols. Allele nomenclature and sequence alignments.

(http://www.ebi.ac.uk/ipd/mhc/dla/align/html)

Microsatellite Profiling

Microsatellite profiling was undertaken using sequences as described by Francisco

(Francisco et al. 1996) and Halverson (Halverson et al. 1995)

GMT Approach

The GENOMIC MATCHING TECHNIQUE is based on generating haplotype markers

with a single primer pair which amplifies duplicated sites (Figure 1). A single test

identifies maternal and paternal haplotypes of sequences of up to several hundred

Page 118: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 154 -

kilobases. Within this sequence are multiple linked polymorphisms, both coding and

non coding, indels and duplications. Thus, differences in copy number and regulation

can be detected and, in this way, there is more information than with the alternative

tests. Importantly, the cost is less rather than more. The DLA GMT method was

designed using the following steps:

1. Identification of duplicons.

The genomic region containing DLA DQA1 and DQB1 , was taken from the

NCBI database (http://www.ncbi.nlm.nih.gov/) (positions DQA1, GeneID:

474861, pos 133484 to 168500 and DQB1, GeneID: 474862 pos 185720C to

222182 on contig NW_139872.1 (gi: 54126008). This sequence was compared

against itself using Dotter (Sonnhammer and Durbin 1995) to identify evidence

of duplication.

2. Selection of primer sites present in all duplicons.

Segment A, containing DQA1 was compared to Segment B, containing DQB1.

Regions within these two segments which shared a complex geometric element

were identified as targets. The geometric element must vary in size between the

duplicates but also contain enough homology either side of the element so as to

enable the design of primers that will bind and amplify within each segment.

The resulting mix of products has the potential to define extensive haplotypes.

Duplicons at position 143791-144189 (DQA1) and 210163-210376 (DQB1) of

NW_139872.1 were aligned using Clustalw (http://www.es.embnet.org/cgi-

bin/clustalw.cgi). Using Primer3 (http://frodo.wi.mit.edu/cgi-

bin/primer3/primer3_www.cgi), primers were designed so that a single primer

Page 119: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 155 -

pair will bind and amplify both duplicates or even more if, as expected, there are

more than two duplicated segments on some haplotypes.

Primer sequences were compared to the NCBI databases using BLASTN

(http://www.ncbi.nlm.nih.gov/BLAST/) at low stringency. Sequence identities

which matched the primers in both the forward and reverse directions were

identified. The only significant matches for primers in question were in close

proximity and it could therefore be assumed the primer pair would amplify

within a PFB. The intention is to amplify as many duplicated sites as possible

so long as there is no amplification of unlinked sequences. Linkage of amplified

products is tested by segregation through a three generation nuclear family. If

the primers are valid, segregation through generations should be apparent.

3. Amplification and analysis of DLA DQ alpha and DLA DQ beta

Haplospecific Geometric Elements (HGEs)

Genomic DNA was prepared using the standard salting-out method.

Primer Sequences:

Cafa class II

F1 5` AAA TAG CCT GGC TTT CTT TAC A 3` and

R1 5’ CAA CCA ACA ATT TCT GGG ACA 3`

F2 5` TCT GGC ATA TAG AAA GCA CTC TG 3` and

R2 5` CCT CCC TTC CCA CCT AAT GT 3`

F3 5` ATT TGC TCA AGG TCC ATC CA 3` and

R3 5` TAT GGA AAC CAC CAA AGT GC 3`

Page 120: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 156 -

Polymerase Chain Reaction:

PCR reactions were performed in a 96-well Palm Cycler (Corbett Research) in

20µl volumes using 100 ng of template DNA, 1.3 U Taq Polymerase (Fisher

Biotec), 10 pmol of the forward and reverse Cafa class II primers, 200µM of

each dNTP, 2 mM MgCl2 and 1X PCR buffer (Fisher Biotec). The samples were

denatured at 940C for 5 min, followed by 30 cycles each comprising 30 seconds

at 940C, 45 seconds at 590C and 45 seconds at 720C. The last cycle was followed

by an additional extension for 5 minutes at 720C.

Detection of amplicons and haplotypes:

The separation and detection of the allelic variants of DQ alpha and DQ beta

was done with the Corbett Research GS-3000 automated gel analysis system.

One microlitre of PCR product was mixed with 1 µl of loading buffer containing

pUC19 molecular weight ladder. One microlitre of the PCR sample and loading

buffer mixture was then added to a 32 cm long, 48 well, 4% polyacrylamide,

ultra-thin gel and pulsed for 10 seconds. Excess sample was then flushed and the

gel was run at 2000 V for 180 minutes.

Gel analysis and profile generation.

The gel image was analysed using BioRad Quantity One gel analysis software.

Lanes were defined, amplicons detected and standards assigned. Densimetric

Page 121: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 157 -

profiles were generated and lanes were aligned using the internal pUC19/Hpa II

(Fisher Biotec) standards.

Results

In the first instance we chose to compare the GMT of the delta block of the MHC with

molecular typing of DR and DQ. The results are shown in Figure 2.

Individuals are indicated by name and generation number. Thus Caddy’s daughter,

Petal, was mated twice with Plugger and had 7 and then 8 puppies. By convention I 1a

is cd, and II 1a is ef. After amplification with primer pair 1+2, each family member

gives 2 polymorphic products and these segregate as the maternal and paternal

haplotypes. Each animal can be assigned a genotype such as cf in III 1 and III 3 and ae

in III 2 and III 6.

When the family members were typed by SBT at the DR and DQ loci, the conclusions

were the same with the exception that the GMT results avoided ambiguities in allele

calling due to heterozygosity. Surprisingly, given the experience in humans, we are

able to correlate GMT products to alleles as well as haplotypes. For example, at least

within this family, we obtain surrogate DQB1 typing without the need for SBT. For

example P2, 4 and 5 mark DQB1 00301, 00501 and 02002 respectively. P9 and 8 split

DQB 00101 suggesting that GMT is not only simpler but also more informative.

When additional siblings were tested the conclusions were the same. When unrelated

Blue heelers were compared, additional products were seen but these also correlated

with DQB1 typing. Thus Ned had P5 and DQB1 2002, as expected, whereas the P3

Page 122: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 158 -

product marks DQB1 02301 and P7, as with P8 and 9, marks DQB1 00101. Kinky

appeared to be homozygous by molecular typing but with GMT there was a second

higher molecular weight product which is yet to be characterized.

When the results were considered in terms of haplotypes, a and f are identical at

DRB1(00201), DQA1(00901) and DQB1(00101) but GMT indicates that there is a

specific and consistent difference at the sequence level. Note that a and f were present

in 2 generations and occurred separately or together in 15 subjects and can therefore be

assumed to be stable ancestral haplotypes. Further breeds will be examined to

determine the distribution of such haplotypes.

When microsatellite DNA profiling was compared with GMT, the conclusions were the

same in terms of parentage but more sites had to amplified and, as yet, there is no way

of haplotyping the MHC.

Conclusions

We conclude that GMT is a cost-effective alternative to SBT and microsatellite

profiling at least within the MHC. One simple assay produced unambiguous

genotyping of a three generation family. Each haplotype gave a unique product.

Surprisingly, these products also identified the DQB1 alleles unequivocally suggesting

that the polymorphic elements are remarkably specific at least within the Blue heeler

breed. Furthermore, GMT is more polymorphic than DQB1 in that the DRB1 00201,

DQA1 0901, DQB1 00101 haplotype by molecular typing actually splits into 2 ancestral

haplotypes. For the purpose of associating functions or diseases with MHC haplotypes,

GMT appears preferable.

Page 123: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 159 -

The present study also demonstrates proof of principle with respect to the strategy of

identifying PFB and then duplicons which can be used to expand polymorphism over

those which could be demonstrated with a single amplicon. Undoubtedly, this

expansion is due to the differences in length and sequence content of the element,

differences in priming efficiency and potential interactions between the products.

It is now appropriate to apply the GMT strategy to the genotyping of other genomic

regions known to be highly polymorphic. The next region to be evaluated will be class

I within the MHC so that megabase (rather than kilobase) haplotypes can be identified

as in man. We will then address the RCA region which has already been haplotyped

successfully in man (McLure et al. Submitted). Similarly, we will use other gene

clusters known to encode disease susceptibility. In this way, a single GMT assay will

be able to contribute to the identification of individuals, breeds and diseases.

The same approach can be used to discover other polymorphic genomic regions. We

use bioinformatic tools to identify duplications and then PFB throughout the genome.

We expect that most of the important polymorphism will be contained within perhaps

10 regions each consisting of 2 or 3 blocks of several hundred kilobases. Thus as little

as 1% of the genome will provide a relatively high yield for all of our purposes.

In parallel with targeting blocks, we will take account of existing information on the

likely locations of disease genes with the intention of developing GMT assays which

will assist in the evaluation of monogenic as well as polygenic diseases.

Page 124: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 160 -

References

Acland, G. M., Ray, K., Mellersh, C. S., Gu, W., Langston, A. A., Rine, J., Ostrander,

E. A., and Aguirre, G. D.: Linkage analysis and comparative mapping of canine

progressive rod-cone degeneration (prcd) establishes potential locus homology

with retinitis pigmentosa (RP17) in humans. Proc Natl Acad Sci U S A 95:

3048-53, 1998

Dawkins, R., Leelayuwat, C., Gaudieri, S., Tay, G., Hui, J., Cattley, S., Martinez, P.,

and Kulski, J.: Genomics of the major histocompatibility complex: haplotypes,

duplication, retroviruses and disease. Immunological Reviews 167: 275-304,

1999

Debenham, S. L., Hart, E. A., Ashurst, J. L., Howe, K. L., Quail, M. A., Ollier, W. E.,

and Binns, M. M.: Genomic sequence of the class II region of the canine MHC:

comparison with the MHC of other mammalian species. Genomics 85: 48-59,

2005

DeNise, S., Johnston, E., Halverson, J., Marshall, K., Rosenfeld, D., McKenna, S.,

Sharp, T., and Edwards, J.: Power of exclusion for parentage verification and

probability of match for identity in American Kennel Club breeds using 17

canine microsatellite markers. Anim Genet 35: 14-7, 2004

Francisco, L. V., Langston, A. A., Mellersh, C. S., Neal, C. L., and Ostrander, E. A.: A

class of highly polymorphic tetranucleotide repeats for canine genetic mapping.

Mamm Genome 7: 359-62, 1996

Garlepp, M. J., Kay, P. H., Farrow, B. R., and Dawkins, R. L.: Autoimmunity in

spontaneous myasthenia gravis in dogs. Clinical Immunology and

Immunopathology 31: 301 - 306, 1984

Page 125: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 161 -

Gaudieri, S., Longman-Jacobsen, N., Tay, G. K., and Dawkins, R. L.: Sequence

Analysis of the MHC Class I Region Reveals the Basis of the Genomic

Matching Technique. Human Immunology 62: 279-285, 2001

Halverson, J., Dvorak, J., and Stevenson, T.: Microsatellite sequences for canine

genotyping., USA, 1995

Hedrick, P. W., Lee, R. N., and Buchanan, C.: Canine parvovirus enteritis, canine

distemper, and major histocompatibility complex genetic variation in Mexican

wolves. J Wildl Dis 39: 909-13, 2003

Kennedy, L. J., Barnes, A., Happ, G. M., Quinnell, R. J., Bennett, D., Angles, J. M.,

Day, M. J., Carmichael, N., Innes, J. F., Isherwood, D., Carter, S. D., Thomson,

W., and Ollier, W. E.: Extensive interbreed, but minimal intrabreed, variation of

DLA class II alleles and haplotypes in dogs. Tissue Antigens 59: 294-304, 2002a

Kennedy, L. J., Barnes, A., Happ, G. M., Quinnell, R. J., Courtenay, O., Carter, S. D.,

Ollier, W. E., and Thomson, W.: Evidence for extensive DLA polymorphism in

different dog populations. Tissue Antigens 60: 43-52, 2002b

Kijas, J. W., Miller, B. J., Pearce-Kelling, S. E., Aguirre, G. D., and Acland, G. M.:

Canine models of ocular disease: outcross breedings define a dominant disorder

present in the English mastiff and bull mastiff dog breeds. J Hered 94: 27-30,

2003

Martinez, O. P., Witt, C. S., Tay, G., Christiansen, F. T., and Dawkins, R. L.:

Immunogenetic analysis of successful and rejected bone marrow grafts within

one family. European Journal of Immunogenetics 21: 365-372, 1994

McLure, C., Williamson, J., Smyth, L., Agrawal, S., Lester, S., Millman, J., Keating, P.,

Stewart, B., and Dawkins, R.: Extensive genomic and functional polymorphism

of Complement Control Proteins. Submitted

Page 126: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 162 -

Ollier, W. E., Kennedy, L. J., Thomson, W., Barnes, A. N., Bell, S. C., Bennett, D.,

Angles, J. M., Innes, J. F., and Carter, S. D.: Dog MHC alleles containing the

human RA shared epitope confer susceptibility to canine rheumatoid arthritis.

Immunogenetics 53: 669-73, 2001

Sayer, D., Whidborne, R., Brestovac, B., Trimboli, F., Witt, C., and Christiansen, F.:

HLA-DRB1 DNA sequencing based typing: an approach suitable for high

throughput typing including unrelated bone marrow registry donors. Tissue

Antigens 57: 46-54, 2001

Sonnhammer, E. and Durbin, R.: A dot-matrix program with dynamic threshold control

suited for genomic DNA and protein sequence analysis. Gene 167: GC1-10,

1995

Wagner, J. L.: Molecular organization of the canine major histocompatibility complex.

J Hered 94: 23-6, 2003

Williamson, J. F., Gaudieri, S., Longman-Jacobsen, N., and Dawkins, R. L.: The use

and abuse of microsatellites markers. In J. A. Hanson and B. Dupont (eds.),

HLA 2002 Immunobiology of the Human MHC, Seattle. IHWG Press, in press.

Page 127: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 163 -

Figure legends

Figure 1. Differences between two ancestral haplotypes in the Human MHC.

Primers amplify polymorphic and duplicated sequences resulting in block specific PCR

profiles.

The polymorphic element varies in sequence structure and length. Different haplotyes

carry different numbers of copies of the polymorphic sequences.

Figure 2. MHC Genotyping Report.

Genotype analysis of 21 Blue Heeler dogs using GMT and a comparison to DLA alleles

assigned by conventional SBT. Figure 1a reports the results of GMT typing and DLA

DR, DQA and DQB of the 3-generation family. 3 unrelated individuals have also been

included. 1b shows the family tree of the grand parents, parents and offspring that have

resulted from two separate matings. 1c is a summary of the deduced DLA haplotypes.

Three unrelated Blue Heelers (Ned, Theo, Kinky) are included.

COMMENT

1. The Cafa_classII 1&2 primers amplify within DLA DQ alpha 1 and DQ beta 1

(NW_139872, positions 143791-144189 and 210163-210376 respectively).

Page 128: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 164 -

Page 129: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure
Page 130: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure
Page 131: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 167 -

PRELUDE TO CHAPTER 6

GMT APPLICATIONS IN LIVESTOCK

The following chapter describes an area of interest I have had since the commencement

of my studies. The exciting results obtained through the study of the RCA genes have

resulted in limited time to devote to the livestock projects. I have however compiled a

DNA collection of over 250 specimens. These include several complete four generation

families from Simmentals, Dexters, Angus, Friesian and assorted crosses. All the

specimens collected have detailed phenotypic data, including whether they are

horned/polled, their temperament, coat colour and pedigree. This information has been

compiled in a database that was created specifically for this project by Dr Brent Stewart.

Screen shots of this software are included in the following manuscript along with a

detailed family tree of all Professor Dawkins cattle, from the last 20 years.

I have used the polling project as an example of the application of the GMT in

livestock. I have however, preliminary data for several other projects, including mastitis

and fine wool in sheep.

I am grateful to (i) Professor Roger Dawkins for his instruction and guidance in

working with cattle, collecting specimens and in the compilation of this manuscript, (ii)

Mr John Dawkins for assistance in the collection of specimens, (iii) Dr Brent Stewart

for the may hours spent creating “Beef Breeder” a database capable of handling the

large volume of phenotypic, specimen and genetic data that has been generated to date,

(iv) Mr Ryan Southall for assistance with DNA extractions and Dr Yang Da from the

Page 132: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 168 -

University of Minnesota for kindly producing the complex family tree with his pedigree

software “Pedigraph”.

Page 133: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 169 -

CHAPTER 6

IDENTIFICATION OF TRAITS AND FUNCTION IN LIVESTOCK

BY GENOMIC MATCHING: GENETIC DETERMINATION OF

HOMOZYGOUS AND HETEROZYGOUS POLLED CATTLE

The work described in this chapter is part of an Australian Research Council

(ARC) funded project and is in preparation for submission:

McLure, C., J. Dawkins, B. Stewart, and R. Dawkins. (In preparation). " Identification

of traits and function in livestock by genomic matching: Genetically determining

homozygous and heterozygous polled cattle".

Page 134: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 170 -

Page 135: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 171 -

Identification of traits and function in livestock by genomic

matching: Genetically determining homozygous and

heterozygous polled cattle.

Craig A McLure1,2, John R Dawkins2, Brent J Stewart1,2 & Roger L Dawkins1,2,3

1Faculty of Medicine and Dentistry, University of Western Australia, Nedlands, Western

Australia 6907

2C.Y. O’Connor ERADE Village, PO Box 5100, Canning Vale, Western Australia 6155

Manuscript number 0000 of the C Y O’Connor ERADE Village.

3Correspondence should be addressed to RLD:

Professor Roger Dawkins

CY O’Connor ERADE Village

PO Box 5100

Canning Vale South

Western Australia 6155

Facsimile: +618 9397 1559

Email: [email protected]

Page 136: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 172 -

Abstract

A major aim is to apply our in-house patented DNA typing procedure (Genomic

Matching Technique, GMT) to commercially important problems in the livestock

industry. We (Gaudieri et al. 2001; Tay et al. 1995) and others (Witt et al. 2000) have

shown that the GMT method is the most cost effective way to identify donors for human

bone marrow transplantation. It does this because a single test can distinguish between

extensive sequences (up to 1 million bases), which contain the relevant genes. We refer

to each specific polymorphic sequence with its component alleles as a haplotype.

Introduction

One advantage of the method is that it is possible to type for traits and functions

encoded within a haplotype before the nature and function of all component genes are

known. Another advantage is that such explanatory studies become possible after the

relevant haplotypes are defined. However, it is worth emphasising that after 10 years

and huge effort in many laboratories, we still do not know precisely how the benefit in

transplantation is mediated and for this reason we prefer the GMT approach to more

classic alternatives which assume that functions can be deduced from sequence and

some simple in vitro reductionist experiments. Undoubtedly, there is far greater

complexity than some admit. Identifying the whole haplotype sequence is a critical first

step.

The GMT method was developed when we realised that functionally important regions

of the genome are characterised by historic duplications of stretches of DNA sequence.

Thus a clinically or commercially significant haplotype is a complex sequence with

many imperfect but specific iterations. To exploit this feature, we used the Polymerase

Chain Reaction (PCR) to amplify successive duplicons collectively. Because of the

Page 137: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 173 -

haplospecific differences are effectively multiplied, the readout is a robust signature of

the haplotype. Because only a single test is required, the cost is a fraction of the

alternatives. GMT is now the gold standard for bone marrow matching in South East

Asia.

More recently we have demonstrated the applicability of the GMT outside the human

MHC (McLure et al. Submitted-a; McLure et al. Submitted-b). This suggests the GMT

will be just as informative in identifying genomic polymorphism and AHs in the bovine

genome. We have selected polling in cattle as a model.

The Problem

Polling is sometimes said to be a simple monogenic trait (Georges et al. 1993) but we

consider this misleading. It is true that polling (P) is dominant over horning (H). Thus in

general the genotypes PP and PH will be polled and HH will be horned. However it is

now clear that there must be interacting genes for head shape, scurring (loose horns) and

polling. Of note also is that there are different patterns of inheritance in different breeds

and we have observed that there are distinct degrees of horning. Some cattle designated

polled as calves develop small horns later in life. Some cattle need repeated dehorning.

Accordingly we postulate that there are at least several linked genes, which together

constitute a haplotype that controls several related functions, including rate of growth,

attachment and shape.

It follows that the genetics of horning may be ideal for GMT and that other attempts

may have failed because the complexity was underestimated. On the other hand

horning is not too complex in that at least most of the genes must be close together and

within a haplotype.

Page 138: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 174 -

Methods

DNA Extraction

DNA was extracted from Acid Citrate Dextrose (ACD) treated whole blood by the

standard salting out protocol.

Pedigree analysis of 920 cattle

Figure 1 shows the pedigree of 920 cattle from Melaleuca Stud and was generated by

the pedigree analysis software, Pedigraph (Garbe and Da 2004).The records that were

used to form this complex family tree have been maintained by Professor Roger

Dawkins and Mr John Dawkins and date back over 20 years. As shown in Figure1, the

DNA collected includes many families with at least 4 generations. Many of the cattle

also have several offspring.

Beef breeder database

The beef breeder database is a data management tool that was developed specifically for

this project. The program incorporates the common stud management tools such as

animal parentage, administered therapeutics and phenotype as well as many tools for

laboratory sample management and genetic testing data. Various screenshots of this

database are shown in Figure 2.

The Genomic Matching Technique Approach

This will be done by using the limited published (Brenneman et al. 1996; Georges et al.

1993; Harlizius et al. 1997) and proprietary data on mapping of polling and then

applying the synteny strategy. We know the equivalent regions in other species

Page 139: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 175 -

including man because for some years we have been pursuing comparative genomics

with the production of maps which show that vertebrates have the same basic genomic

structure albeit shuffled into patchworks which are species specific. Since we have

detailed sequence data on at least several examples of well studied species, we have

already designed primers which can be expected to amplify the relevant region of cattle.

Once candidate regions are identified, the following steps are taked:

1. Identify genomic duplication in candidate regions,

2. Examine duplications for conserved regions and polymorphic geometric elements

3. Design primers that amplify at multiple locations within a block

4. Test primers on a selection of individuals and examine amplicons for polymorphism

5. Segregate profiles to confirm primers are binding at linked loci.

6. Define Ancestral Haplotypes and associatiate with implicated phenotypes

Discussion

Irrespective of its interest scientifically, horn growth is emerging as a major issue in the

cattle industry. Firstly, horning is dangerous to workers and is therefore important in

terms of litigation and industrial relations. Secondly horned cattle injure each other and

inflict substantial bruising when transported. For these reasons, it is almost mandatory

to dehorn but there are several problems. Dehorning at birth disguises the phenotype so

that the distinction between polled and horned and degrees of horning is not apparent to

a breeder. This is one reason why horning genes remain so prevalent. More

importantly, dehorning later in life, when the phenotype is recognisable, is painful and

about to judged unacceptable unless performed under prohibitively expensive surgical

conditions.

Page 140: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 176 -

We conclude that any simple and robust test for the preferred haplotypes encoding

minimal horn growth without deleterious associations will be valuable to the beef

industry. A test which is simple to collect and cheap to perform will be commercially

viable. Thus, we will use some of the DNA collected for other purposes such as ID,

pathogen tracing etc and aspire to a cost to the farmer of only $20 per head.

Acknowledgements

The authors would like to thank (i) Dr Yang Da for kindly generating the full pedigree

of the herd using his software Pedigraph (garbe et al, 2004), (ii) The CY O’Connor

ERADE Village Foundation, The Australian Research Council and Genetic

Technologies for their generous support.

Page 141: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 177 -

References

Brenneman, R. A., S. K. Davis, J. O. Sanders, B. M. Burns, T. C. Wheeler, J. W. Turner

and J. F. Taylor (1996). "The polled locus maps to BTA1 in a Bos indicus x Bos

taurus cross." J Hered 87(2): 156-61.

Garbe, J. R. and Y. Da (2004). "Pedigraph user manual version 2.0 trail version."

Department of Animal Science, University of Minnesota.

Gaudieri, S., N. Longman-Jacobsen, G. K. Tay and R. L. Dawkins (2001). "Sequence

analysis of the MHC class I region reveals the basis of the genomic matching

technique." Hum Immunol 62(3): 279-85.

Georges, M., R. Drinkwater, T. King, A. Mishra, S. S. Moore, D. Nielsen, L. S.

Sargeant, A. Sorensen, M. R. Steele, X. Zhao and et al. (1993). "Microsatellite

mapping of a gene affecting horn development in Bos taurus." Nat Genet 4(2):

206-10.

Harlizius, B., I. Tammen, K. Eichler, A. Eggen and D. J. Hetzel (1997). "New markers

on bovine chromosome 1 are closely linked to the polled gene in Simmental and

Pinzgauer cattle." Mamm Genome 8(4): 255-7.

McLure, C., P. Kesners, S. Lester, D. Male, C. Amadou, J. Dawkins, B. Stewart, J.

Williamson and R. Dawkins (Submitted-a). "Haplotyping of the canine MHC

without the need for DLA typing."

McLure, C., J. Williamson, L. Smyth, S. Agrawal, S. Lester, J. Millman, P. Keating, B.

Stewart and R. Dawkins (Submitted-b). "Extensive genomic and functional

polymorphism of Complement Control Proteins."

Tay, G. K., C. S. Witt, F. T. Christiansen, D. Charron, D. Baker, R. Herrmann, L. K.

Smith, D. Diepeveen, S. Mallal, J. McCluskey and et al. (1995). "Matching for

MHC haplotypes results in improved survival following unrelated bone marrow

transplantation." Bone Marrow Transplant 15(3): 381-5.

Page 142: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 178 -

Witt, C., D. Sayer, F. Trimboli, M. Saw, R. Herrmann, P. Cannell, D. Baker and F.

Christiansen (2000). "Unrelated donors selected prospectively by block-

matching have superior bone marrow transplant outcome." Hum Immunol 61(2):

85-91.

Page 143: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 179 -

Figure Legends

Figure 1. Pedigree of 920 cattle.

The pedigree above shows the controlled interbreeding within a herd over the last 20

years. Names are assigned to those cattle introduced to the family, Numbers are

assigned to cattle that were born on the property, such that the first two numbers

represent the year and the last two represent the number in that year. DNA and serum

has been collected from all the individual shaded grey. The cattle shaded green have not

been collected as yet. The DNA collection covers at least 4 generations with multiple

offspring in each. These samples will be invaluable with the future genetic analysis of

phenotypic traits.

Figure 2. Beef Breeder database for phenotypic and genotypic data management.

Screen shots of the many layers to the Beef Breeder database. a) Main Page, b) Data

entry and analysis options, 1c) Search cattle based on any category, d) Individual cattle

details, e) 4 generation family trees identifying parents, grandparents and great

grandparents, f) Progeny Table, f) Entry of updated data , h) Laboratory data

management.

Page 144: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 180 -

Page 145: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure
Page 146: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure
Page 147: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 183 -

CONCLUSIONS AND FUTURE PLANS

Page 148: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 184 -

Page 149: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 185 -

CONCLUSIONS AND FUTURE PLANS

Future Plans

The Regulators of Complement

As can be imagined, there are many studies currently planned to utilise this extensive

genomic polymorphism of the RCA alpha block. Currently our focus is on determining

how far the alpha block extends, but we expect this is likely to be over 0.9 Mb and

containing the DAF, CR2, CR1, MCPL, CR1 and MCP (Rodriguez de Cordoba). The

further the block extends the more informative the haplotypes will become. It will be

extremely interesting if as we predict AHs vary not only in the number and location of

indels and SNPs but also in gene copy number (complete genes or internal domains).

While clinical studies were beyond the scope of this project, we have commenced

several pilot studies that are investigating the involvement of the RCA AHs in a number

of autoimmune diseases. Theses include Recurrent Spontaneous Abortion (RSA),

Sjogren’s Syndrome (SS) and Systemic Lupus Erythematosus (SLE) but we plan to

include many more diseases and population groups. The results of these studies will no

doubt create valuable insight into the mechanisms defining multifactorial disease.

As many of the CCPs act as viral receptors, the pathogenicity of and resistance to

viruses such as Measles (MV), Epstein Barr (EBV) and Human Herpes (HHV), is likely

to be encoded within the individual RCA AHs. The identificiation of resistant AHs may

lead to the development of novel vaccines and therapies. We hope to begin these studies

very shortly.

Page 150: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 186 -

Applications of the GMT to other genomes

Chapters 4 and 5 provide the evidence to suggest that the GMT has applications, not

only within the human MHC, but the human and non-human genomes in general.

Canines

The canine parentage project has demonstrated proof-of-concept. It is now our aim to

utilise the GMT signatures of the dog to define Canine MHC AHs (Dog Lymphocyte

Antigen – DLA). These will be highly informative in identifying disease susceptible and

resistant dogs and breeds. As the dog shares many of the MHC related diseases found in

humans, this is also expected to reveal many disease mechanisms that have previously

been missed or overlooked in humans.

Livestock

The many livestock projects appear promising, especially with the ever-increasing

amount of genomic sequence being made available and I look forward to continuing this

work.

Conclusion

In conclusion this thesis provides a new look at the function of duplication and

polymorphism within the human RCA. Analysis of the CR1 and CR1L genes

demonstrates the importance of duplication, indels and degeneracy in defining the SCR

repertoires of individual genes and their alleles. Internal duplications of functional

subunits have created genomic expansion while transposable elements and degeneracy

have created divergence. All these add to the overall complexity. Using this knowledge,

I have developed a functional assay that is able to characterise AHs defining this

Page 151: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 187 -

divergence. Eventual sequencing of each entire Ancestral Haplotype will be needed in

order to define the individual events specific to each.

Page 152: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 188 -

Page 153: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 189 -

APPENDICES

Page 154: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 190 -

Page 155: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 191 -

APPENDIX 1

NCBI sequence submission for CR1 and CR1L

HGEs

Page 156: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 192 -

Page 157: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 193 -

LOCUS C05/522D 298 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_05001 (CR1MCP5&6). ACCESSION DQ007054 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..298 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..195 repeat_region 107..138 /rpt_unit="cctc" repeat_region 139..178 /rpt_unit="cctt" repeat_region 179..186 /rpt_unit="ccttccct" primer_bind complement(276..298) BASE COUNT 50 a 105 c 25 g 118 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctccctccc ttccttcctt ccttccttct ttccttcctt ccttccttcc 181 ttccctcttt ccttattttc tttcttcttt accacgctgg ctaggaccac cagtataaca 241 ttgaacattg gtagcaatag atgtcatcct tgtcttgttc cacatctcaa agttaaag //

Page 158: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 194 -

LOCUS C05/524R 298 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_05001 (CR1MCP5&6). ACCESSION DQ007055 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..298 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..195 repeat_region 107..138 /rpt_unit="cctc" repeat_region 139..178 /rpt_unit="cctt" repeat_region 179..186 /rpt_unit="ccttccct" primer_bind complement(276..298) BASE COUNT 50 a 105 c 25 g 118 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctccctccc ttccttcctt ccttccttct ttccttcctt ccttccttcc 181 ttccctcttt ccttattttc tttcttcttt accacgctgg ctaggaccac cagtataaca 241 ttgaacattg gtagcaatag atgtcatcct tgtcttgttc cacatctcaa agttaaag //

Page 159: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 195 -

LOCUS C05/531P 298 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_05001 (CR1MCP5&6). ACCESSION DQ007056 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..298 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..195 repeat_region 107..134 /rpt_unit="cctc" repeat_region 135..178 /rpt_unit="cctt" repeat_region 179..186 /rpt_unit="ccttccct" primer_bind complement(276..298) BASE COUNT 50 a 104 c 25 g 118 t 1 others ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctccctycc ttccttcctt ccttccttct ttccttcctt ccttccttcc 181 ttccctcttt ccttattttc tttcttcttt accacgctgg ctaggaccac cagtataaca 241 ttgaacattg gtagcaatag atgtcatcct tgtcttgttc cacatctcaa agttaaag //

Page 160: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 196 -

LOCUS C05/528S 298 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_05001 (CR1MCP5&6). ACCESSION DQ007057 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 298) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..298 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..195 repeat_region 107..134 /rpt_unit="cctc" repeat_region 135..178 /rpt_unit="cctt" repeat_region 179..186 /rpt_unit="ccttccct" primer_bind complement(276..298) BASE COUNT 50 a 104 c 25 g 118 t 1 others ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctccctycc ttccttcctt ccttccttct ttccttcctt ccttccttcc 181 ttccctcttt ccttattttc tttcttcttt accacgctgg ctaggaccac cagtataaca 241 ttgaacattg gtagcaatag atgtcatcct tgtcttgttc cacatctcaa agttaaag //

Page 161: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 197 -

LOCUS C05/523K-1 294 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_04001 (CR1MCP5&6). ACCESSION DQ007058 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 294) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 294) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..294 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..191 repeat_region 107..134 /rpt_unit="cctc" repeat_region 135..174 /rpt_unit="cctt" repeat_region 175..182 /rpt_unit="ccttccct" primer_bind complement(272..294) BASE COUNT 50 a 102 c 25 g 117 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctcccttcc ttccttcctt ccttctttcc ttccttcctt ccttccttcc 181 ctctttcctt attttctttc ttctttacca cgctggctag gaccaccagt ataacattga 241 acattggtag caatagatgt catccttgtc ttgttccaca tctcaaagtt aaag //

Page 162: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 198 -

LOCUS C05/519G 294 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_04001 (CR1MCP5&6). ACCESSION DQ007059 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 294) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 294) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..294 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..191 repeat_region 107..134 /rpt_unit="cctc" repeat_region 135..174 /rpt_unit="cctt" repeat_region 175..182 /rpt_unit="ccttccct" primer_bind complement(272..294) BASE COUNT 50 a 102 c 25 g 117 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctcccttcc ttccttcctt ccttctttcc ttccttcctt ccttccttcc 181 ctctttcctt attttctttc ttctttacca cgctggctag gaccaccagt ataacattga 241 acattggtag caatagatgt catccttgtc ttgttccaca tctcaaagtt aaag

Page 163: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 199 -

LOCUS C05/525Y 302 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_06001 (CR1MCP5&6). ACCESSION DQ007060 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 302) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 302) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..302 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..199 repeat_region 107..134 /rpt_unit="cctc" repeat_region 135..182 /rpt_unit="cctt" repeat_region 183..190 /rpt_unit="ccttccct" primer_bind complement(280..302) BASE COUNT 50 a 107 c 25 g 120 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctcccttcc ttccttcctt ccttccttcc ttccttcctt ccttccttcc 181 ttccttccct ctttccttat tttctttctt ctttaccacg ctggctagga ccaccagtat 241 aacattgaac attggtagca atagatgtca tccttgtctt gttccacatc tcaaagttaa 301 ag

Page 164: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 200 -

LOCUS C05/527L 306 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_07001 (CR1MCP5&6). ACCESSION DQ007061 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 306) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 306) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..306 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..203 repeat_region 107..135 /rpt_unit="cctc" repeat_region 135..186 /rpt_unit="cctt" repeat_region 187..194 /rpt_unit="ccttccct" primer_bind complement(284..306) BASE COUNT 50 a 109 c 25 g 122 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc cctcccttcc ttccttcctt ccttccttcc ttccttcctt ccttccttcc 181 ttccttcctt ccctctttcc ttattttctt tcttctttac cacgctggct aggaccacca 241 gtataacatt gaacattggt agcaatagat gtcatccttg tcttgttcca catctcaaag 301 ttaaag //

Page 165: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 201 -

LOCUS C05/517T 290 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_03001 (CR1MCP5&6). ACCESSION DQ007062 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 290) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 290) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..290 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..187 repeat_region 107..130 /rpt_unit="cctc" repeat_region 131..170 /rpt_unit="cctt" repeat_region 171..178 /rpt_unit="ccttccct" primer_bind complement(268..290) BASE COUNT 50 a 99 c 25 g 116 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc cctccctccc 121 tccctccctc ccttccttcc ttccttcctt ctttccttcc ttccttcctt ccttccctct 181 ttccttattt tctttcttct ttaccacgct ggctaggacc accagtataa cattgaacat 241 tggtagcaat agatgtcatc cttgtcttgt tccacatctc aaagttaaag //

Page 166: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 202 -

LOCUS C05/523K-2 294 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1_04002 (CR1MCP5&6). ACCESSION DQ007063 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 294) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 294) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..294 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..191 repeat_region 107..134 /rpt_unit="cctc" repeat_region 135..174 /rpt_unit="cctt" repeat_region 175..182 /rpt_unit="ccttccct" primer_bind complement(272..294) BASE COUNT 50 a 101 c 25 g 118 t ORIGIN 1 aattccaaat tggcctggtt gacatggtgc caaaccacca aataattata attttattta 61 actctttgtc ttcttttctt tctttccttc cctccctccc tcctttcctc ccttcctccc 121 tccctccctc cctcccttcc ttccttcctt ccttctttcc ttccttcctt ccttccttcc 181 ctctttcctt attttctttc ttctttacca cgctggctag gaccaccagt ataacattga 241 acattggtag caatagatgt catccttgtc ttgttccaca tctcaaagtt aaag //

Page 167: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 203 -

LOCUS C05/450E 359 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_09001 (CR1MCP5&6). ACCESSION DQ007064 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 359) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 359) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..359 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..257 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..201 /rpt_unit="cctt" repeat_region 202..241 /rpt_unit="ccttccct" primer_bind complement(337..359) BASE COUNT 53 a 137 c 32 g 137 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata aatttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccctc cttccctcct tccctccttc cctccttccc 241 tcctttcctt ctccttattt tctttcttct ttaccacacg gctaggacca ccagtacaac 301 attgaacatt ggtagcaata gatgtcatcc ttgtcttgtt ccacatctca aagggaagg //

Page 168: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 204 -

LOCUS C05/452S 363 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_09501 (CR1MCP5&6). ACCESSION DQ007065 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 363) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 363) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..363 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..261 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..205 /rpt_unit="cctt" repeat_region 206..245 /rpt_unit="ccttccct" primer_bind complement(341..363) BASE COUNT 53 a 139 c 32 g 139 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata aatttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cctccttccc tccttccctc cttccctcct 241 tccctccttt ccttctcctt attttctttc ttctttacca cacggctagg accaccagta 301 caacattgaa cattggtagc aatagatgtc atccttgtct tgttccacat ctcaaaggga 361 agg //

Page 169: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 205 -

LOCUS C05/448P 363 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_09501 (CR1MCP5&6). ACCESSION DQ007066 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 363) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 363) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..363 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..261 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..205 /rpt_unit="cctt" repeat_region 206..245 /rpt_unit="ccttccct" primer_bind complement(341..363) BASE COUNT 53 a 139 c 32 g 139 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata aatttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cctccttccc tccttccctc cttccctcct 241 tccctccttt ccttctcctt attttctttc ttctttacca cacggctagg accaccagta 301 caacattgaa cattggtagc aatagatgtc atccttgtct tgttccacat ctcaaaggga 361 agg //

Page 170: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 206 -

LOCUS C05/449W 363 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_09501 (CR1MCP5&6). ACCESSION DQ007067 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 363) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 363) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..363 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..261 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..205 /rpt_unit="cctt" repeat_region 206..245 /rpt_unit="ccttccct" primer_bind complement(341..363) BASE COUNT 53 a 139 c 32 g 139 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata aatttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cctccttccc tccttccctc cttccctcct 241 tccctccttt ccttctcctt attttctttc ttctttacca cacggctagg accaccagta 301 caacattgaa cattggtagc aatagatgtc atccttgtct tgttccacat ctcaaaggga 361 agg //

Page 171: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 207 -

LOCUS C05/454F 367 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_10001 (CR1MCP5&6). ACCESSION DQ007068 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 367) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 367) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..367 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..265 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..209 /rpt_unit="cctt" repeat_region 210..249 /rpt_unit="ccttccct" primer_bind complement(345..367) BASE COUNT 52 a 140 c 32 g 143 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccctcct tccctccttc cctccttccc 241 tccttccctc ctttccttct ccttattttc tttcttcttt accacacggc taggaccacc 301 agtataacat tgaacattgg tagcaataga tgtcatcctt gtcttgttcc acatctcaaa 361 gggaagg //

Page 172: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 208 -

LOCUS C05/453Z 387 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_15001 (CR1MCP5&6). ACCESSION DQ007069 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 387) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 387) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..387 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..285 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..221 /rpt_unit="cctt" repeat_region 222..269 /rpt_unit="ccttccct" primer_bind complement(365..387) BASE COUNT 52 a 151 c 32 g 152 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccctc cttccctcct 241 tccctccttc cctccttccc tccttccctc ctttccttct ccttattttc tttcttcttt 301 accacacggc taggaccacc agtataacat tgaacattgg tagcaataga tgtcatcctt 361 gtcttgttcc acatctcaaa gggaagg //

Page 173: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 209 -

LOCUS C05/456T 379 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_13001 (CR1MCP5&6). ACCESSION DQ007070 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 379) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 379) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..379 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..277 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..221 /rpt_unit="cctt" repeat_region 222..261 /rpt_unit="ccttccct" primer_bind complement(352..379) BASE COUNT 52 a 146 c 32 g 149 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccctc cttccctcct 241 tccctccttc cctccttccc tcctttcctt ctccttattt tctttcttct ttaccacacg 301 gctaggacca ccagtataac attgaacatt ggtagcaata gatgtcatcc ttgtcttgtt 361 ccacatctca aagggaagg //

Page 174: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 210 -

LOCUS C05/443G 379 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_13001 (CR1MCP5&6). ACCESSION DQ007071 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 379) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 379) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..379 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..277 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..221 /rpt_unit="cctt" repeat_region 222..261 /rpt_unit="ccttccct" primer_bind complement(357..379) BASE COUNT 52 a 146 c 32 g 149 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccctc cttccctcct 241 tccctccttc cctccttccc tcctttcctt ctccttattt tctttcttct ttaccacacg 301 gctaggacca ccagtataac attgaacatt ggtagcaata gatgtcatcc ttgtcttgtt 361 ccacatctca aagggaagg //

Page 175: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 211 -

LOCUS C05/445U 395 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_17001 (CR1MCP5&6). ACCESSION DQ007072 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 395) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 395) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..395 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..293 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..229 /rpt_unit="cctt" repeat_region 230..277 /rpt_unit="ccttccct" primer_bind complement(373..395) BASE COUNT 52 a 155 c 32 g 156 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccttc cttccctcct 241 tccctccttc cctccttccc tccttccctc cttccctcct ttccttctcc ttattttctt 301 tcttctttac cacacggcta ggaccaccag tataacattg aacattggta gcaatagatg 361 tcatccttgt cttgttccac atctcaaagg gaagg //

Page 176: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 212 -

LOCUS C05/457A 371 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_11001 (CR1MCP5&6). ACCESSION DQ007073 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 371) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 371) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..371 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..269 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..221 /rpt_unit="cctt" repeat_region 222..253 /rpt_unit="ccttccct" primer_bind complement(349..371) BASE COUNT 52 a 141 c 32 g 146 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccctc cttccctcct 241 tccctccttc cctcctttcc ttctccttat tttctttctt ctttaccaca cggctaggac 301 caccagtata acattgaaca ttggtagcaa tagatgtcat ccttgtcttg ttccacatct 361 caaagggaag g //

Page 177: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 213 -

LOCUS C05/455M 383 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_14001 (CR1MCP5&6). ACCESSION DQ007074 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 383) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 383) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..383 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..281 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..225 /rpt_unit="cctt" repeat_region 226..265 /rpt_unit="ccttccct" primer_bind complement(361..383) BASE COUNT 52 a 148 c 32 g 151 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccttc cctccttccc 241 tccttccctc cttccctcct tccctccttt ccttctcctt attttctttc ttctttacca 301 cacggctagg accaccagta taacattgaa cattggtagc aatagatgtc atccttgtct 361 tgttccacat ctcaaaggga agg //

Page 178: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 214 -

LOCUS C05/459N 383 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_14001 (CR1MCP5&6). ACCESSION DQ007075 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 383) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 383) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..383 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..281 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..225 /rpt_unit="cctt" repeat_region 226..265 /rpt_unit="ccttccct" primer_bind complement(361..383) BASE COUNT 52 a 148 c 32 g 151 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccttc cctccttccc 241 tccttccctc cttccctcct tccctccttt ccttctcctt attttctttc ttctttacca 301 cacggctagg accaccagta taacattgaa cattggtagc aatagatgtc atccttgtct 361 tgttccacat ctcaaaggga agg //

Page 179: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 215 -

LOCUS C05/446B 383 bp DNA linear PRI 13-APR-2005 DEFINITION Human RCA CR1L_14001 (CR1MCP5&6). ACCESSION DQ007076 VERSION KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 383) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Extensive Genomic Polymorphism of the CR1 Region: RCA Ancestral Haplotypes, Function and Disease JOURNAL Unpublished REFERENCE 2 (bases 1 to 383) AUTHORS McLure,C.A., Williamson,J.F., Smyth,L.A., Agrawal,S., Lester,S., Millman,J., Keating,P.J., Stewart,B.J. and Dawkins,R.L. TITLE Direct Submission JOURNAL Submitted (13-APR-2005) C Y O'Connor ERADE Village, PO Box 5100, Canning Vale, WA 6155, Australia FEATURES Location/Qualifiers source 1..383 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" primer_bind 1..22 satellite 68..281 repeat_region 166..177 /rpt_unit="cctc" repeat_region 178..225 /rpt_unit="cctt" repeat_region 226..265 /rpt_unit="ccttccct" primer_bind complement(361..383) BASE COUNT 52 a 148 c 32 g 151 t ORIGIN 1 aattccaaat tggcctggtt gacactgtac aaaaccacca gataattata attttattta 61 actctttgtc ttcttttctt tccttccctc cttcccttct gcctgcctgc ttgccttcct 121 tctttgcttg cttccttcct tcctccctcc ctccatccct cccttcctcc ctccctccct 181 tccttccttc cttccttcct tccttccttc cttccttcct tccttccttc cctccttccc 241 tccttccctc cttccctcct tccctccttt ccttctcctt attttctttc ttctttacca 301 cacggctagg accaccagta taacattgaa cattggtagc aatagatgtc atccttgtct 361 tgttccacat ctcaaaggga agg //

Page 180: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 216 -

Page 181: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 217 -

APPENDIX 2

ENLARGED FIGURES AND TABLES

Page 182: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 218 -

Page 183: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 219 -

CHAPTER 1 – MS0014

McLure, C., R. Dawkins, J. Williamson, R. Davies, J. Berry, N. Longman-Jacobsen, R.

Laird and S. Gaudieri (2004). "Amino acid patterns within short consensus repeats

define conserved duplicons shared by genes of the RCA complex." Journal of

Molecular Evolution 59(2): 143-157.

FIGURES AND TABLES

Page 184: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 220 -

Page 185: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 1: Characteristic amino acid patterns of eleven SCR groupsGroup† Protein No Residues§

a C x x x x x x x x A x x x x x x x x x x F x x G T x x x Y x C x P x Y x x x x F x I x C x x x x x W x x x x x x C 57 Hosa CR1-1 C N A P E W L P F A R P T N L T D E F E F P V G T Y L N Y E C R P G Y S G R P F S I I C L K N S V W T G A K D R C

Patr CR1-1 C N A P E W L P F A R P T N L T D E F E F P I G T Y L N Y E C R P G Y Y G R P F S I I C L K N S V W T G A K D R C

Hosa CR1-22 C K T P E Q F P F A S P T I P I N D F E F P I G T S L N Y E C R P G Y F G K M F S I S C L E N L V W S S V E D N C

Patr CR1-22 C K T P E Q F P F A S P T I P I N D F E F P V G T S L N Y E C R P G Y F G K M F S I S C L E N L V W S S V E D N C

Paha CR1-21 C K T P E Q F P F A S P T I P I N D F E F P V G T S L N Y E C H P G Y F G R M F S I S C L E N L V W S S V E D N C

Hosa CR1-8 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V C

Patr CR1-8 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y K C R P E Y Y G R P F S I T C L D N L V W S S P K D V C

Paha CR1-7 C K A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L E W S S P K D V C

Hosa CR1-15 C Q A P D H F L F A K L K T Q T N A S D F P V G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V C

Patr CR1-15 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V C

Paha CR1-14 C K A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G K P F S I T C L D N L V W S S P K D V C

Mumu Crry-1 C P A P S Q L P S A K P I N L T D E S M F P I G T Y L L Y E C L P G Y I K R Q F S I T C K Q D S T W T S A E D K C

Rano Crry-1 C P A P P L F P Y A K P I N P T D E S T F P V G T S L K Y E C R P G Y I K R Q F S I T C E V N S V W T S P Q D V C

Mumu CR1-1 C K L L P K Y S F A K P S I V S D K S E F A I G T T W E Y K C R P G Y F R K S F I I T C L E T S K W S D A Q Q F C

Hosa DAF-1 C G L P P D V P N A Q P A L E G R T S F P E D T V I T Y K C E E S F V K I P G E K D S V T C L K G S Q W S D I E E F C

Hosa DAF-2 C E V P T R L N S A S L K Q P Y I T Q N Y F P V G T V V E Y E C R P G Y R R E P S L S P K L T C L Q N L K W S T A V E F C

Hosa MCP-1 C E E P P T F E A M E L I G K P K P Y Y E I G E R V D Y K C K K G Y F Y I P P L A T H T I C D R N H T W L P V S D D A C

Hosa CR2-7 C Q A P P N I L N G Q K E D R H M V R F D P G T S I K Y S C N P G Y V L V G E E S I Q C T S E V W T P P V P Q C

Hosa C2-1 C P Q N V N I S G G T F T L S H G W A P G S L L T Y S C P Q G L Y P S P A S R L C - - - - - - - - - - - - -

Hosa Bf-1 C S L E G V E I K G G S F R L L Q E G Q A L E Y V C P S G F Y P Y P V Q T R T C R S T G S W S T L K - - -

b C x P P x x x L H x x x x x x x x x x F x x G Q x V x Y x C x P x Y x L x G x x x x x C T x x G x W S x x x P x C 57Hosa CR1-5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T CPatr CR1-5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A T P T C

Paha CR1-4 C Q P P P D V L H G E R T Q R D K D I F Q T G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R C

Hosa CR1-12 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T CPatr CR1-12 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A A P T C

Paha CR1-11 C Q P P P D V L H G E R T Q R D K D I F Q P G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R C

Hosa CR1-19 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CPatr CR1-19 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CPaha CR1-18 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CHosa CR1-26 C Q P P P E I L H G E H T L S H Q D N F S P G Q E V F Y S C E P S Y D L R G A A S L H C T P Q G D W S P E A P R C

Patr CR1-26 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C

Paha CR1-25 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P I C

Mumu CR1-5 C L P P Q N I L H G D Y N K K D E F F S V G Q K V S Y T C N P G Y T L I G T N L V E C T S L G T W S N T V P T C

Hosa CR2-11 C Q P P P G L H H G H H T G G N T V F F V S G M T V D Y T C D P G Y L L V G N K S I H C M P S G N W S P S A P R C

Hosa CR2-3 C P A L P M I H N G H H T S E N V G S I A P G L S V T Y S C E S G Y L L V G E K I I N C L S S G K W S A V P P T C

Hosa C2-3 C P N P G I S L G A V R T G F R F G H G D K V R Y R C S S N L V L T G S S E R E C Q G N G V W S G T E P I CHosa Bf-3 C S N P G I P I G T R K V G S Q Y R L E D S V T Y H C S R G L T L R G S Q R R T C Q E G G S W S G T E P S C

c C P H P P K I Q N G H x I G G H V S L Y L P G M T I x Y x C D P G Y L L V G K G x I F C T D Q G I W S Q L D H Y C 57

Hosa CR1-29 C P H P P K I Q N G H Y I G G H V S L Y L P G M T I S Y T C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y CPatr CR1-29 C P H P P K I Q N G H D I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y CPaha CR1-28 C P H P P K I Q N G H Y I G G H V S L Y L P G M T I G Y I C D P G Y L L V G K G I I F C T D Q G I W S Q L D H Y CHosa CR2-15 C P P P P K T P N G N H T G G N I A R F S P G M S I L Y S C D Q G Y L L V G E A L L L C T H E G T W S Q P A P H C

d C P x P P x I x N G R H T G x x x x x x P x G K x x x Y x C D x H x D R G x x F x L I G E S x I R C T S D x x G N G V W S S x A P R C 67Hosa CR1-7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R CPatr CR1-7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C

Paha CR1-6 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C

Hosa CR1-14 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R CPatr CR1-14 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T Y F D L I G E S T I R C T S D P Q G N G V W S S P A P R C

Paha CR1-13 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S R A P R C

Hosa CR1-21 C P N P P A I L N G R H T G T P S G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R CPatr CR1-21 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R CPaha CR1-20 C P N P P A I L N G R H T G A L L G D I P Y G K E I S Y T C D P H R D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R CHosa CR1-28 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D P Q G N G V W S S P A P R CPatr CR1-28 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D P Q G N G V W S S P A P R CPaha CR1-27 C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R C

Hosa CR2-5 C P S P P P I L N G R H I G N S L A N V S Y G S I V T Y T C D P D P E E G V N F I L I G E S T L R C T V D S Q K T G T W S G P A P R C

Hosa CR2-9 C P P P P V I Y N G A H T G S S L E D F P Y G T T V T Y T C N P G P E R G V E F S L I G E S T I R C T S N D Q E R G T W S G P A P L C

Hosa CR2-13 C H P P P V I V N G K H T G M M A E N F L Y G N E V S Y E C D Q G F Y L L G E K K L Q C R S D S K G H G S W S G P S P Q C

Hosa MCP-3 C T P P P K I K N G K H T F S E V E V F E Y L D A V T Y S C D P A P G P D P F S L I G E S T I Y C G D N S V W S R A A P E C

e C x x P P x I x N G D F x S x x x x x F x x G x V V x Y x C x x x x x x x x x F x L V G E x S x x C T S x x x x x G x W x x P x P x C 67Hosa CR1-3 C G L P P T I T N G D F I S T N R E N F H Y G S V V V Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPatr CR1-3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C

Paha CR1-2 C G L P P T I D N G D F F S A N K E Y F H Y G S V V T Y R C N L G S G G R K L F E L V G E P S I Y C T S N E D Q V G I W S G P A P Q C

Hosa CR1-10 C G L P P T I A N G D F I S T N R E N F H Y G S V V V Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPatr CR1-10 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPaha CR1-9 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CHosa CR1-17 C G L P P T I A N G D F I S T N R E N F H Y G S V V V Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPatr CR1-17 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPaha CR1-16 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C

Hosa CR1-24 C E P P P T I S N G D F Y S N N R T S F H N G T V V V Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R CPatr CR1-24 C E P P P T I S N G D F Y S N N R A S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R CPaha CR1-23 C K P P P T I S N G D F Y S N N R T S F H S G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G A W S S P P P R C

Mumu Crry-3 C E I P P G I P N G D F F S S T R E D F H Y G M V V V Y R C N T D A R G K A L F N L V G E P S L Y C T S N D G E I G V W S G P P P Q C

Rano Crry-3 C E I P P S I P N G D F F S P N R E D F H Y G M V V V Y Q C N T D A R G K K L F N L V G E P S I H C T S I D G Q V G V W S G P P P Q CMumu CR1-3 C E S P P A I S N G D F Y S S S R D S F F Y G M V V T Y Y C H T G K N R E K L F D L V G E K S I Y C T S K D N Q V G I W N S P P P Q C

Hosa DAF-4 C P A P P Q I D N G I I Q G E R D H Y G Y R Q S V T Y A C N K G F T M I G E H S I Y C T V N N D E G E W S G P P P E C

f C x x P x x x N x x x x x x x x x x F x L x x x x x F x C x x G F x M x G x x x x x C x x x x x W x P x L P x C 56Hosa CR1-4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CPatr CR1-4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P P R V K C Q A L N K W E P E L P S CPaha CR1-3 C T P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S CHosa CR1-11 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CPatr CR1-11 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F A M K G P R R V K C Q A L N K W E P E L P S CPaha CR1-10 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S CHosa CR1-18 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CPatr CR1-18 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V K C Q A L N K W E P E L P S CPaha CR1-17 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C

Hosa CR1-25 C T A P E V E N A I R V P G N R S F F S L T E I I R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H CPatr CR1-25 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H C

Paha CR1-24 C T A P E V K N G I R V P G N R S F F S L N E I V R F R C Q P G F V M V G S H T V Q C Q T N N R W G P K L P H C

Mumu Crry-4 C T P P P Y V E N A V M L S E N R S L F S L R D I V E F R C H P G F I M K G A S S V H C Q S L N K W E P E L P S C

Rano Crry-4 C T P P H V E N A V I V S K N K S L F S L R D M V E F R C Q D G F M M K G D S S V Y C R S L N R W E P Q L P S C

Mumu CR1-4 C P M P E I E N G L V E S G F K H S F F L N D T V I F K C K S G F T M K G S R I A W C Q P N S K W S P P L P T C

Hosa CR2-2 C P E P I V P G G Y K I R G S T P Y R H G D S V T F A C K T N F S M N G N K S V W C Q A N N M W G P T R L P T C

Hosa CR2-14 C P N P E V K H G Y K L N K T H S A Y S H N D I V Y V D C N P G F I M N G S R V I R C H T D N T W V P G V P T C

Page 186: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 1: Characteristic amino acid patterns of eleven SCR groups (Cont)

g C x H x x I x x G x x x S G x x x x Y x Y N D T V x F x C x x G F T L K G S x Q I R C x A x x T W x P x x P V C 56Hosa CR2-6 C P H P Q I L R G R M V S G Q K D R Y T Y N D T V I F A C M F G F T L K G S K Q I R C N A Q G T W E P S A P V CHosa CR2-10 C S H V H I A N G Y K I S G K E A P Y F Y N D T V T F K C Y S G F T L K G S S Q I R C K A D N T W D P E I P V CHosa MCP-4 C R F P V V E N G K Q I S G F G K K F Y Y K A T V M F E C D K G F Y L D G S D T I V C D S N S T W D P P V P K C

h C x x P x x M x G x x K x L x M K K x Y x Y G x x V x L x C E D G Y x L E G S x x S Q C Q x D x x W x P x L x x C 57Hosa CR1-30 C S F P L F M N G I S K E L E M K K V Y H Y G D Y V T L K C E D G Y T L E G S P W S Q C Q A D D R W D P P L A K CPatr CR1-30 C S F P L F M N G I S K E L E M K K V Y H Y G D Y V T L K C E D G Y T L E G S P W S Q C Q A D D R W D P P L A K CPaha CR1-29 C S F P Q F M N G I S K E L E M K K V Y H Y G D Y V T L E C E D G Y A L E G S P W S Q C Q A D D R W D P P L A I C

Mumu CRry-5 C R L P Q E M S G F Q K G L G M K K E Y Y Y G E N V T L E C E D G Y T L E G S S Q S Q C Q S D G S W N P L L A K C

Rano Crry-6 C K L P Q D M S G F Q K G L Q M K K D Y Y Y G D N V A L E C E D G Y T L E G S S Q S Q C Q S D A S W D P P L P K CRano Crry-7 C K L P Q D M S G F Q K G L Q M K K D Y Y Y G D N V A L E C E D G Y T L E G S S Q S Q C Q S D A S W D P P L P K C

Hosa CR2-16 C S S P A D M D G I Q K G L E P R K M Y Q Y G A V V T L E C E D G Y M L E G S P Q S Q C Q S D H Q W N P P L A V C

Hosa C2-2 C P A P V S F E N G I Y T P R L G S Y P V G G N V S F E C E D G F I L R G S P V R Q C R P N G M W D G E T A V C

Hosa Bf-2 C P R P H D F E N G E Y W P R S P Y Y N V S D E I S F H C Y D G Y T L R G S A N R T C Q V N G R W S G Q T A I C

i C x x x R Q x L x x x x x x x x x x x V N x x C x x G Y x x x G x x Y Q x C Q x x x x W F x x I x L C 51Hosa CR2-8 C E A T G R Q L L T K P Q H Q F V R P D V N S S C G E G Y K L S G S V Y Q E C Q G T I P W F M E I R L CHosa CR2-12 C Q H V R Q S L Q E L P A G S R V E L V N T S C Q D G Y Q L T G H A Y Q M C Q D A E N G I W F K K I P L C

j C x x P x x P x x G x V H x x x x x x x G S x x x Y x C x x G x R L I G x x S x x C x x x x x x x x W x x x x P x C 58Hosa CR1-2 C R N P P D P V N G M V H V I K G I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I C

Patr CR1-2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I C

Paha CR1-1 M V H V I K D I Q F G S Q I N Y S C T E G H R L I G S S S A T C I I S G N T V I W D N E T P I C

Hosa CR1-9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I CPatr CR1-9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C

Paha CR1-8 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C V T S G N T A H W S T K P P I C

Hosa CR1-16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S T K P P I C

Patr CR1-16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N S A H W S T K P P I C

Paha CR1-15 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I I S G N T A H W S T K P P I C

Hosa CR1-23 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I CPatr CR1-23 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I CPaha CR1-22 C G T P P E P F - N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K E A P I C

Mumu CRry-2 C K T P S D P E N G L V H V H T G I Q F G S R I N Y T C N Q G Y R L I G S S S A V C V I T D Q S V D W D T E A P I C

Rano Crry-2 C E T P L D P Q N G I V H V N T D I R F G S S I T Y T C N E G Y R L I G S S S A M C I I S D Q S V A W D A E A P I C

Mumu CR1-2 C M N P Q E P L H G S V H I N T G I E F G S T I T Y S C N Q G Y R L I G D S S A T C I V S D N T V M W D N D M P L C

Hosa DAF-3 C P N P G E I R N G Q I D V P G G I L F G A T I S F S C N T G Y K L F G S T S S F C L I S G S S V Q W S D P L P E C

Hosa CR2-1 C G S P P P I L N G R I S Y Y S T P I A V G T V I R Y S C S G T F R L I G E K S L L C I T K D K V D G T W D K P A P K C

k C x x x x x x L x x G x V x x P x x L Q L G A x V x F V C x x G x x L K G x S x S x C V L x G x x x x W N x S V P V C 59Hosa CR1-6 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPatr CR1-6 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPaha CR1-5 C D D S L G Q L P N G R V L F P R S L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CHosa CR1-13 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPatr CR1-13 C D G F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPaha CR1-12 C D D S L G Q L P N G R V L F P R S L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V C

Hosa CR1-20 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G S S V S H C V L V G M R S L W N N S V P V CPatr CR1-20 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G S S V S H C V L V G M R S L W N N S V P V CPaha CR1-19 C D D F L G Q L H H G R V L V P F N L Q L G A K V S F V C D E G F R L K G S S V S H C V L V G M R S L W N N S V P V C

Hosa CR1-27 C D D F L G Q L P H G R V L L P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V CPatr CR1-27 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V CPaha CR1-26 C D E F L G Q L P H G R V L S P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V C

Rano Crry-5 C G A F L G E L P N G H V F V P Q N L Q L G A K V T F V C N T G Y Q L K G N S S S H C V L D G V E S I W N S S V P V CMumu CR1-6 C D A I P N H L L H G R V F L P P N L Q L G A E V S F V C D L G

Hosa CR2-4 C K S L G R F P N G K V K E P P I L R V G V T A N F F C D E G Y R L Q G P P S S R C V I A G Q G V A W T K M P V C

Hosa MCP-2 C P Y I R D P L N G Q A V P A N G T Y E F G Y Q M H F I C N E G Y Y L I G E E I L Y C E L K G S V A I W S G K P P I C

Consensus* C x x P P x I x N G x I x x x x x x x F G D x I x Y x C x x G x x x x x x F x x x G x x x I x C x x x A x W x x x x P x C 61SCR Patterns L A L Y E L F Y L G

V S V V V S* The consensus SCR sequence derived by reference 11.† The proteins used to define the groups are Mumu and Rano Crry, Hosa, Patr, Paha CR1 and the 'ajefbk' portion of Mumu CR1. The residues essential for defining any of the groups were only assigned when all group members had a single residue at a specific position and are shown as black boxes. Positions where multiple residues were present were designated with an (x). Of these, grey boxes indicate amino acids shared by multiple members of the group. Boxed amino acids indicate the same amino acid in the CR1 and Crry protein(s) and Hosa CR2, MCP, DAF, C2 and/or Bf. § Number of residues for group based only on CR1 and Crry proteins. Note Hosa is Homo sapien, Mumu is Mus musculus, Rano is Rattus norvegicus, Patr is Pan troglodytes and Paha is Papio hamadryas

Page 187: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 223 -

CHAPTER 2 – MS0406

McLure, C., J. Williamson, B. Stewart, P. Keating and R. Dawkins (2004). "Genomic

analysis reveals a duplication of eight rather than seven SCRs in Primate CR1 and

CR1L: Evidence for an additional set shared between CR1 and CR2." Immunogenetics

56(9): 631-638.

FIGURES AND TABLES

Page 188: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 224 -

Page 189: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80

a) Hosa_CR1_7-8 T G C T C A T T T C C C C A C A T C C C A A A T G G G T T C A G A A T A T C T A A C C C A G G C C C T C C A T A T T T C C G T A A T T A C A A T G T G G T A T THosa_CR1_14-15 T G C T C A T T T C C C C A C A T C C C A A A T G G G T T C A G A A T A T C T A A C C C A G G C C C T C C A T A T T T C C G T A A T T A C A A T G T G G T A T TPatr_CR1_7-8 T G C T C A T T T C C C C A C A T C C C A A A T G G G T T C A G A A T A T C T A A C C C A G G C C C T C C A T A T T T C C A T A A T T A C A A T G T G G T A T TPatr_CR1_8-9 T G C T C A T T T C C C C A C A T C C C A A A T G G G T T C A G A A T A T C T A A C C C A G G C C C T C C A T A T T T C C A T A A T T A C A A T G T G G T A T TPatr_CR1_16-17 T G C T C A T T T C C C C A C A T C C C T A A T G G G T T C A G A A T A T C T A A C C C A G C C C C T C C A T A T T T C C A T A A T T A C A G T G T A G T A T THosa_CR1_21-22 T G C T C A T T T C C C C A C A T C C C T A A T G G G T T C A G A A T A T C T A A C C C A G C C C C T C C A T A T T T C C A T A A T T A C A G T G T A G T A T TPatr_CR1L_12+ T A T C C G C T T C C A C A T A T C C T A A A T G G G T T C A G A A T A T G T A G G T G A G A A C C T C C A T A T T T C T A T A G T - - G A C A G T C A T T T THosa_CR1L_12+ T A T C C G C T T C C A C A T A T C C T A A A T G G G T T C A G A A T A T G T A G G T G A G A A C C T T C A T A T T T C T A T A G T - - G A C A G T C A T T T THosa_CR1_35-36 T A C C T G C T T C C A C A T A T C C T A A A T G G G T T C A G A A T A T C T A G G T A A G A A C C T C C A T A T T T C T A T A G T - - G A C A G T C A T T T TPatr_CR1_30-31 T A C C T G C T T C C A C A T A T C C T A A A T G G G T T C A G A A T A T C T A G G T A A G A A C C T C C A T A T T T C T A T A G T - - G A C A G T C G T T T TPatr_CR1_23-24 T A T C C G C T T C C A C A T A T C C T A A A T G G G T T C A G A A T A T G T G G A C C C A A - C C C T C A T G T T T C T G T A A T A A G A C T A T G G T A T AHosa_CR1_28-29 T A T C C A C T T C C A C A T A T C C T A A A T G G G T T C A G A A T A T G T G G A C C C A A T C C C T C A T G T T T C T G T A A T A A G A C T A T G G T A T T

95 100 105 110 115 120 125 130 135 140 145 150 155 160Hosa_CR1_7-8 T A T T T - - - - - - G T G C A T T T G C - - - - - - - - - - - - - - - C A C A A T G G A A T G C C A A A C C A A T G A T A G A T G G T T C T A A G G T A G G THosa_CR1_14-15 T A T T T - - - - - - G T G C A T T T G C - - - - - - - - - - - - - - - C A C A A T G G A A T G C C A A A C C A A T G A T A G A T G G T T C T A A G G T A G G TPatr_CR1_7-8 T A T T T - - - - - - G T G C A T T T G C - - - - - - - - - - - - - - - C A C A A T G G A A T G C C A A A C C A A T G A T A G A T G G T T C T A A G G T A G G TPatr_CR1_8-9 T A T T T - - - - - - G T G C A T T T G C - - - - - - - - - - - - - - - C A C A A T G G A A T G C C A A A C C A A T G A T A G A T G G T T C T A A G G T A G G TPatr_CR1_16-17 T A T T T A T A T G T G T G C A T T T G C - - - - - - - - - - - - - - - C A C A A T G G A A T G C C A A A C C A A T G A T A A A T G G T T C T A A G G T A G G THosa_CR1_21-22 T A T T T G T A T G T A T G C A T T T G C - - - - - - - - - - - - - - - C A C A A T G G A A T G C C A A A C C A A T G A T A G A T G G T T C T A A G G T A G G TPatr_CR1L_12+ T G C T T G T G A A A A T G G C T T T G C T G T A A C T G T C A G A G A C A G A A C T A C C T C C C A A G T G A A T G A C A A A C G G G T T C T A G A T A G G CHosa_CR1L_12+ T G C T T G T G A A A A T G G C T T T G C T G T A A C T G T C A G A G A C A G A A C T A C C T C C C A A G T G A A T G A C A A A T G G G T T C T A G A T A G G CHosa_CR1_35-36 T G C T T G T G A A A A T G G C T T T G C T G T A A C T G T C A G A G A C A G A A C T A C C T C C C A A G T G A A T G A C A A A C G G G T T A T A G A T A G G CPatr_CR1_30-31 T G C T T G T G A A A A T G G C T T T G C T G T A A C T G T C A G A G A C A G A A C T C C C T C C C A A G T G A A T G A C A A A T G G G T T C T A G A T A G G CPatr_CR1_23-24 T G T T T G T G A G C T T A A C T T T G T C A T A A G T G T A A A T G T C A A A A T T A C A T A C C A T A G C T A A A A C A G A T G G G T T T C A G C A G C G CHosa_CR1_28-29 T G T T T G T G A G C T T A A C T T T G T C A T A A G T G T A A A T G T C A A A A T T A C A T A C C A T A G C T A A A A C A G A T G G G T T T C A G C A G C G C

165 170Hosa_CR1_7-8 - C A A C A C A T A AHosa_CR1_14-15 - C A A C A C A T A APatr_CR1_7-8 - C A A C A C A T A APatr_CR1_8-9 - C A A C A C A T A APatr_CR1_16-17 G C A A C A C A T A AHosa_CR1_21-22 - C A A C A C A T A APatr_CR1L_12+ A C A - - - - C T G THosa_CR1L_12+ A C A - - - - C T G THosa_CR1_35-36 A C G - - - - C T G TPatr_CR1_30-31 A C G - - - - C T G TPatr_CR1_23-24 - C A A G G T A T G THosa_CR1_28-29 - C A A G G T A T G T

5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80

b) Hosa_CR1_CR7 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G G A A A A C A G T A A AHosa_CR1_CR14 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G G A A A A C A G T A A AHosa_CR1_CR21 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G A A A A G C A G T A A APatr_CR1_CR16 T G T C C A A G T C C T C C A G T T A T T C C T A A C G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G A A A A G C A G T A A APatr_CR1_CR7 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G A A A A G C A G T A A APatr_CR1_CR8 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G A A A A G C A G T A A APaha_CR1_CR7 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G A A A A G C T G T G A CPaha_CR1_CR14 T G T C C A A G T C C T C C A G T T A T T C C T A A T G G G A G A C A C A C A G G A A A A C C T C T G G A A G T C T T T C C C T T T G G A A A A G C T G T G A CHosa_CR1_CR28 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A A C T C C C T C T G G A G A T A T T C C C T A T G G A A A A G A A A T A T CPatr_CR1_CR23 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A A C T C C C T T T G G A G A T A T T C C C T A T G G A A A A G A A A T A T CHosa_CR1_CR35 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A A C T C C C T T T G G A G A T A T T C C C T A T G G A A A A G A A A T A T CPatr_CR1_CR30 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A A C T C C C T T T G G A G A T A T T C C C T A T G G A A A A G A A A T A T CPatr_CR1L_CR12 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A A C T C C C C T T G G A G A T A T T C C C T A T G G A A A A G A A G T A T CHosa_CR1L_CR12 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A A C T C C C C T T G G A G A T A T T C C C T A T G G A A A A G A A G T A T CPaha_CR1_CR21 T G T C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C A G G A G C T C T C C T T G G A G A T A T T C C C T A T G G A A A A G A A A T A T CPacy_CR1L_CR7 T G C C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A T A G G A G C T C C C C T T G G A G A T A T T C C C T A T G G A A A A G A A G T A T CPaha_CR1_CR28 T G C C C A A A T C C T C C A G C T A T C C T T A A T G G G A G A C A C A C C G G A A C T C C C C T T G G A G A T A T T C C C T A T G G A A A A G A A G T A T C

95 100 105 110 115 120 125 130 135 140 145 150 155 160

Hosa_CR1_CR7 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A C G A G C T T C G A C C T C A T T G G A G A G A G C A C C A T C C G C T G C A C A A G T G A C CHosa_CR1_CR14 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A C G A G C T T C G A C C T C A T T G G A G A G A G C A C C A T C C G C T G C A C A A G T G A C CHosa_CR1_CR21 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A C G A G C T T C G A C C T C A T T G G A G A G A G C A C C A T C C G C T G C A C A A G T G A C CPatr_CR1_CR16 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A C G A C C T T C G A C C T C A T T G G A G A G A G C A C C A T C C G C T G C A C A A G T G A C CPatr_CR1_CR7 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A C G A C C T T C G A C C T C A T T G G A G A G A G C A C C A T C C G C T G C A C A A G T G A C CPatr_CR1_CR8 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A C G A C C T T C G A C C T C A T T G G A G A G A G C A C C A T C C G C T G C A C A A G T G A C CPaha_CR1_CR7 T T A C A C A T G T G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C G A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CPaha_CR1_CR14 T T A C A C A T G T G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C G A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CHosa_CR1_CR28 T T A C A C A T G T G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CPatr_CR1_CR23 T T A C A C A T G T G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CHosa_CR1_CR35 T T A C G C A T G C G A C A C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C T C C A T C C G C T G C A C A A G T G A C CPatr_CR1_CR30 T T A C G C A T G C G A C A C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C T C C A T C C G C T G C A C A A G T G A C CPatr_CR1L_CR12 T T A C A C A T G T G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CHosa_CR1L_CR12 T T A C A C A T G T G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C A C C A T C C G C C G C A C A A G T G A A CPaha_CR1_CR21 T T A C A C A T G T G A C C C C C A C C G A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CPacy_CR1L_CR7 T T A C A T A T G C G A C C C C C A C C C A G A C A G A G G G A T G A C C G T C A A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C CPaha_CR1_CR28 T T A C A C A T G C G A C C C C C A C C C A G A C A G A G G G A T G A C C T T C A A C C T C A T T G G G G A G A G C A C C A T C C G C T G C A C A A G T G A C C

165 170 175 180 185 190 195 200Hosa_CR1_CR7 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G THosa_CR1_CR14 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G THosa_CR1_CR21 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPatr_CR1_CR16 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPatr_CR1_CR7 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPatr_CR1_CR8 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPaha_CR1_CR7 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPaha_CR1_CR14 C T C A A G G G A A T G G G G T T T G G A G C A G C C G G G C C C C T C G C T G THosa_CR1_CR28 C T C A T G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPatr_CR1_CR23 C T C A T G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G THosa_CR1_CR35 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPatr_CR1_CR30 C T C A A G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPatr_CR1L_CR12 C T C A T G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G THosa_CR1L_CR12 C T C A T G G G A A T G G G G T T T G G A G C A G C C C T G C C C C T C G C T G TPaha_CR1_CR21 T T C A A G G G A A T G G G G T T T G G A G C A G C C C C G C C C C T C G C T G TPacy_CR1L_CR7 C T C A A G G G A A T G G G G T T T G G A G C A G C C C C G C C C C T C G C T G TPaha_CR1_CR28 T T C A A G G G A A T G G G G T T T G G A G C A G C C C C G C C C C T C G C T G T

85 90

85 90

Figure 1 – Relationship of additional SCRsFigure 1a shows the alignment of the nucleotide sequences of the twelve additionalnovel SCRs. Note blocks of conservation confirming that all are closely related. However, the conservation is much less than found within expressed subfamilies such as "d“ shown in Fig 1b. Conserved nucleotides have been shaded black with white text. Partially conserved nucleotides have been shaded grey.

Page 190: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 252 -

Page 191: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 227 -

CHAPTER 3 – MS0402

McLure, C., J. Williamson, B. Stewart, P. Keating and R. Dawkins (2005). "Indels and

imperfect duplication have driven the evolution of human CR1 and CR1-like from their

precursor CR1 alpha: Importance of functional sets." Hum Immunol 66(3): 258-273.

FIGURES AND TABLES

Page 192: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 228 -

Page 193: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 1a. Sets of SCR subfamilies reveal relationships between Complement Control proteins. (i) Phylogenetic analysis of the amino acid sequence of Hosa CR1 and CR1L reveals that 55 SCRs form 10 distinct subfamilies (a = red, j = orange, e = yellow, f = green, b = dark blue, k = purple, d = mauve, g = grey, c = light blue and h = aqua.). Note the six degenerate g-likeSCRs are not shown in the phylogenies. (ii) After adding Mumu and Rano Crry, Patr and Pacy CR1L and Patr and Paha CR1, the same subfamilies are apparent, indicating conservation of amino acid motifs. (iii) Organisation and use of subfamilies. The colours from (i) and (ii) have been retained. White indicates a gap to permit the recognition of patterns. Black dots indicate SCR presence is unknown (no genomic sequence). Coloured boxes with white stripes indicate truncated or degenerate SCRs grouped on remaining sequence. The numbers apply to Hosa CR1. Degenerate SCRs are designated 7-8, 14-15 etc, thus not affecting the existing numbering. The inner circles show Mumu Crry “ajefh” and Rano Crry “ajefkhh”. The next three circles (CR1L) show the addition and deletion of SCRs resulting in Hosa, Patr and Pacy CR1L being “ajefbkjefbkdg-like”, “ajefbkjefbkdg-like” and “ajef…bkd” respectively. The outer three circles (CR1) reflect the use of the “ch”, “ajef” and “bkdg-like” sets (with some deletions). Imperfect internal duplication of octamers is apparent at several locations such as 8 to 14.

hf cbj dea k

Rano Crry

Mumu Crry

Pacy CR1L

Patr CR1L

Hosa CR1L

Paha CR1

Patr CR1

Hosa CR1

(i) Human(ii) Human, primate and Rodent

(iii)

Hos

aC

R1

10H

osa

CR

1 3

Hos

aC

R1

17

Hosa CR1 2

4

Hosa CR1L 3Hosa CR1L 8Hosa CR1 31

Hosa CR1 7Hosa CR1 21Hosa CR1 14

Hosa CR1L 12

Hosa CR1 28

Hosa

CR

1 35

Hosa CR1 36

Hosa CR1 33

Hosa CR1L 10

Hosa

CR

1 26

Hosa

CR

1L 5H

osaC

R1 12

Hosa

CR

1 5

HosaCR1 19

Hos

aC

R1

1H

osa

CR

1L 1

Hos

aC

R1

29

Hos

aC

R1

8

HosaCR1 15

Hosa CR1 22

Hos

aC

R1

16H

osa

CR1

23

Hosa CR1 9

Hosa CR1L 7

Hosa CR1L 2Hosa CR1 2Hosa CR1 30

Hosa CR1 27Hosa CR1L 11

Hosa CR1 34

HosaCR1L 6

Hosa

CR

1 20

Hosa

CR

1 6H

osaC

R1 13

Hosa

CR

1 37

HosaCR1 32

HosaCR1L 4

Hosa

CR

1L 9

Hosa

CR

1 18H

osaC

R1 4

HosaCR1 11

HosaCR1 25

0.5

Pah

a C

R1

10P

aha

CR

1 17

Hos

aC

R1

24P

acy

CR

1 L 3

Pah

a C

R1

3H

osa

CR

1L 3

Pat

rC

R1L

3

Hos

aC

R1L

8

Hos

aC

R1

10

Hos

aC

R1

17

Hosa

CR1 3

Patr CR1

3

PatrCR1 12

Rano Crry3

Mumu Crry3

Patr CR1 26

Hosa CR1 31

Paha CR1 24

Hosa

CR1 7

Hosa CR1 2

1

Patr CR1 7

Patr CR1 16

Patr CR1 8Hosa CR1 14

Paha CR1 7Paha CR1 14Patr CR1L 12Paha CR1 28Pacy CR1L 7

Paha CR1 21

Hosa CR1L 12

Hosa CR1 28

Patr CR1 23

Hosa CR1 35

Patr CR1 30

Hosa CR1 36

Patr CR1 31

Paha CR1 29Hosa CR1 19Patr CR1 5Hosa CR1 12

Hosa CR1 5

Patr CR1 14Paha CR1 12

Paha CR1 5

HosaCR1L 5

Patr CR1L 5Hosa

CR1 26

Patr CR

1 21

Patr

CR

1 28

Paha C

R1 19

Paha C

R1 26

Hos a

CR

1 3 3P

acyC

R1L 5

Patr

CR

1L 10

Hosa

CR

1L 10

HosaCR1 20

PatrCR1 6

Hosa

CR1 13

Hos

aC

R1

6

Pat

rC

R1

15

Hos

aC

R1L

6

Pat

rC

R1L

6H

osa

CR

1 27

Pat

rC

R1

22P

aha

CR

1 6

Pah

a C

R1

13

Pah

a C

R1

20

Paha

CR

1 27

Hosa

CR1 34

Hosa CR1L

11

Patr CR1L 11

Pacy CR1L 6

Patr CR1 29

Rano Crry5

Hos

aC

R1

37P

atr C

R1

32

Paha

CR

1 30

Mum

uC

rry5

Rano

Crry6

Rano Crry7

Ran

oC

rry

4

Mum

uCrry

4H

osa

CR

1 32

Patr C

R1 27

Paha C

R1 25

Hosa

CR1L 4

Patr

CR1L 4

Paha C

R1 11

Paha C

R1 18

Pacy CR1L 4

Paha CR1 4

Patr CR1 4Hosa CR1 4Hosa CR1 11Hosa CR1 18Hosa CR1 25

Patr CR1 13

Patr CR1 20

Patr CR1L 9

Hosa

CR

1L 9

Patr CR1 25Paha CR1 23Hosa CR1 30

Pacy CR1L 2Paha CR1 2

Patr CR1 2Patr CR1L 2Hosa CR1L 2

Hosa CR1 2

Rano Crry 2

Mumu Crry 2

Hosa CR1L 7

Patr CR1L 7

Paha CR1 9

Paha CR

1 16

Hosa

CR

1 9

Hosa

CR

1 16P

atrC

R1 11

Patr

CR

1 18H

osaC

R1 23

HosaCR1 29

Patr CR

1 24

Paha C

R1 22

Hosa

CR

1 22

Patr C

R1 17

Hosa

CR

1 15

Hosa

CR

1 8P

aha CR

1 8P

aha CR

1 15R

anoC

rry1M

umu

Crry

1H

osaC

R1 1

Patr

CR

1 1P

aha CR

1 1

PacyC

R1L 1

HosaCR

1L 1

PatrCR1L 1

0.5

g

Page 194: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure
Page 195: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

KbMCPCR1

LMCPLCR1

j e f b ka d ga j e f b k d g c h

50 100 150

50

100

150

20

40

60

20

40

Human(i) Chimpanzee

(ii)

Norway rat

(iii)

House mouse

MCP

Crrya j e f k

h h

MCP

CR1L

MCP

LCR

1

j e f b

a j e f b k d g

c h

d g a

MCP

Crrya j e f

h

MCP

CR1Lj e f b k

a

d g

a j e f b k d g

Figure 2b. Domain and segmental duplications within primate CR1 explain the genomic expansion of CR1and CR1L not seen in the rodent.Comparative analysis of the genomic region containing HosaCR1, MCPL, CR1L and MCP with syntenic regions in (i) Patr, (ii) Mumuand (iii) Rano using dotter (Sonhammerand Durbin, 1995). Comparative dot-plots reveal high sequence similarity between Hosa and Patr. There are notable exceptions, for example the missing LHR in CR1, the extra SCRs(dg-like) between the 1st and 2nd octamers and the additional MCPduplicon. Loss of sequence similarity between rodent and Hosa is due to the absence of both internal duplication of octamers within primate CR1 and segmental duplication of the primate genes CR1and MCP. Sequences have been Repeat Masked (http://repeatmasker.org/cgi-bin/WEBRepeatMasker) and masked elements removed from the annotated sequence. Note that missing Patr CR1Ldata has been found in WGS sequence. These follow the ensembl sequence, are separated by horizontal lines and have been annotated accordingly. f

Page 196: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 3a: Analysis of genomic sequence of CR1L against CR1. Coloured lines running through the plot indicate positions of individual SCRs such that a = red, j = orange, e = yellow, f = green, b = dark blue, k = purple, d = mauve, g = grey, c = light blue and h = aqua. Filled regions show non coding sequence between split exons (j & k). For the purpose of discussion we assume that the CR1L was the progenitor of CR1 which arose through further piecemeal duplication of components within CR1L. For example the region of CR1Lcommencing at b1 has duplicated en bloc and there has been further diversification, especially in non coding regions between j2(i) and j2(ii). The deletion of the segment including d1, g-like1 & a2 is indicated by a vertical gap between k (ii) and j (i) (Revised Figure 4a. from [6]).

Page 197: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 4. Phylogenetic and genomic analyses define a model for the evolution of Hosa CR1 and CR1L. The evolutionary model has been constructed through analyses of phylogenetic trees comparing SCR peptides of CR1 related genes and the comparisons of the genomic sequence from the two genes. We hypothesise that these genes have resulted from multiple duplication and indel events. First, we believe the ancestral sequence (CR1 a) contained 1 copy of each SCR subfamily “a, j, e, f, b, k, d, g -like, c & h”. Second, the domain containing “a, j, e, f, b, k, d & g-like” was then duplicated twice, which we refer to as a “triplication”; this resulted in CR1 b, containing three copies of the domain “ajefbkdg-like” as well as 1 copy of the “c” and “h” SCRs. Third, segmental duplication of the entire gene, CR1b, formed two copies of the gene (pre-CR1 and pre-CR1L). Last, triplication of the domain “b, k, d, g-like, a, j, e & f” in pre-CR1 created further genomic expansion while deletions (“d, g-like & a”, “b, k, d, g-like, a, j, e & f” and “c & h”) in pre-CR1L caused genomic contraction. The resultant genes CR1 and CR1L contain 42 and 13 SCRs respectively (including degenerate g-like).

Page 198: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

TABLE 1. SCR alignments from nine SCR groups found in Crry, CR1 and CR1L. § No residues

a C x x P x x x x x A x x x x x x x x x x F P x G T x L x Y E C x P x Y x x x x F S I x C x x x x x W x x x x D x C 57Hosa_CR1_1 C N A P E W L P F A R P T N L T D E F E F P I G T Y L N Y E C R P G Y S G R P F S I I C L K N S V W T G A K D R CPatr_CR1_1 C N A P E W L P F A R P T N L T D E F E F P I G T Y L N Y E C R P G Y Y G R P F S I I C L K N S V W T G A K D R C

^ Paha_CR1_1 C N A P E Q L P F A R P T E L I D E P E F S I G T H L K Y E C R P G Y Y G R P F S I I C L K N S V W T S A K D K CHosa_CR1L_1 C N V P E W L P F A R P T N L T D D F E F P I G T Y L N Y E C R P G Y S G R P F S I I C L K N S V W T S A K D K CPatr_CR1L_1 C N V P E W L P F A R P T N L T D E F E F P I G T Y L N Y E C R P G Y Y G R P F S I I C L K N S V W T S A E D K CPacy_CR1L_1 C N A P E Q L P F A R P T N L T D A S E F P V G T Y L K Y E C L P G Y H G K P F S I I C L K N S V W T S A K D K CRano_Crry_1 C P A P P L F P Y A K P I N P T D E S T F P V G T S L K Y E C R P G Y I K R Q F S I T C E V N S V W T S P Q D V CMumu_Crry_1 C P A P S Q L P S A K P I N L T D E S M F P I G T Y L L Y E C L P G Y I K R Q F S I T C K Q D S T W T S A E D K CHosa_CR1_8 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V CHosa_CR1_15 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V CHosa_CR1_22 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V CPatr_CR1_17 C Q A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L V W S S P K D V CPaha_CR1_8 C K A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G R P F S I T C L D N L E W S S P K D V CPaha_CR1_15 C K A P D H F L F A K L K T Q T N A S D F P I G T S L K Y E C R P E Y Y G K P F S I T C L D N L V W S S P K D V CPatr_CR1_9 C Q A P D H F L F A K L K T Q S F A I G T S L K Y K C R P E Y Y G R P F S I T C L D N L V W S S P K D V CPatr_CR1_10 C Q A P D H F L F A K L K T Q S F A I G T S L K Y K C R P E Y Y G R P F S I T C L D N L V W S S P K D V CHosa_CR1_29 C K T P E Q F P F A S P T I P I N D F E F P V G T S L N Y E C R P G Y F G K M F S I S C L E N L V W S S V E D N CPatr_CR1_24 C K T P E Q F P F A S P T I P I N D F E F P V G T S L N Y E C R P G Y F G K M F S I S C L E N L V W S S V E D N CPaha_CR1_22 C K T P E Q F P F A S P T I P I N D F E F P V G T S L N Y E C H P G Y F G R M F S I S C L E N L V W S S V E D N C

b C Q P P P x x L H x E x x x x x x x x F x x G x E V x Y x C x P x x x L R G x x x x x C x P Q G x x x P x x P x C 57Hosa_CR1_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T CHosa_CR1_12 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T CHosa_CR1_19 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T CPatr_CR1_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A T P T CPaha_CR1_5 C Q P P P D V L H G E R T Q R D K D I F Q T G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R CPaha_CR1_12 C Q P P P D V L H G E R T Q R D K D I F Q P G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R CHosa_CR1L_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G S T Y L H C T P Q G D W S P A A P R CPatr_CR1L_5 C Q P P P D V L H G E R T Q R D K D N F S P G E E V Y Y S C E P G Y D L R G S T Y L H C T P Q G D W S P E A P R CHosa_CR1_26 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CPatr_CR1_21 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CPatr_CR1_28 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CPaha_CR1_26 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P I CPacy_CR1L_5 C Q P P P E I L H G E H T P S H Q D F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W N P E A P I CPaha_CR1_19 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R CHosa_CR1_33 C Q P P P E I L H G E H T L S H Q D N F S P G Q E V F Y S C E P S Y D L R G A A S L H C T P Q G D W S P E A P R CPatr_CR1L_10 C Q P P P E I L H G E H T L S H Q D N F S P G Q D V F Y S C E P G Y D L R G A A S L H C T P Q G D W T P E A P R CHosa_CR1L_10 C Q P P P E I L H G E H T L S H Q D N F L P G Q E V F Y S C E P S Y D L R G A A S L H C M P Q G D W T P E A P R CPatr_CR1_14 C Q P P P D V L H A E A Y P K G Q G Q L F T R G R E V F Y S C X P A T N L R G A A S L R C T P Q G R L E P L Q P P Q C

c C P x P P K I Q N G H x I G G H V S L Y L P G M T I x Y I C D P G Y L L V G K G x I F C T D Q G I W S Q L D H Y C 57Hosa_CR1_36 C P D P P K I Q N G H Y I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y CPatr_CR1_31 C P H P P K I Q N G H D I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y CPaha_CR1_29 C P H P P K I Q N G H Y I G G H V S L Y L P G M T I G Y I C D P G Y L L V G K G I I F C T D Q G I W S Q L D H Y C

d C P x P P x I x N G R H x G x x x x x x P x G K x x x Y x C D P H x D R G x x x x L I G E S x I R x T S x x x G N G V W S S x A P R C 67Patr_CR1_8 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R CPatr_CR1_16 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R CPatr_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R CHosa_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R CHosa_CR1_21 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R CHosa_CR1_14 C P S P P V I P N G R H T G K P L E V F P F G K T V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R CPaha_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S P A P R CPaha_CR1_14 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S R A P R CHosa_CR1_28 C P N P P A I L N G R H T G T P S G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R CPatr_CR1_23 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R CHosa_CR1_35 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D R Q G N G V W S S P A P R CPatr_CR1_30 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D P Q G N G V W S S P A P R CHosa_CR1L_12 C P N P P A I L N G R H T G T P P G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R R T S E P H G N G V W S S P A P R CPatr_CR1L_12 C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R CPaha_CR1_21 C P N P P A I L N G R H T G A L L G D I P Y G K E I S Y T C D P H R D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R CPaha_CR1_28 C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R CPacy_CR1L_7 C P N P P A I L N G R H I G A P L G D I P Y G K E V S Y I C D P H P D R G M T V N L I G E S T I R C T S D P Q G N G V W S S P A P R C

e C x x P P x I x N G D F x S x x R x x F x x x x V V T Y x C x x x x x x x x x F x L V G E x S x x C T S x x x x x G x W S x P x P x C 67Hosa_CR1_10 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CHosa_CR1_17 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CHosa_CR1_3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPatr_CR1_3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CHosa_CR1_24 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPatr_CR1_12 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q CPatr_CR1_19 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W A P Q CPaha_CR1_10 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPacy_CR1L_3 C G L P P T I A N G D F I S T S R E Y F P Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPaha_CR1_17 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CHosa_CR1L_3 C G L P P T I A N G D F T S I S R E Y F H Y G S V V T Y H C N L G S R G K K V F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPatr_CR1L_3 C G L P P T I A N G D F T S I S R E Y F H Y A S V V T Y H C N L G S G G K K V F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPaha_CR1_3 C G L P P T I D N G D F F S A N K E Y F H Y G S V V T Y R C N L G S G G R K L F E L V G E P S I Y C T S N E D Q V G I W S G P A P Q CHosa_CR1L_8 C G L P P N I T N G Y F I S T D R E Y F H Y G S V V T Y H C N L G S R G R K V F E L V G E P S I Y C T S K D D Q V G V W S G P V P Q CPatr_CR1L_8 C G L P P N I T N G Y F I S T D R E Y F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S K G D Q V G V W Q CRano_Crry_3 C E I P P S I P N G D F F S P N R E D F H Y G M V V T Y Q C N T D A R G K K L F N L V G E P S I H C T S I D G Q V G V W S G P P P Q CMumu_Crry_3 C E I P P G I P N G D F F S S T R E D F H Y G M V V T Y R C N T D A R G K A L F N L V G E P S L Y C T S N D G E I G V W S G P P P Q CHosa_CR1_31 C E P P P T I S N G D F Y S N N R T S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R CPatr_CR1_26 C E P P P T I S N G D F Y S N N R A S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R CPaha_CR1_24 C K P P P T I S N G D F Y S N N R T S F H S G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G A W S S P P P R C

f C x x P x V x x x x x x x x N x S x F S L x x x V x F R C x x G F x M x G x x x V x C x x x x x W x P x L P x C 56Hosa_CR1_32 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H CPatr_CR1_27 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H CPaha_CR1_25 C T A P E V K N G I R V P G N R S F F S L N E I V R F R C Q P G F V M V G S H T V Q C Q T N N R W G P K L P H CRano_Crry_4 C T P P H V E N A V I V S K N K S L F S L R D M V E F R C Q D G F M M K G D S S V Y C R S L N R W E P Q L P S CMumu_Crry_4 C T P P P Y V E N A V M L S E N R S L F S L R D I V E F R C H P G F I M K G A S S V H C Q S L N K W E P E L P S CHosa_CR1_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CHosa_CR1_11 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CHosa_CR1_18 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CHosa_CR1_25 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S CPatr_CR1_13 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F A M K G P R R V K C Q A L N K W E P E L P S CPatr_CR1_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P P R V K C Q A L N K W E P E L P S CPatr_CR1_20 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V K C Q A L N K W E P E L P S CPatr_CR1L_9 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C L P G F V M K R P P P R V Q C Q A L N K W E T E L P S CHosa_CR1L_9 C T P P N V E G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V Q C Q A L N K W E T E L P S CPacy_CR1L_4 C M P P N V E N G V L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R H V Q C Q A L N K W E P E L P S CPatr_CR1L_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R H V H C Q A L N K W E P E L P S CPaha_CR1_11 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S CPaha_CR1_18 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S CPaha_CR1_4 C T P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S CHosa_CR1L_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F G M K G P S H V K C Q A L N K W E P E L P S C

g-like x x x P H I x N G F R I x x x x P x x F x x x x x x x x x x x x x x x x x x x x x x x x x xPatr_CR1_7-8s C S F P H I P N G F R I S N P G P P Y F H N Y N V V F I C A F A T M E C Q T N D R WHosa_CR1_7-8s C S F P H I P N G F R I S N P G P P Y F R N Y N V V F I C A F A T M E C Q T N D R WPatr_CR1_8-9s C S F P H I P N G F R I S N P G P P Y F H N Y N V V F I C A F A T M E C Q T N D R WHosa_CR1_14-15s C S F P H I P N G F R I S N P G P P Y F R N Y N V V F I C A F A T M E C Q T N D R WPatr_CR1_16-17s C S F P H I P N G F R I S N P A P P Y F H N Y S V V F I Y M C A F A T M E C Q T N D K WHosa_CR1_21-22s C S F P H I P N G F R I S N P A P P Y F H N Y S V V F I C M Y A F A T M E C Q T N D K WPatr_CR1_23-24s P H I L N G F R I C G P N P S C F C N K T M V Y V CHosa_CR1_28-29s P H I L N G F R I C G P N P S C F C N K T M V F V C

Patr_CR1_30-31s P H I L N G F R I S R * E P P Y F Y S D R F C L * K W L C C N C QHosa_CR1_35-36s P H I L N G F R I S R * E P P Y F Y SPatr_CR1L_12+s P H I L N G F R I C R * E P P Y F Y S D H F C L * K W L C C N C QHosa_CR1L_12+s P H I L N G F R I C R * E P S Y F Y S

Page 199: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

TABLE 1. SCR alignments from nine SCR groups found in Crry, CR1 and CR1L (Cont)

h C x x P x x M x G x x K x L x M K K x Y x Y G x x V x L x C E D G Y x L E G S x x S Q C Q x D x x W x P x L x x C 57Hosa_CR1_37 C S F P L F M N G I S K E L E M K K V Y H Y G D Y V T L K C E D G Y T L E G S P W S Q C Q A D D R W D P P L A K CPatr_CR1_32 C S F P L F M N G I S K E L E M K K V Y H Y G D Y V T L K C E D G Y T L E G S P W S Q C Q A D D R W D P P L A K CPaha_CR1_30 C S F P Q F M N G I S K E L E M K K V Y H Y G D Y V T L E C E D G Y A L E G S P W S Q C Q A D D R W D P P L A I CRano_Crry_6 C K L P Q D M S G F Q K G L Q M K K D Y Y Y G D N V A L E C E D G Y T L E G S S Q S Q C Q S D A S W D P P L P K CRano_Crry_7 C K L P Q D M S G F Q K G L Q M K K D Y Y Y G D N V A L E C E D G Y T L E G S S Q S Q C Q S D A S W D P P L P K CMumu_Crry_5 C R L P Q E M S G F Q K G L G M K K E Y Y Y G E N V T L E C E D G Y T L E G S S Q S Q C Q S D G S W N P L L A K C

j C x x P x x P x N G x V H x x x x x x x G S x x x Y x C x x G x R L x G x x x x x C x x x x x x x x W x x x x P x C 58Pacy_CR1L_2 C R N P K D P V N G M V H V I K D I Q F G S Q I N Y S C N K G Y R L I G S S S A T C I I S G N T V I W D N E T P I C

^ Paha_CR1_2 C R N P R D P V N G M V H V I K D I Q F G S Q I N Y S C T E G H R L I G S S S A T C I I S G N T V I W D N E T P I CPatr_CR1L_2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G N T V I W D N K T P V CHosa_CR1L_2 C R N P P D P V N G M A H V I K D I Q F G S Q I K Y S C P K G Y R L I G S S S A T C I I S G N T V I W D N K T P V CPatr_CR1_2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I CHosa_CR1_2 C R N P P D P V N G M V H V I K G I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I CRano_Crry_2 C E T P L D P Q N G I V H V N T D I R F G S S I T Y T C N E G Y R L I G S S S A M C I I S D Q S V A W D A E A P I CMumu_Crry_2 C K T P S D P E N G L V H V H T G I Q F G S R I N Y T C N Q G Y R L I G S S S A V C V I T D Q S V D W D T E A P I CHosa_CR1_9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I CHosa_CR1_16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I CPatr_CR1_11 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I CPatr_CR1_18 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N S A H W S T K P P I CHosa_CR1_23 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S T K P P I CPaha_CR1_16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I I S G N T A H W S T K P P I CPaha_CR1_9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C V T S G N T A H W S T K P P I CHosa_CR1L_7 C E T P P V P V N G M V H V I T D I H V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S M K P P I CPatr_CR1L_7 C E T P P V P V N G M V H V I T D I H V G S R I N Y S C I T G H R L I G H S S A E C I L S G N T A H W S M K P P I CHosa_CR1_30 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I CPatr_CR1_25 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I CPaha_CR1_23 C G T P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K E A P I C

k C x x x x G x L x x G x V x x P x x L Q L G A K V x F V C x x G x x L K G x x x S x C V L x G x x x x W N x S V P V C 59Hosa_CR1L_6 C D D F L G Q L P N G H V L F P L N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPatr_CR1L_6 C D D F L G Q L P N G R V L F P L N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CHosa_CR1_6 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CHosa_CR1_13 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CHosa_CR1_20 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPatr_CR1_6 C D D F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPatr_CR1_15 C D G F M G Q L L N G R V L F P V N L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPaha_CR1_6 C D D S L G Q L P N G R V L F P R S L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CPaha_CR1_13 C D D S L G Q L P N G R V L F P R S L Q L G A K V D F V C D E G F Q L K G S S A S Y C V L A G M E S L W N S S V P V CHosa_CR1_27 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G S S V S H C V L V G M R S L W N N S V P V CPatr_CR1_22 C D D F L G Q L P H G H V L F P L N L Q L G A K V S F V C D E G F R L K G S S V S H C V L V G M R S L W N N S V P V CPaha_CR1_20 C D D F L G Q L H H G R V L V P F N L Q L G A K V S F V C D E G F R L K G S S V S H C V L V G M R S L W N N S V P V CHosa_CR1L_11 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F L K G R S A S H C V L A G M K A L W N S S V P V CPatr_CR1L_11 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V CPacy_CR1L_6 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G R F A S H C V L A G M K A L W N S S V P V CPatr_CR1_29 C D D F L G Q L P H G R V L F P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V CHosa_CR1_34 C D D F L G Q L P H G R V L L P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V CPaha_CR1_27 C D E F L G Q L P H G R V L S P L N L Q L G A K V S F V C D E G F R L K G R S A S H C V L A G M K A L W N S S V P V CRano_Crry_5 C G A F L G E L P N G H V F V P Q N L Q L G A K V T F V C N T G Y Q L K G N S S S H C V L D G V E S I W N S S V P V C

* Consensus C x x P P x I x N G x I x x x x x x x F G D x I x Y x C x x G x x x x x x F x x x G x x x I x C x x x A x W x x x x P x C 61SCR Patterns L A Y E L F Y L G

V S V V S

TABLE 1. SCR alignments from nine SCR groups found in Crry, CR1 and CR1L (Cont).* The consensus SCR sequence derived by [11].† The proteins used to define the groups are Mumu and Rano Crry, Hosa, Patr, Paha CR1 and Hosa, Patr, Pacy CR1L. The residues essential for defining any of the subfamilies were only assigned when all group members had a single residue at a specific position and are shown as black boxes. Positions where multiple residues were present were designated with an (x). Of these, grey boxes indicate amino acids shared by multiple members of the group. Boxed sequences indicate partial or incomplete SCRs found within the genomic sequences. These generally involve frameshift mutations. When this is the case, the amino acid disrupted has been shade black and does not contain a residue. Sequence downstream is continued in the correct reading frame, thus ignoring the frameshift. These have been assigned to a SCR group based on their sequence present. Partial translations from the recently discovered g-like subfamily have been included but due to nucleotide degeneracy, limited sequence has been translated [9]. Partial, incomplete and degenerate SCRs were not included in phylogenetic analysis shown in Fig. 1a(ii) and Fig. 1b. § Number of residues for each subfamily. ^ Translated from the mRNA sequence but absent in respective protein sequence. Note Hosa is Homo sapien, Mumu is Mus musculus, Rano is Rattus norvegicus, Patr is Pan troglodytes, Paha is Papio hamadryas and Pacy is Papio Cynocephalus.

Page 200: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 252 -

Page 201: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 241 -

CHAPTER 4 – MS0408

McLure, C., J. Williamson, L. Smyth, S. Agrawal, S. Lester, J. Millman, P. Keating, B.

Stewart, and R. Dawkins (In Press). "Extensive genomic polymorphism of the CR1

region: RCA ancestral haplotypes, function and disease" Immunogenetics

FIGURES AND TABLES

Page 202: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 242 -

Page 203: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

BstN1 RFLP locations

Polymorphic Geometric Element

5' Conserved Region

110

20

25

26

29

30

31

40

41

50

60

70

80

81

82

90

95

CR1-like (NT_086601) A A T T C C A A A T T G G C C T G G T T G A C A C T G T A C A A A A C C A C C A G A T A A T T A T A A T T T T A T T T A A C T C T T T G T C T T C T T T T C T T T - - - - C C T T C C C T C C T T C CCR1-like (NT_021877.16) A A T T C C A A A T T G G C C T G G T T G A C A C T G T A C A A A A C C A C C A G A T A A T T A T A A T T T T A T T T A A C T C T T T G T C T T C T T T T C T T T - - - - C C T T C C C T C C T T C CCR1 (NT_021877.16) A A T T C C A A A T T G G C C T G G T T G A C A T G G T G C C A A A C C A C C A A A T A A T T A T A A T T T T A T T T A A C T C T T T G T C T T C T T T T C T T T C T T T C C T T C C C T C C C T C C

1 10

20

25

26

29

30

31

40

41

50

60

70

80

82

83

84

85

90

99

96

98

100

110

120

130

140

150

160

170

180

190

194

CR1-like (NT_086601) C T T C T G C C T G C C T G C T T G C C T T C C T T C T T T G C T T G C T T C C T T C C T T C C T C C C T C C C T C C A T C C C T C C C T T C C T C C C T C C C T C C C T T C C T T C C T T C C T T CCR1-like (NT_021877.16) C T T C T G C C T G C C T G C T T G C C T T C C T T C T T T G C T T G C T T C C T T C C T T T C T C C C T C C C T T C C T C C C T C C T T T C C T T C C T T C C T C C C T C C C T C C C T C C C T T CCR1 (NT_021877.16) C T C C T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - T T C C T C C C T C C C T C C C T C C C T C C

100

102

104

105

110

120

127

Polymorphic Geometric Element 3' Conserved Region

195

200

210

220

230

240

250

260

270

280

290

293

CR1-like (NT_086601) C T T C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C C T C C T T C C C T C C T T C C C T C C T T C C C T C C T T C C C T C C T T C C C T C C T T T C C T T C T C C T T A T T T TCR1-like (NT_021877.16) C T C C C T C C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C T T C C C T C C T T C C C T C C T T C C C T - - - - - - - - C C T T T C C T T C T C C T T A T T T TCR1 (NT_021877.16) C T C C C T C C C T T C C T T C C T T C C T T C C T T C T T T C C T T C C T T C C T T C C T T C C T T C C C T - - - - - - - - - - - - - - - - - - - - - - - - - C T T T C C T T - - - - - - A T T T T

128

130

140

150

160

170

180

182

183

190

191

195

294

300

310

311

312

320

330

340

350

360

370

380

386

387

390

391

CR1-like (NT_086601) C T T T C T T C T T T A C C A C A C - G G C T A G G A C C A C C A G T A T A A C A T T G A A C A T T G G T A G C A A T A G A T G T C A T C C T T G T C T T G T T C C A C A T C T C A A A G G G A A G GCR1-like (NT_021877.16) C T T T C T T C T T T A C C A C A C - G G C T A G G A C C A C C A G T A T A A C A T T G A A C A T T G G T A G C A A T A G A T G T C A T C C T T G T C T T G T T C C A C A T C T C A A A G G G A A G GCR1 (NT_021877.16) C T T T C T T C T T T A C C A C G C T G G C T A G G A C C A C C A G T A T A A C A T T G A A C A T T G G T A G C A A T A G A T G T C A T C C T T G T C T T G T T C C A C A T C T C A A A G T T A A A G

196

200

210

212

214

220

230

240

250

260

270

280

289

290

293

294

3' Conserved Region

a

100 kb 300 kb200 kb

CR1 MCPL CR1L MCP

Segment A Segment B

CR2 CD34

400 kb 500 kb

CR1MCP11 & CR1MCP12

CR1MCP6

CR1MCP5

CR1MCP11 & CR1MCP12

CR1MCP5 & CR1MCP6 CR1MCP5 & CR1MCP6

Figure 1. Multiple binding and amplification by primer pairs. Schematic representation of the genomic region on 1q32 showing the duplicated segments (purple and blue bars) containing the CR1 and MCP genes. The red and green lines indicate the positions of the forward (CR1MCP 5) and reverse primers (CR1MCP 6) designated P5+6. The amplified sequences of CR1 (purple) and CR1-like (blue), including Celera, have been aligned to show conserved regions flanking a polymorphic geometric element containing multiple complex components which distinguish CR1 and CR1-like sequences. Black shading and white text indicates conserved sequence. Numbers above and below the alignment represent nucleotide positions of CR1-like (Celera - NT_086601) and CR1 (NCBI – NT_021877.16) respectively. Also shown are locations of primers P 11+12 and BstN1 cutting sites (see Table 1). Conserved nucleotides at CR1-like positions 289-391 are part of a L1 element.

Page 204: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

0

20

40

60

80

100

120

140

0

20

40

60

80

100

120

140

I 1 C04/0162X 5I 1a C04/0163D 5 16 17II 1 C04/0175J 5 16II 1a C04/0176Q 5 6 20III 1 C04/0174C 5 16 20III 2 C04/0159A 5 6III 3 C04/0177X 5 6

Observed Segregation

242bp 331bp

Deduced Genotypes

404bp

Ban

d In

tens

ityB

and

Inte

nsity

489bp 501bp

5

6

1617

16

5

20

III

II

I

1a1

41 2

1a

3

1

III

II

I

1a1

41 2

1a

3

1

ab = 5, 0; 5, 0cd = 5,16; 5,17ac = 5, 0; 5,16ef = 5,20; 6, 0ce = 5,16; 5,20af = 5, 0; 6, 0af = 5, 0; 6, 0

4

I

1a1

41 2

1a

3

1

III

II

4

I

1a1

41 2

1a

3

1

III

II

0

20

40

60

80

100

120

140

0

20

40

60

80

100

120

140

I 1 C04/0157M 45 16I 1a C04/0156F 4 6 9 14II 2 C04/0172P 4 14II 3 C04/0164K 5 6 9 16II 3a C04/0220C 45III 1 C04/0165R 56 9III 2 C04/0166Y 4 6 9

Ban

d In

tens

ityB

and

Inte

nsity

Observed Segregation

Deduced Genotypes

16

9

64

5

242bp 331bp 404bp 489bp 501bp

I

III

II

1

1 2 3

12

3 3a2

1 1a

2

I

III

II

1

1 2 3

12

3 3a2

1 1a

2

4

56

9 1416

I

III

II

1 a1

1 2 3 3a

1 2

1aI

III

II

1 a1

1 2 3 3a

1 2

1a1 a1

1 2 3 3a

1 2

1a

ab = 4, 0; 5,16cd = 4,14; 6, 9ac = 4, 0; 4,14

bd = 5,16; 6, 9ef = 4, 0; 5, 0fd = 5, 0; 6, 9ed = 4, 0; 6, 9

Family 1 Family 2

Figure 4. Segregation of ancestral haplotypes. GMT P5+6 profiles from 3-generation families confirm unequivocal segregation of haplotypes. In each case the profile overlay has been restricted to 2 generations. Individual profiles are coloured as shown in the family tree and the laboratory specimen codes. The number assigned to each band is derived from Fig 2.

Page 205: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

5' Conserved Region Polymorphic Geometric Element 3' Conserved Region

25

26

29

31

41

52

81

90

97

98 * * * *

141

150

151

152

153

155

157

165

166

173

177

178

205

207

226

273

274

278

279

284

289

310

327

386

387

390

CR1-like (NT086601) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )7 C ( C C T T C C C T )6 C T C C T T C T A A - T G G G

CR1L_09001 (DQ007064) C T A A G A T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )1 C ( C C T T C C C T )5 C T C C T T C T A A - C G G G

CR1L_09501 (DQ007065) C T A A G A T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )2 C ( C C T T C C C T )5 C T C C T T C T A A - C G G G

CR1L_09501 (DQ007066) C T A A G A T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )2 C ( C C T T C C C T )5 C T C C T T C T A A - C G G G

CR1L_09501 (DQ007067) C T A A G A T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )2 C ( C C T T C C C T )5 C T C C T T C T A A - C G G G

CR1L_10001 (DQ007068) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )3 C ( C C T T C C C T )5 C T C C T T C T A A - T G G G

CR1L_15001 (DQ007069) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )6 C ( C C T T C C C T )6 C T C C T T C T A A - T G G G

CR1L_13001 (DQ007070) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )6 C ( C C T T C C C T )5 C T C C T T C T A A - T G G G

CR1L_13001 (DQ007071) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )6 C ( C C T T C C C T )5 C T C C T T C T A A - T G G G

CR1L_17001 (DQ007072) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )8 C ( C C T T C C C T )6 C T C C T T C T A A - T G G G

CR1L_11001 (DQ007073) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )6 C ( C C T T C C C T )4 C T C C T T C T A A - T G G G

CR1L_14001 (DQ007074) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )7 C ( C C T T C C C T )5 C T C C T T C T A A - T G G G

CR1L_14001 (DQ007075) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )7 C ( C C T T C C C T )5 C T C C T T C T A A - T G G G

CR1L_14001 (DQ007076) C T A A G T T - - - - C C T T C C C T T C T G ( C C T G )2 C T T G ( C C T T )2 C T T T ( G C T T )2 ( C C T T )2 - C C T C A C T ( C C T C )2 C ( C C T T )7 C ( C C T T C C C T )5 C T C C T T C T A A - T G G G

CR1_05001 (DQ007054) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )5 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_05001 (DQ007055) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )5 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_05001 (DQ007056) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )5 Y ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_05001 (DQ007057) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )5 Y ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_04001 (DQ007058) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )4 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_04001 (DQ007059) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )4 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_06001 (DQ007060) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )4 C ( C C T T )7 C ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_07001 (DQ007061) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )4 C ( C C T T )8 C ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_03001 (DQ007062) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )3 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1_04002 (DQ007063) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T T ( C C T C )4 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

CR1 (NT_021877.16) T G G C A T T C T T T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C - - - - T T C ( C C T C )4 C ( C C T T )5 T ( C C T T C C C T )1 - T - - - - - - A G T T T T A

25

26

29

31

41

52

81

82

83

84

85

94

104

106

114

115

130

134

135

154

156

175

182

186

191

212

214

230

289

290

293

CR1-like Defining Region Polymorphic Elements

Figure 6. Sequencing reveals the complexity of the haplospecific element and differences between CR1 and CR1-like. Sequence alignment identifies potential indels and polymorphic elements. The TC rich region is highly polymorphic in keeping with other haplospecific elements. Black shading and white text indicates consensus sequence on either side of the indel polymorphic region. The differences between CR1-like and CR1 are (i) G at 101, 105, 109, 113, 126 and 130 (*); (ii) length differences between 102 and 281bp; (iii) other indels. For the purposes of classifying the sequences of products we used (i) with or without the remainder. Numbers above and below the alignment represent nucleotide position of CR1-like (Celera - NT_086601) and CR1 (NCBI - NT _021877.16) respectively. Note “Y” indicates nucleotide C/T.

Page 206: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

§ No residues a C x x P x x x x x A x x x x x x x x x x F P x G T x L x Y E C x P x Y x x x x F S I x C x x x x x W x x x x D x C 57

b C Q P P P x x L H x E x x x x x x x x F x x G x E V x Y x C x P x Y D L R G x x x x x C x P Q G D W x P x x P x C 57Hosa_CR1_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T C E V K SHosa_CR1_12 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T C E V K SHosa_CR1_19 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S M R C T P Q G D W S P A A P T C E V K SPatr_CR1_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A T P T C E V K SPatr_CR1_14 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G A A S L R C T P Q G D W S P A A P T C E V K SPaha_CR1_5 C Q P P P D V L H G E R T Q R D K D I F Q T G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R C E V K SPaha_CR1_12 C Q P P P D V L H G E R T Q R D K D I F Q P G Q E V F Y I C E P G Y D L R G A A S L R C T P Q G D W S P A A P R C E V K SHosa_CR1-like_5 C Q P P P D V L H A E R T Q R D K D N F S P G Q E V F Y S C E P G Y D L R G S T Y L H C T P Q G D W S P A A P R CPatr_CR1-like_5 C Q P P P D V L H G E R T Q R D K D N F S P G E E V Y Y S C E P G Y D L R G S T Y L H C T P Q G D W S P E A P R CHosa_CR1_26 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C A V K SPatr_CR1_21 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C A V K SPatr_CR1_28 C Q P P P E I L H G E H T P S H Q D N F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C T V K SPaha_CR1_26 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P I C T V K SPacy_CR1-like_5 C Q P P P E I L H G E H T P S H Q D F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W N P E A P I CPaha_CR1_19 C Q P P P E I L H G E H T P S H Q D K F S P G Q E V F Y S C E P G Y D L R G A A S L H C T P Q G D W S P E A P R C A V K SHosa_CR1_33 C Q P P P E I L H G E H T L S H Q D N F S P G Q E V F Y S C E P S Y D L R G A A S L H C T P Q G D W S P E A P R C T V K SPatr_CR1-like_10C Q P P P E I L H G E H T L S H Q D N F S P G Q D V F Y S C E P G Y D L R G A A S L H C T P Q G D W T P E A P R CHosa_CR1-like_10C Q P P P E I L H G E H T L S H Q D N F L P G Q E V F Y S C E P S Y D L R G A A S L H C M P Q G D W T P E A P R C

c C P x P P K I Q N G H x I G G H V S L Y L P G M T I x Y I C D P G Y L L V G K G x I F C T D Q G I W S Q L D H Y C 57Hosa_CR1_36 C P D P P K I Q N G H Y I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y C K E V NPatr_CR1_31 C P H P P K I Q N G H D I G G H V S L Y L P G M T I S Y I C D P G Y L L V G K G F I F C T D Q G I W S Q L D H Y C K E V NPaha_CR1_29 C P H P P K I Q N G H Y I G G H V S L Y L P G M T I G Y I C D P G Y L L V G K G I I F C T D Q G I W S Q L D H Y C K E V N

d C P x P P x I x N G R H x G x x x x x x P x G K x x x Y x C D P H x D R G x x x x L I G E S x I R x T S x x x G N G V W S S x A P R C 67Patr_CR1_8 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HPatr_CR1_16 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HPatr_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HHosa_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HHosa_CR1_21 C P S P P V I P N G R H T G K P L E V F P F G K A V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HHosa_CR1_14 C P S P P V I P N G R H T G K P L E V F P F G K T V N Y T C D P H P D R G T S F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HPaha_CR1_7 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S P A P R C G I L G HPaha_CR1_14 C P S P P V I P N G R H T G K P L E V F P F G K A V T Y T C D P H P D R G M T F D L I G E S T I R C T S D P Q G N G V W S S R A P R C G I L G HHosa_CR1_28 C P N P P A I L N G R H T G T P S G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R C E L S V R A G HPatr_CR1_23 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R C E L P V H A G HHosa_CR1_35 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D R Q G N G V W S S P A P R C E L S V P A APatr_CR1_30 C P N P P A I L N G R H T G T P F G D I P Y G K E I S Y A C D T H P D R G M T F N L I G E S S I R C T S D P Q G N G V W S S P A P R C E L S V P A AHosa_CR1-like_12C P N P P A I L N G R H T G T P P G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R R T S E P H G N G V W S S P A P R CPatr_CR1-like_12C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D P H G N G V W S S P A P R CPaha_CR1_21 C P N P P A I L N G R H T G A L L G D I P Y G K E I S Y T C D P H R D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R C E L S V R A G HPaha_CR1_28 C P N P P A I L N G R H T G T P L G D I P Y G K E V S Y T C D P H P D R G M T F N L I G E S T I R C T S D L Q G N G V W S S P A P R C E L S V P A APacy_CR1-like_7 C P N P P A I L N G R H I G A P L G D I P Y G K E V S Y I C D P H P D R G M T V N L I G E S T I R C T S D P Q G N G V W S S P A P R C

e C x x P P x I x N G D F x S x x R x x F x x x x V V T Y x C x x x x x x x x x F x L V G E x S x x C T S x x x x x G x W S x P x P x C 67Hosa_CR1_10 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N KHosa_CR1_17 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N KHosa_CR1_3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N KPatr_CR1_3 C G L P P T I T N G D F I S T N R E N F H Y G S V V T Y R C N P G S G G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N KHosa_CR1_24 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N KPatr_CR1_12 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N P G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W S G P A P Q C I I P N KPatr_CR1_19 C G L P P T I A N G D F I S T N R E N F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S N D D Q V G I W A P Q C I I P N KPaha_CR1_10 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q C I I P N KPacy_CR1-like_3 C G L P P T I A N G D F I S T S R E Y F P Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPaha_CR1_17 C G L P P P I A N G D F I S T N R E Y F H Y G S V V T Y R C N L G S G R K K L F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CHosa_CR1-like_3 C G L P P T I A N G D F T S I S R E Y F H Y G S V V T Y H C N L G S R G K K V F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPatr_CR1-like_3 C G L P P T I A N G D F T S I S R E Y F H Y A S V V T Y H C N L G S G G K K V F E L V G E P S I Y C T S K D D Q V G I W S G P A P Q CPaha_CR1_3 C G L P P T I D N G D F F S A N K E Y F H Y G S V V T Y R C N L G S G G R K L F E L V G E P S I Y C T S N E D Q V G I W S G P A P Q C I I P N KHosa_CR1-like_8 C G L P P N I T N G Y F I S T D R E Y F H Y G S V V T Y H C N L G S R G R K V F E L V G E P S I Y C T S K D D Q V G V W S G P V P Q CPatr_CR1-like_8 C G L P P N I T N G Y F I S T D R E Y F H Y G S V V T Y R C N L G S R G R K V F E L V G E P S I Y C T S K G D Q V G V W Q CRano_Crry_3 C E I P P S I P N G D F F S P N R E D F H Y G M V V T Y Q C N T D A R G K K L F N L V G E P S I H C T S I D G Q V G V W S G P P P Q C I E L N KMumu_Crry_3 C E I P P G I P N G D F F S S T R E D F H Y G M V V T Y R C N T D A R G K A L F N L V G E P S L Y C T S N D G E I G V W S G P P P Q C I E L N KHosa_CR1_31 C E P P P T I S N G D F Y S N N R T S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R C I S T N KPatr_CR1_26 C E P P P T I S N G D F Y S N N R A S F H N G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G V W S S P P P R C I S T N KPaha_CR1_24 C K P P P T I S N G D F Y S N N R T S F H S G T V V T Y Q C H T G P D G E Q L F E L V G E R S I Y C T S K D D Q V G A W S S P P P R C I S T N K

f C x x P x V x x x x x x x x N x S x F S L x x x V x F R C x x G F x M x G x x x V x C x x x x x W x P x L P x C 56Hosa_CR1_32 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H C S R VPatr_CR1_27 C T A P E V E N A I R V P G N R S F F S L T E I V R F R C Q P G F V M V G S H T V Q C Q T N G R W G P K L P H C S R VPaha_CR1_25 C T A P E V K N G I R V P G N R S F F S L N E I V R F R C Q P G F V M V G S H T V Q C Q T N N R W G P K L P H C S R VRano_Crry_4 C T P P H V E N A V I V S K N K S L F S L R D M V E F R C Q D G F M M K G D S S V Y C R S L N R W E P Q L P S C F K V K SMumu_Crry_4 C T P P P Y V E N A V M L S E N R S L F S L R D I V E F R C H P G F I M K G A S S V H C Q S L N K W E P E L P S C F K G V IHosa_CR1_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R VHosa_CR1_11 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R VHosa_CR1_18 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R VHosa_CR1_25 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R R V K C Q A L N K W E P E L P S C S R VPatr_CR1_13 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F A M K G P R R V K C Q A L N K W E P E L P S C S R VPatr_CR1_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P P R V K C Q A L N K W E P E L P S C S R VPatr_CR1_20 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V K C Q A L N K W E P E L P S C S R VPatr_CR1-like_9 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C L P G F V M K R P P P R V Q C Q A L N K W E T E L P S CHosa_CR1-like_9 C T P P N V E G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P H R V Q C Q A L N K W E T E L P S CPacy_CR1-like_4 C M P P N V E N G V L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R H V Q C Q A L N K W E P E L P S CPatr_CR1-like_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F V M K G P R H V H C Q A L N K W E P E L P S CPaha_CR1_11 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C S R VPaha_CR1_18 C M P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C S R VPaha_CR1_4 C T P P N V E N G I L V S V N R S L F S L N E V V E F R C Q P G F V M K G P R R V Q C Q A L N K W E P E L P S C S R VHosa_CR1-like_4 C T P P N V E N G I L V S D N R S L F S L N E V V E F R C Q P G F G M K G P S H V K C Q A L N K W E P E L P S C

g-like x x x P H I x N G F R I x x x x P x x F x x x x x x x x x x x x x x x x x x x x x x x x x x

h C x x P x x M x G x x K x L x M K K x Y x Y G x x V x L x C E D G Y x L E G S x x S Q C Q x D x x W x P x L x x C 57

j C x x P x x P x N G x V H x x x x x x x G S x x x Y x C x x G x R L x G x x x x x C x x x x x x x x W x x x x P x C 58Pacy_CR1-like_2 C R N P K D P V N G M V H V I K D I Q F G S Q I N Y S C N K G Y R L I G S S S A T C I I S G N T V I W D N E T P I C

^ Paha_CR1_2 C R N P R D P V N G M V H V I K D I Q F G S Q I N Y S C T E G H R L I G S S S A T C I I S G N T V I W D N E T P I C E K I SPatr_CR1-like_2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G N T V I W D N K T P V CHosa_CR1-like_2 C R N P P D P V N G M A H V I K D I Q F G S Q I K Y S C P K G Y R L I G S S S A T C I I S G N T V I W D N K T P V CPatr_CR1_2 C R N P P D P V N G M V H V I K D I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I C D R I PHosa_CR1_2 C R N P P D P V N G M V H V I K G I Q F G S Q I K Y S C T K G Y R L I G S S S A T C I I S G D T V I W D N E T P I C D R I PRano_Crry_2 C E T P L D P Q N G I V H V N T D I R F G S S I T Y T C N E G Y R L I G S S S A M C I I S D Q S V A W D A E A P I C E S I PMumu_Crry_2 C K T P S D P E N G L V H V H T G I Q F G S R I N Y T C N Q G Y R L I G S S S A V C V I T D Q S V D W D T E A P I C E W I PHosa_CR1_9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C Q R I PHosa_CR1_16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C Q R I PPatr_CR1_11 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N A A H W S T K P P I C Q R I PPatr_CR1_18 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N S A H W S T K P P I C Q R I PHosa_CR1_23 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S T K P P I C Q R I PPaha_CR1_16 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C I I S G N T A H W S T K P P I C Q R I PPaha_CR1_9 C K T P P D P V N G M V H V I T D I Q V G S R I N Y S C T T G H R L I G H S S A E C V T S G N T A H W S T K P P I C Q R I PHosa_CR1-like_7 C E T P P V P V N G M V H V I T D I H V G S R I N Y S C T T G H R L I G H S S A E C I L S G N T A H W S M K P P I CPatr_CR1-like_7 C E T P P V P V N G M V H V I T D I H V G S R I N Y S C I T G H R L I G H S S A E C I L S G N T A H W S M K P P I CHosa_CR1_30 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I C E I I SPatr_CR1_25 C G P P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K K A P I C E I I SPaha_CR1_23 C G T P P E P F N G M V H I N T D T Q F G S T V N Y S C N E G F R L I G S P S T T C L V S G N N V T W D K E A P I C E I I S

k C x x x x G x L x x G x V x x P x x L Q L G A K V x F V C x x G x x L K G x x x S x C V L x G x x x x W N x S V P V C 59

A3650GH1208R

G5575CD1850H

A4041CSILENT

C5507GP1827R

A4828GR1601G

T4855AS1610T

A4870GI1615V

G3093TQ981H

C5654TT1876I

T455CV115A

A1360GT445A

T2078CI684T

A4795GK1590E

Figure 7. Polymorphisms within SCR subfamiliesCCPs such as CR1, CR1-like and Crry contain Short Consensus Repeats 17 which we have classified into subfamilies as a, b, c etc.3-5 Each CCP has its particular order such as (ajefbkd)5 ch in the case of CR15 but the subfamilies are remarkably conserved as indicated by the degree of shading. Some of the known SNPs 6,7,18 have been mapped to the subfamilies since those changing conserved residues are likely to have profound functional effects. SNPs within a, j or e are likely to alter ligand binding.6 The BstN1 site is within j.^ Translated from the mRNA sequence but absent in respective protein sequence. Note Hosa is Homo sapien, Mumu is Mus musculus, Rano is Rattus norvegicus, Patr is Pan troglodytes, Paha is Papio hamadryas and Pacy is Papio Cynocephalus.

Page 207: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 251 -

CHAPTER 5 – MS0504

McLure, C., P. Kesners, S. Lester, D.Male, C. Amadou, J. Dawkins, B. Stewart, J.

Williamson and R. Dawkins. (In Press). "Haplotyping of the Canine MHC

without the need for DLA typing". Intl J. Immunogenetics

FIGURES AND TABLES

Page 208: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 252 -

Page 209: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 2. MHC Genotyping Report Reporting date: Family ID: CYO1-Peta)Cafa DLA-DQ-F1 + DLA-DQ-R1

Generation Lab No P1 P2 P3 p4

P5 P6 P7 P8 P9 DRB1 DQA1 DQB1a c d e f

I a C04/0000499J c d 5 3 1 3 1 1 1 1 1 01301,01501 00301,0601 00301,00501 II 1 C04/0000470Q a c 3 3 1 1 1 1 1 1 3 00201,01501 00601,00901 00101,00301

C05/0000512L a c 7 4 1 1 1 1 1 1 3

C05/0000534J a c 9 4 1 1 1 1 1 1 3

II 1a C04/0000473K e f 7 1 1 1 3 1 1 3 1 00201,01501 00601,00901 00101,02002C05/0000511E e f 9 1 1 1 6 1 1 3 1

III 1 C04/0000512M c f 5 3 1 1 1 1 1 2 1 00201,01501 00601,00901 00101,00301C04/0000530J c f 9 5 1 1 1 1 1 4 1

III 2 C04/0000513T a e 4 1 1 1 3 1 1 1 3 00201,01501 00601,00901 00101,02002C04/0000531Q a e 9 1 1 1 3 1 1 1 3

III 3 C04/0000514A c f 6 3 1 1 1 1 1 3 1 00201,01501 00601,00901 00101,00301C04/0000532X c f 9 4 1 1 1 1 1 4 1

III 4 C04/0000518B c e 4 3 1 1 3 1 1 1 1 01501 00601 00301,02002C04/0000533D c e 9 4 1 1 6 1 1 1 1

C05/0000535Q c e 9 4 1 1 4 1 1 1 1

III 5 C04/0000516N c e 6 4 1 1 4 1 1 1 1 01501 00601 00301,02002C04/0000534K c e 9 4 1 1 6 1 1 1 1

III 6 C04/0000517U a e 5 1 1 1 2 1 1 1 3 00201,01501 00601,00901 00101,02002C04/0000535R a e 9 1 1 1 5 1 1 1 3

III 7 C04/0000515G a f 4 1 1 1 1 1 1 3 3 00201 00901 00101C04/0000536Y a f 8 1 1 1 1 1 1 3 3

III 8 C05/0000566Z c f 9 5 1 1 1 1 1 4 1 00101,00301III 9 C05/0000567F c f 9 5 1 1 1 1 1 4 1 00101,00301III 10 C05/0000568M a e 9 1 1 1 4 1 1 1 4 00101,02002III 11 C05/0000569T c f 9 5 1 1 1 1 1 4 1 00101,00301III 12 C05/0000570C a f 9 1 1 1 1 1 1 4 4 00101,III 13 C05/0000571J c f 9 5 1 1 1 1 1 4 1 00101,00301III 14 C05/0000572Q a e 9 1 1 1 4 1 1 1 4 00101,02002III 15 C05/0000573X a e 9 1 1 1 4 1 1 1 4 00101,02002

DQB1 allele 3 5 2 1 1

0 0 0 0 0

1 1 0 1 1

2

Name Lab No P1 P2 P3 P4 P5 P6 P7 P8 P9 DRB1 DQA1 DQB1

Ned C04/0000474R 5 1 3 1 3 1 1 1 1 01501 00601 02002, 02301

Theo C04/0000471X 6 1 3 1 1 1 3 1 1 01501 00601, 00901 00101, 02301

Kinky 5 * 01501 00601 02301*1 additional polymorphic fragment.

b) c)Deduced Haplotypes

DRB1 DQA1 DQB1a 00201 00901 00101b NTc 01501 00601 00301d 01301 00301 00501e 01501 00601 02002f 00201 00901 00101

Figure 2. MHC Genotyping Report. Genotype analysis of 21 Blue Heeler dogs using GMT and a comparison to DLA alleles assigned by coventional SBT. Figure 1a reports the results of GMT typing and DLA DR, DQA and DQB of the 3-generation family. 3 unrelated individuals have also been included. 1b shows the family tree of the grand parents, parents and offspring that have resulted from two separate matings. 1c is a summary of the deduced DLA haplotypes. Three unrelated Blue Heelers (Ned, Theo, Kinky) are included.

COMMENT1. The Cafa_classII F1 & R1 primers amplify within DLA DQ alpha 1 and DQ beta 1 (NW_139872, positions 143791-144189 and 210163-210376 respectively).

Page 210: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 252 -

Page 211: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 255 -

CHAPTER 6 – MS0507

McLure, C., J. Dawkins, B. Stewart, and R. Dawkins. (In preparation). " Identification

of traits and function in livestock by genomic matching: Genetically determining

homozygous and heterozygous polled cattle".

FIGURES AND TABLES

Page 212: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 256 -

Page 213: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

F

0003 9706980099190004 9835 9918

00059805 99250006982099310007 000898199915

000997109809 9917001098119908

0011

0135 9716 98349914

001297369831 9906

0013970898029923 00149705 9821 9909 9910

0015 0016 00170018 0019

0020 9702 97039839 9912 00219735 9921

0022

00239838 99270024 9731 9812 9929

0025

0026 0027

0028 98019904

0029 97349825 9924

0030 9829 9928 00319711 9823 9920 0032 0288005000510150 9985 0052 0053 005401550055 00560304 0057 015100580059 0060 0061 0062 9984 0063 006400650173 99920066

0067

006800690070 007100720073 0172 00740170 00750076 0154 00770156 998600780153 0079008000810113 0160 9991 008200830084 008501710086 00870088

0089 02539952

00909967 00910099

97779888 9970

01000101 9982 0102 01030104 99880105 010601070108 0109 0110 02040111 01120114 01150116

0117

01180119 01200121 012201230124 012501260127 0128 0129 99890130 0131 01320133 01340152 0157 0303 01580159

0161

0162 0163 02850164 0286 0287 0165

0166

0167

0168

01690174 0175 0176017701780179 0180 01810182 9979 01830184 0185

0186

0197

0198

019902000201 02020203 020502060207

0208

02090210 0210002101

02102

02110212 021302140215 02160217 021802190220 022102220223 0224

0250

0251

0252

02540255

0256 02570258

0259026002610262 0263 02640265

0266

026702680269 0270 02710272 02730274

0275

0276 027702780279

02800281

02820283

0284

02890290

0291 0292

029302940295 0295a

0296 0297 0298

0299

0300040903010427

0302 0413

03050363 045503060423 03070308

0309 04000310

031 1 0424 0312 0313

03140315

0316

0317

0318

03190422 03200401

0321 0322 03230324 0325

03260327 03280329 0330 033103320333 0334

0335

0336

03370338 0339

03400341 03420343 0344 03450346 03470348

034903500351

0352 03530354

03550356 0357 03580359 0360036103620388 0391 0392 039403640365 0366 0367

03680369

0370

03710372

0373 0374 03750376037703780379 038003810382 0383 03840385 0386 03870389 0390039303950396 0397 0398

04020403

0405

0406

0407 040804100411 0412

0414 04150416 0417 04180419 0420 0421

04250426 0428 0429

0430

04310432

0433

0434

0435

04360437

0438

0439

0440 0441

0442 0443

0444 044504460447 04480449 04500451 0452

0453

04540456 0457

0N 033

1U 058

6X Y

7019

79Y

8102

8117

8223

8276

8278

83038324

8336

8342

84088439

8470

8495

8504 8520

8531

8552 8557

8574

8607

8615 86618676

8679

8681

8703 8710

8717

8718

8755

8766

8767

8769

8828 88538856

8857

8865

8878 8890

8900

8903 89058909

8970 8914 8934 8941 8956

8959

8962 9003

9009

9028

9054

9056 9078

9081

9087 90929112 9127 91369138

9156

9160

9163

9181 9202 9203 92169217

9219

922092219223

9239

9242 92539258 9301

9302

9314 9318 93199320

9326

9327

9335

9336

9339

9349 93539361

93639365 9370 93839398 9402 9403 94049405

9407940994139414

9415

9416

942794309432 9438 9445 9451

9453

9455 95029503 9505

9506

9509

951 19521

9526 95489555

9559

96049608

9612 9729 9613

9615 96169617

9624 97049804

9626

9631

9634 96359636

9639 96419645

9649

965697559876

96699675

97009701 98169902 99039707 9818 9950 9709 9712 98159913

9713

9714 9715 9717971897199826 9827 97209814 990099019721 9722975098839723 98039724 97259726 97279728 973097329733 9737 985198529751 9752

9753 9890

9754 97569857 9969 97579966 975898629759 97609853 99639761 9762

976398759960

97649879 9971 9765 9884

9766 9882

9767 98649973

9768

97699865 99549770 9771

9772 9873

97739866 997297749964

9775 9870 9978

9776 9886979Z9806 9807

9808

9810

9813993098179822

9824

98289974

9830

983299589833 9836 99269837 99329850

9854

9855

9856

98589956

9859

9860 9861 99579863 9867

9868

9869 9871 9872 9874996898779959 9878 988099519881

9885

9887 9889 98919892

9899 9905

9907

9911

9916

9922

99539955 9961

9962

9965 99759976

9977

99809981 998399879990 9993

9994

ADAM

ARNIE

B301

BE70

BE8CALIDJ

D1 14

DexII

Du sty D

Emir

Exc els

Exodus

EXTRA G4 1HMAYER JNT1 Ken

L157

Lad y

Martello

MPF

Mrs

O2 7PANZER

PK09 2PL1 72

Poloni

R?? ?1 R?? ?2 R?? ?3

R?? ?4

RaffDex3

RB???2 RB???3

RedRogue

RedVolt

RNT1

SBL6 3K

SPHERC

SPP

U0 24

U0 36 Usc h

W00 3

W01 6

W03 6 W13 W39

Y0 82

Yarl

YellVolt

Z012Z509Zazou Zodia c

Zulu

0002

0001

KarKarg

0004 9835 9918 001098119908

0028 9801990400560304 00860101 9982 0104 99880109 01200130 01340157 0303

0168

0200 0207 0212 0224

0250

0254

0258

0267 0270 0274

028403000409

03140317

0318

032004010327 0329

03370339

0343 0348

0355035803590373 03780379 03810382039504310448 0450

0N033

1U058

8102 8223

8278

830383248495

8552

8607

8679 8828

8865

89038959

9112 9138930193189339

9349 9361

9370 9383

9612 9729 9624 970498049950 97099728 97569857 9969

9775 9870 9978

9817

9830

9837 99329850

98689981

ADAM

ARNIE

CALIDJ

DustyD

Excels

EXTRAHMAYER PANZER

SPP

U024

U036 Usch

W003

W016Y082 Zazou

Zulu

Figure 1. Pedigree of 920 cattle.The pedigree above shows the controlled interbreeding within a herd over the last 20 years. Names are assigned to those cattle introduced tothe family, Numbers are assigned to cattle that were born on the property, such that the first two numbers represent the year and the last tworepresent the number in that year. DNA and serum has been collected from all the individual shaded grey. The cattle shaded green have not beencollected as yet. The DNA collection covers at least 4 generations with multiple offspring in each. These samples will be invaluable with thefuture genetic analysis of phenotypic traits.

Page 214: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

Figure 2. Beef Breeder database for phenotypic and genotypic data management.

Screen shots of the many layers to the Beef Breeder database. a) Main Page, b) Data entry and analysis options, 1c) Search cattle based on any category, d) Individual cattle details, e) 4 generation family trees identifying parents, grand parents and great grand parents, f) Progeny Table, f) Entry of updated data , h) Laboratory data management.

a)

b)

c)

d)

e)

f)

g)

h)

Page 215: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 261 -

BIBLIOGRAPHY

Page 216: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 262 -

Page 217: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 263 -

BIBLIOGRAPHY Abraham, L. J., D. C. Du, K. Zahedi, R. L. Dawkins and A. S. Whitehead (1991). "Haplotypic

polymorphisms of the TNFB gene." Immunogenetics 33(1): 50-3.

Abraham, L. J., C. Leelayuwat, G. Grimsley, M. A. Degli-Esposti, A. Mann, W. J. Zhang, F. T.

Christiansen and R. L. Dawkins (1992). "Sequence differences between HLA-B and TNF

distinguish different MHC ancestral haplotypes." Tissue Antigens 39(3): 117-21.

Accardo, P., P. Sanchez-Corral, O. Criado, E. Garcia and S. Rodriguez de Cordoba (1996). "Binding of

human complement component C4b-binding protein (C4BP) to Streptococcus pyogenes involves

the C4b-binding site." J Immunol 157(11): 4935-9.

Acland, G. M., K. Ray, C. S. Mellersh, W. Gu, A. A. Langston, J. Rine, E. A. Ostrander and G. D.

Aguirre (1998). "Linkage analysis and comparative mapping of canine progressive rod-cone

degeneration (prcd) establishes potential locus homology with retinitis pigmentosa (RP17) in

humans." Proc Natl Acad Sci U S A 95(6): 3048-53.

Adams, E. M., M. C. Brown, M. Nunge, M. Krych and J. P. Atkinson (1991). "Contribution of the

repeating domains of membrane cofactor protein (CD46) of the complement system to ligand

binding and cofactor activity." J Immunol 147(9): 3005-11.

Ahearn, J. M. and A. M. Rosengard (1998). Complement Receptors. The human complement system in

health and disease. J. E. Volanakis and N. M. Frank. New York, Marcel Dekker: 167-202.

Albrecht, J. C. and B. Fleckenstein (1992). "New member of the multigene family of complement control

proteins in herpesvirus saimiri." J Virol 66(6): 3937-40.

Alper, C. A. (1998). "A history of complement genetics." Exp Clin Immunogenet 15(4): 203-12.

Alper, C. A., D. Raum, S. Karp, Z. L. Awdeh and E. J. Yunis (1983). "Serum complement 'supergenes' of

the major histocompatibility complex in man (complotypes)." Vox Sang 45(1): 62-7.

Awdeh, Z. L., D. Raum, E. J. Yunis and C. A. Alper (1983). "Extended HLA/complement allele

haplotypes: evidence for T/t-like complex in man." Proc Natl Acad Sci U S A 80(1): 259-63.

Bailey, J. A., Z. Gu, R. A. Clark, K. Reinert, R. V. Samonte, S. Schwartz, M. D. Adams, E. W. Myers, P.

W. Li and E. E. Eichler (2002). "Recent segmental duplications in the human genome." Science

297(5583): 1003-7.

Bell, E. (2000). "Murine embryonic survival depends on regulation of complement." Immunology Today

21(3): 109.

Bergelson, J. M., J. G. Mohanty, R. L. Crowell, N. F. St John, D. M. Lublin and R. W. Finberg (1995).

"Coxsackievirus B3 adapted to growth in RD cells binds to decay-accelerating factor (CD55)." J

Virol 69(3): 1903-6.

Page 218: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 264 -

Berggard, K., E. Johnsson, F. R. Mooi and G. Lindahl (1997). "Bordetella pertussis binds the human

complement regulator C4BP: role of filamentous hemagglutinin." Infect Immun 65(9): 3638-43.

Berkower, C., M. Ravins, A. E. Moses and E. Hanski (1999). "Expression of different group A

streptococcal M proteins in an isogenic background demonstrates diversity in adherence to and

invasion of eukaryotic cells." Mol Microbiol 31(5): 1463-75.

Bernet, J., J. Mullick, A. K. Singh and A. Sahu (2003). "Viral mimicry of the complement system." J

Biosci 28(3): 249-64.

Birmingham, D. J., W. Chen, G. Liang, H. C. Schmitt, K. Gavit and H. N. Nagaraja (2003). "A CR1

polymorphism associated with constitutive erythrocyte CR1 levels affects binding to C4b but not

C3b." Immunology 108(4): 531-8.

Birmingham, D. J. and L. A. Hebert (2001). "CR1 and CR1-like: the primate immune adherence

receptors." Immunol Rev 180: 100-11.

Blackmore, T. K., V. A. Fischetti, T. A. Sadlon, H. M. Ward and D. L. Gordon (1998). "M protein of the

group A Streptococcus binds to the seventh short consensus repeat of human complement factor

H." Infect Immun 66(4): 1427-31.

Blom, A. M., J. Webb, B. O. Villoutreix and B. Dahlback (1999). "A cluster of positively charged amino

acids in the C4BP alpha-chain is crucial for C4b binding and factor I cofactor function." J Biol

Chem 274(27): 19237-45.

Blom, A. M., A. F. Zadura, B. O. Villoutreix and B. Dahlback (2000). "Positively charged amino acids at

the interface between alpha-chain CCP1 and CCP2 of C4BP are required for regulation of the

classical C3-convertase." Mol Immunol 37(8): 445-53.

Blumbach, B., Z. Pancer, B. Diehl-Seifert, R. Steffen, J. Munkner, I. Muller and W. E. Muller (1998).

"The putative sponge aggregation receptor. Isolation and characterization of a molecule

composed of scavenger receptor cysteine-rich domains and short consensus repeats." J Cell Sci

111 ( Pt 17): 2635-44.

Bora, N. S., D. M. Lublin, B. V. Kumar, R. D. Hockett, V. M. Holers and J. P. Atkinson (1989).

"Structural gene for human membrane cofactor protein (MCP) of complement maps to within

100 kb of the 3' end of the C3b/C4b receptor gene." J Exp Med 169(2): 597-602.

Brenneman, R. A., S. K. Davis, J. O. Sanders, B. M. Burns, T. C. Wheeler, J. W. Turner and J. F. Taylor

(1996). "The polled locus maps to BTA1 in a Bos indicus x Bos taurus cross." J Hered 87(2):

156-61.

Brodbeck, W. G., L. Kuttner-Kondo, C. Mold and M. E. Medof (2000). "Structure/function studies of

human decay-accelerating factor." Immunology 101(1): 104-11.

Brodbeck, W. G., D. Liu, J. Sperry, C. Mold and M. E. Medof (1996). "Localization of classical and

alternative pathway regulatory activity within the decay-accelerating factor." J Immunol 156(7):

2528-33.

Page 219: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 265 -

Campbell, R. D. (1988). "The molecular genetics of components of the complement system." Baillieres

Clin Rheumatol 2(3): 547-75.

Campbell, R. D. and J. Trowsdale (1993). "Map of the human MHC." Immunol Today 14(7): 349-52.

Caras, I. W., M. A. Davitz, L. Rhee, G. Weddell, D. W. Martin, Jr. and V. Nussenzweig (1987). "Cloning

of decay-accelerating factor suggests novel use of splicing to generate two proteins." Nature

325(6104): 545-9.

Carroll, M. C., E. M. Alicot, P. J. Katzman, L. B. Klickstein, J. A. Smith and D. T. Fearon (1988).

"Organization of the genes encoding complement receptors type 1 and 2, decay-accelerating

factor, and C4-binding protein in the RCA locus on human chromosome 1." J Exp Med 167(4):

1271-80.

Casasnovas, J. M., M. Larvie and T. Stehle (1999). "Crystal structure of two CD46 domains reveals an

extended measles virus-binding surface." Embo J 18(11): 2911-22.

Changelian, P. S. and D. T. Fearon (1986). "Tissue-specific phosphorylation of complement receptors

CR1 and CR2." J Exp Med 163(1): 101-15.

Chen, W., C. M. Logar, X. P. Shen and D. J. Birmingham (2000). "The chimpanzee and cynomolgus

monkey erythrocyte immune adherence receptors are encoded by CR1-like genes."

Immunogenetics 52(1-2): 46-52.

Clarkson, N. A., R. Kaufman, D. M. Lublin, T. Ward, P. A. Pipkin, P. D. Minor, D. J. Evans and J. W.

Almond (1995). "Characterization of the echovirus 7 receptor: domains of CD55 critical for

virus binding." J Virol 69(9): 5497-501.

Consortium, T. C. e. S. (1998). "Genome sequence of the nematode C. elegans: a platform for

investigating biology." Science 282(5396): 2012-8.

Cooper, N. R. (1998). Complement and Viruses. The human complement system in health and disease. J.

E. Volanakis and N. M. Frank. New York, Marcel Dekker: 393-407.

Coyne, K. E., S. E. Hall, S. Thompson, M. A. Arce, T. Kinoshita, T. Fujita, D. J. Anstee, W. Rosse and

D. M. Lublin (1992). "Mapping of epitopes, glycosylation sites, and complement regulatory

domains in human decay accelerating factor." J Immunol 149(9): 2906-13.

Dahlback, B., C. A. Smith and H. J. Muller-Eberhard (1983). "Visualization of human C4b-binding

protein and its complexes with vitamin K-dependent protein S and complement protein C4b."

Proc Natl Acad Sci U S A 80(11): 3461-5.

Dawkins, R., C. Leelayuwat, S. Gaudieri, G. Tay, J. Hui, S. Cattley, P. Martinez and J. Kulski (1999).

"Genomics of the major histocompatibility complex: haplotypes, duplication, retroviruses and

disease." Immunol Rev 167: 275-304.

Page 220: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 266 -

Dawkins, R. L., F. T. Christiansen, P. H. Kay, M. Garlepp, J. McCluskey, P. N. Hollingsworth and P. J.

Zilko (1983). "Disease associations with complotypes, supratypes and haplotypes." Immunol

Rev 70: 1-22.

Dawkins, R. L., M. P. Degli-Esposti, L. J. Abraham, W. Zhang and F. T. Christiansen (1991).

Conservation versus polymorphism of the MHC in relation to transplantation, immune response

and autoimmune disease. Molecular Evolution of the Major Histocompatibility Complex. J.

Klein and D. Klein. Heidelberg, Springer-Verlag. 59.

Dawkins, R. L., A. Leaver, P. U. Cameron, E. Martin, P. H. Kay and F. T. Christiansen (1989). "Some

disease-associated ancestral haplotypes carry a polymorphism of TNF." Hum Immunol 26(2):

91-7.

Debenham, S. L., E. A. Hart, J. L. Ashurst, K. L. Howe, M. A. Quail, W. E. Ollier and M. M. Binns

(2005). "Genomic sequence of the class II region of the canine MHC: comparison with the MHC

of other mammalian species." Genomics 85(1): 48-59.

Degli-Esposti, M. A., L. J. Abraham, V. McCann, T. Spies, F. T. Christiansen and R. L. Dawkins

(1992a). "Ancestral haplotypes reveal the role of the central MHC in the immunogenetics of

IDDM." Immunogenetics 36(6): 345-56.

Degli-Esposti, M. A., A. Andreas, F. T. Christiansen, B. Schalke, E. Albert and R. L. Dawkins (1992b).

"An approach to the localization of the susceptibility genes for generalized myasthenia gravis by

mapping recombinant ancestral haplotypes." Immunogenetics 35(6): 355-64.

Degli-Esposti, M. A., A. L. Leaver, F. T. Christiansen, C. S. Witt, L. J. Abraham and R. L. Dawkins

(1992c). "Ancestral haplotypes: conserved population MHC haplotypes." Hum Immunol 34(4):

242-52.

Deininger, P. L., J. V. Moran, M. A. Batzer and H. H. Kazazian, Jr. (2003). "Mobile elements and

mammalian genome evolution." Curr Opin Genet Dev 13(6): 651-8.

DeNise, S., E. Johnston, J. Halverson, K. Marshall, D. Rosenfeld, S. McKenna, T. Sharp and J. Edwards

(2004). "Power of exclusion for parentage verification and probability of match for identity in

American Kennel Club breeds using 17 canine microsatellite markers." Anim Genet 35(1): 14-7.

Dhiman, N., R. M. Jacobson and G. A. Poland (2004). "Measles virus receptors: SLAM and CD46." Rev

Med Virol 14(4): 217-29.

Dorig, R. E., A. Marcil, A. Chopra and C. D. Richardson (1993). "The human CD46 molecule is a

receptor for measles virus (Edmonston strain)." Cell 75(2): 295-305.

Dykman, T. R., J. A. Hatch, M. S. Aqua and J. P. Atkinson (1985). "Polymorphism of the C3b/C4b

receptor (CR1): characterization of a fourth allele." J Immunol 134(3): 1787-9.

Endo, T., K. Ikeo and T. Gojobori (1996). "Large-scale search for genes on which positive selection may

operate." Mol Biol Evol 13(5): 685-90.

Page 221: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 267 -

Evans, D. J. and J. W. Almond (1998). "Cell receptors for picornaviruses as determinants of cell tropism

and pathogenesis." Trends Microbiol 6(5): 198-202.

Favoreel, H. W., G. R. Van de Walle, H. J. Nauwynck and M. B. Pensaert (2003). "Virus complement

evasion strategies." J Gen Virol 84(Pt 1): 1-15.

Fingeroth, J. D., J. J. Weis, T. F. Tedder, J. L. Strominger, P. A. Biro and D. T. Fearon (1984). "Epstein-

Barr virus receptor of human B lymphocytes is the C3d receptor CR2." Proc Natl Acad Sci U S

A 81(14): 4510-4.

Fodor, W. L., S. A. Rollins, S. Bianco-Caron, R. P. Rother, E. R. Guilmette, W. V. Burton, J. C.

Albrecht, B. Fleckenstein and S. P. Squinto (1995). "The complement control protein homolog

of herpesvirus saimiri regulates serum complement by inhibiting C3 convertase activity." J Virol

69(6): 3889-92.

Francisco, L. V., A. A. Langston, C. S. Mellersh, C. L. Neal and E. A. Ostrander (1996). "A class of

highly polymorphic tetranucleotide repeats for canine genetic mapping." Mamm Genome 7(5):

359-62.

Garbe, J. R. and Y. Da (2004). "Pedigraph user manual version 2.0 trail version." Department of Animal

Science, University of Minnesota.

Garlepp, M. J., P. H. Kay, B. R. Farrow and R. L. Dawkins (1984). "Autoimmunity in spontaneous

myasthenia gravis in dogs." Clinical Immunology and Immunopathology 31(2): 301 - 6.

Gaudieri, S., R. L. Dawkins, K. Habara, J. K. Kulski and T. Gojobori (2000a). Nucleotide diversity within

the human Major Histocompatibility Complex: Function of hitch-hiking effect, duplications,

indels and recombination. The Major Histocompatibility Complex: Evolution, Structure, and

Function. M. Kasahara. Tokyo, Springer-Verlag: 186-200.

Gaudieri, S., R. L. Dawkins, K. Habara, J. K. Kulski and T. Gojobori (2000b). "SNP profile within the

human major histocompatibility complex reveals an extreme and interrupted level of nucleotide

diversity." Genome Res 10(10): 1579-86.

Gaudieri, S., K. M. Giles, J. K. Kulski and R. L. Dawkins (1997a). "Duplication and polymorphism in the

MHC: Alu generated diversity and polymorphism within the PERB11 gene family." Hereditas

127(1-2): 37-46.

Gaudieri, S., J. K. Kulski, L. Balmer, K. M. Giles, H. Inoko and R. L. Dawkins (1997b). "Retroelements

and segmental duplications in the generation of diversity within the MHC." DNA Seq 8(3): 137-

41.

Gaudieri, S., J. K. Kulski, R. L. Dawkins and T. Gojobori (1999a). "Different evolutionary histories in

two subgenomic regions of the major histocompatibility complex." Genome Res 9(6): 541-9.

Page 222: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 268 -

Gaudieri, S., J. K. Kulski, R. L. Dawkins and T. Gojobori (1999b). "Extensive nucleotide variability

within a 370 kb sequence from the central region of the major histocompatibility complex."

Gene 238(1): 157-61.

Gaudieri, S., C. Leelayuwat, G. K. Tay, D. C. Townend and R. L. Dawkins (1997c). "The major

histocompatability complex (MHC) contains conserved polymorphic genomic sequences that are

shuffled by recombination to form ethnic-specific haplotypes." J Mol Evol 45(1): 17-23.

Gaudieri, S., C. Leelayuwat, D. C. Townend, J. K. Kulski and R. L. Dawkins (1997d). "Genomic

characterization of the region between HLA-B and TNF: implications for the evolution of

multicopy gene families." J Mol Evol 44 Suppl 1: S147-54.

Gaudieri, S., N. Longman-Jacobsen, G. K. Tay and R. L. Dawkins (2001). "Sequence analysis of the

MHC class I region reveals the basis of the genomic matching technique." Hum Immunol 62(3):

279-85.

Georges, M., R. Drinkwater, T. King, A. Mishra, S. S. Moore, D. Nielsen, L. S. Sargeant, A. Sorensen,

M. R. Steele, X. Zhao and et al. (1993). "Microsatellite mapping of a gene affecting horn

development in Bos taurus." Nat Genet 4(2): 206-10.

Gigli, I., T. Fujita and V. Nussenzweig (1979). "Modulation of the classical pathway C3 convertase by

plasma proteins C4 binding protein and C3b inactivator." Proc Natl Acad Sci U S A 76(12):

6596-600.

Gordon, D. L., R. M. Kaufman, T. K. Blackmore, J. Kwong and D. M. Lublin (1995). "Identification of

complement regulatory domains in human factor H." J Immunol 155(1): 348-56.

Guillaudeux, T., M. Janer, G. K. Wong, T. Spies and D. E. Geraghty (1998). "The complete genomic

sequence of 424,015 bp at the centromeric end of the HLA class I region: gene content and

polymorphism." Proc Natl Acad Sci U S A 95(16): 9494-9.

Halverson, J., J. Dvorak and T. Stevenson (1995). Microsatellite sequences for canine genotyping. USA.

Harlizius, B., I. Tammen, K. Eichler, A. Eggen and D. J. Hetzel (1997). "New markers on bovine

chromosome 1 are closely linked to the polled gene in Simmental and Pinzgauer cattle." Mamm

Genome 8(4): 255-7.

Hedrick, P. W., R. N. Lee and C. Buchanan (2003). "Canine parvovirus enteritis, canine distemper, and

major histocompatibility complex genetic variation in Mexican wolves." J Wildl Dis 39(4): 909-

13.

Heine-Suner, D., M. A. Diaz-Guillen, F. P. de Villena, M. Robledo, J. Benitez and S. Rodriguez de

Cordoba (1997). "A high-resolution map of the regulator of the complement activation gene

cluster on 1q32 that integrates new genes and markers." Immunogenetics 45(6): 422-7.

Heine-Suner, D., M. A. Diaz-Guillen, A. J. Lange and S. Rodriguez de Cordoba (1998). "Sequence and

structure of the human 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase heart isoform gene

(PFKFB2)." Eur J Biochem 254(1): 103-10.

Page 223: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 269 -

Hellwage, J., S. Kuhn and P. F. Zipfel (1997). "The human complement regulatory factor-H-like protein

1, which represents a truncated form of factor H, displays cell-attachment activity." Biochem J

326 ( Pt 2): 321-7.

Herrera, A. H., L. Xiang, S. G. Martin, J. Lewis and J. G. Wilson (1998). "Analysis of complement

receptor type 1 (CR1) expression on erythrocytes and of CR1 allelic markers in Caucasian and

African American populations." Clin Immunol Immunopathol 87(2): 176-83.

Holers, V. M., T. Kinoshita and H. Molina (1992). "The evolution of mouse and human complement C3-

binding proteins: divergence of form but conservation of function." Immunol Today 13(6): 231-

6.

Horstmann, R. D., H. J. Sievertsen, J. Knobloch and V. A. Fischetti (1988). "Antiphagocytic activity of

streptococcal M protein: selective binding of complement control protein factor H." Proc Natl

Acad Sci U S A 85(5): 1657-61.

Horton, R., D. Niblett, S. Milne, S. Palmer, B. Tubby, J. Trowsdale and S. Beck (1998). "Large-scale

sequence comparisons reveal unusually high levels of variation in the HLA-DQB1 locus in the

class II region of the human MHC." J Mol Biol 282(1): 71-97.

Hoshino, M., F. Matsuzaki, Y. Nabeshima and C. Hama (1993). "hikaru genki, a CNS-specific gene

identified by abnormal locomotion in Drosophila, encodes a novel type of protein." Neuron

10(3): 395-407.

Hourcade, D., V. M. Holers and J. P. Atkinson (1989). "The regulators of complement activation (RCA)

gene cluster." Adv Immunol 45: 381-416.

Hourcade, D., M. K. Liszewski, M. Krych-Goldberg and J. P. Atkinson (2000). "Functional domains,

structural variations and pathogen interactions of MCP, DAF and CR1." Immunopharmacology

49(1-2): 103-16.

Hourcade, D., T. W. Post, V. M. Holers, D. M. Lublin and J. P. Atkinson (1990). "Polymorphisms of the

regulators of complement activation gene cluster." Complement Inflamm 7(4-6): 302-14.

Hughes, J. F. and J. M. Coffin (2001). "Evidence for genomic rearrangements mediated by human

endogenous retroviruses during primate evolution." Nat Genet 29(4): 487-9.

Hughes, J. F. and J. M. Coffin (2004). "Human endogenous retrovirus K solo-LTR formation and

insertional polymorphisms: implications for human and viral evolution." Proc Natl Acad Sci U S

A 101(6): 1668-72.

Iwata, K., T. Seya, Y. Yanagi, J. M. Pesando, P. M. Johnson, M. Okabe, S. Ueda, H. Ariga and S.

Nagasawa (1995). "Diversity of sites for measles virus binding and for inactivation of

complement C3b and C4b on membrane cofactor protein CD46." J Biol Chem 270(25): 15148-

52.

Page 224: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 270 -

Johnsson, E., K. Berggard, H. Kotarsky, J. Hellwage, P. F. Zipfel, U. Sjobring and G. Lindahl (1998).

"Role of the hypervariable region in streptococcal M proteins: binding of a human complement

inhibitor." J Immunol 161(9): 4894-901.

Johnsson, E., A. Thern, B. Dahlback, L. O. Heden, M. Wikstrom and G. Lindahl (1996). "A highly

variable region in members of the streptococcal M protein family binds the human complement

regulator C4BP." J Immunol 157(7): 3021-9.

Jouve, M., M. I. Garcia, P. Courcoux, A. Labigne, P. Gounon and C. Le Bouguenec (1997). "Adhesion to

and invasion of HeLa cells by pathogenic Escherichia coli carrying the afa-3 gene cluster are

mediated by the AfaE and AfaD proteins, respectively." Infect Immun 65(10): 4082-9.

Kalli, K. R. and D. T. Fearon (1994). "Binding of C3b and C4b by the CR1-like site in murine CR1." J

Immunol 152(6): 2899-903.

Kallstrom, H., M. K. Liszewski, J. P. Atkinson and A. B. Jonsson (1997). "Membrane cofactor protein

(MCP or CD46) is a cellular pilus receptor for pathogenic Neisseria." Mol Microbiol 25(4): 639-

47.

Kennedy, L. J., A. Barnes, G. M. Happ, R. J. Quinnell, D. Bennett, J. M. Angles, M. J. Day, N.

Carmichael, J. F. Innes, D. Isherwood, S. D. Carter, W. Thomson and W. E. Ollier (2002a).

"Extensive interbreed, but minimal intrabreed, variation of DLA class II alleles and haplotypes

in dogs." Tissue Antigens 59(3): 294-304.

Kennedy, L. J., A. Barnes, G. M. Happ, R. J. Quinnell, O. Courtenay, S. D. Carter, W. E. Ollier and W.

Thomson (2002b). "Evidence for extensive DLA polymorphism in different dog populations."

Tissue Antigens 60(1): 43-52.

Ketheesan, N., S. Gaudieri, C. S. Witt, G. K. Tay, D. C. Townend, F. T. Christiansen and R. L. Dawkins

(1999). "Reconstruction of the block matching profiles." Hum Immunol 60(2): 171-6.

Kijas, J. W., B. J. Miller, S. E. Pearce-Kelling, G. D. Aguirre and G. M. Acland (2003). "Canine models

of ocular disease: outcross breedings define a dominant disorder present in the English mastiff

and bull mastiff dog breeds." J Hered 94(1): 27-30.

Kirkitadze, M. D. and P. N. Barlow (2001). "Structure and flexibility of the multiple domain proteins that

regulate complement activation." Immunol Rev 180: 146-61.

Kirkitadze, M. D., D. T. Dryden, S. M. Kelly, N. C. Price, X. Wang, M. Krych, J. P. Atkinson and P. N.

Barlow (1999). "Co-operativity between modules within a C3b-binding site of complement

receptor type 1." FEBS Lett 459(1): 133-8.

Klein, J., C. O'hUigin, M. Kasahara, V. Vincek, D. Klein and F. Figueroa (1991). Frozen haplotypes in

Mhc evolution. Molecular Evolution of the Major Histocompatibility Complex. J. Klein and D.

Klein. Berlin, Springer-Verlag: 261-86.

Klein, J. and N. Takahata (1990). "The major histocompatibility complex and the quest for origins."

Immunol Rev 113: 5-25.

Page 225: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 271 -

Klickstein, L. B., T. J. Bartow, V. Miletic, L. D. Rabson, J. A. Smith and D. T. Fearon (1988).

"Identification of distinct C3b and C4b recognition sites in the human C3b/C4b receptor (CR1,

CD35) by deletion mutagenesis." J Exp Med 168(5): 1699-717.

Kotarsky, H., J. Hellwage, E. Johnsson, C. Skerka, H. G. Svensson, G. Lindahl, U. Sjobring and P. F.

Zipfel (1998). "Identification of a domain in human factor H and factor H-like protein-1 required

for the interaction with streptococcal M proteins." J Immunol 160(7): 3349-54.

Kotwal, G. J., A. W. Hugin and B. Moss (1989). "Mapping and insertional mutagenesis of a vaccinia

virus gene encoding a 13,800-Da secreted protein." Virology 171(2): 579-87.

Kotwal, G. J., S. N. Isaacs, R. McKenzie, M. M. Frank and B. Moss (1990). "Inhibition of the

complement cascade by the major secretory protein of vaccinia virus." Science 250(4982): 827-

30.

Kotwal, G. J. and B. Moss (1988). "Analysis of a large cluster of nonessential genes deleted from a

vaccinia virus terminal transposition mutant." Virology 167(2): 524-37.

Krushkal, J., O. Bat and I. Gigli (2000). "Evolutionary relationships among proteins encoded by the

regulator of complement activation gene cluster." Mol Biol Evol 17(11): 1718-30.

Krych, M., L. Clemenza, D. Howdeshell, R. Hauhart, D. Hourcade and J. P. Atkinson (1994). "Analysis

of the functional domains of complement receptor type 1 (C3b/C4b receptor; CD35) by

substitution mutagenesis." J Biol Chem 269(18): 13273-8.

Krych, M., R. Hauhart and J. P. Atkinson (1998). "Structure-function analysis of the active sites of

complement receptor type 1." J Biol Chem 273(15): 8623-9.

Krych, M., D. Hourcade and J. P. Atkinson (1991). "Sites within the complement C3b/C4b receptor

important for the specificity of ligand binding." Proc Natl Acad Sci U S A 88(10): 4353-7.

Krych-Goldberg, M., R. E. Hauhart, V. B. Subramanian, B. M. Yurcisin, 2nd, D. L. Crimmins, D. E.

Hourcade and J. P. Atkinson (1999). "Decay accelerating activity of complement receptor type 1

(CD35). Two active sites are required for dissociating C5 convertases." J Biol Chem 274(44):

31160-8.

Kuhn, S. and P. F. Zipfel (1996). "Mapping of the domains required for decay acceleration activity of the

human factor H-like protein 1 and factor H." Eur J Immunol 26(10): 2383-7.

Kulski, J. K., S. Gaudieri, M. Bellgard, L. Balmer, K. Giles, H. Inoko and R. L. Dawkins (1997). "The

evolution of MHC diversity by segmental duplication and transposition of retroelements." J Mol

Evol 45(6): 599-609.

Kulski, J. K., S. Gaudieri and R. L. Dawkins (2000). "Using alu J elements as molecular clocks to trace

the evolutionary relationships between duplicated HLA class I genomic segments." J Mol Evol

50(6): 510-9.

Page 226: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 272 -

Kulski, J. K., S. Gaudieri, H. Inoko and R. L. Dawkins (1999a). "Comparison between two human

endogenous retrovirus (HERV)-rich regions within the major histocompatibility complex." J

Mol Evol 48(6): 675-83.

Kulski, J. K., S. Gaudieri, A. Martin and R. L. Dawkins (1999b). "Coevolution of PERB11 (MIC) and

HLA class I genes with HERV-16 and retroelements by extended genomic duplication." J Mol

Evol 49(1): 84-97.

Kumar, S., K. Tamura, I. B. Jakobsen and M. Nei (2001). "MEGA2: molecular evolutionary genetics

analysis software." Bioinformatics 17(12): 1244-5.

Kuttner-Kondo, L., M. E. Medof, W. Brodbeck and M. Shoham (1996). "Molecular modeling and

mechanism of action of human decay-accelerating factor." Protein Eng 9(12): 1143-9.

Lai, E., J. Riley, I. Purvis and A. Roses (1998). "A 4-Mb high-density single nucleotide polymorphism-

based map around human APOE." Genomics 54(1): 31-8.

Leelayuwat, C., L. J. Abraham, H. Tabarias, F. T. Christiansen and R. L. Dawkins (1992). "Genomic

organization of a polymorphic duplicated region centromeric of HLA-B." Immunogenetics

36(4): 208-12.

Leelayuwat, C., D. C. Townend, M. A. Degli-Esposti, L. J. Abraham and R. L. Dawkins (1994). "A new

polymorphic and multicopy MHC gene family related to nonmammalian class I."

Immunogenetics 40(5): 339-51.

Li, W. H. and L. A. Sadler (1991). "Low nucleotide diversity in man." Genetics 129(2): 513-23.

Lindahl, G., U. Sjobring and E. Johnsson (2000). "Human complement regulators: a major target for

pathogenic microorganisms." Curr Opin Immunol 12(1): 44-51.

Liszewski, M. K., T. C. Farries, D. M. Lublin, I. A. Rooney and J. P. Atkinson (1996). "Control of the

complement system." Adv Immunol 61: 201-83.

Liszewski, M. K., M. Leung, W. Cui, V. B. Subramanian, J. Parkinson, P. N. Barlow, M. Manchester and

J. P. Atkinson (2000). "Dissecting sites important for complement regulatory activity in

membrane cofactor protein (MCP; CD46)." J Biol Chem 275(48): 37692-701.

Liszewski, M. K., T. W. Post and J. P. Atkinson (1991). "Membrane cofactor protein (MCP or CD46):

newest member of the regulators of complement activation gene cluster." Annu Rev Immunol 9:

431-55.

Logar, C. M., W. Chen, H. Schmitt, C. Y. Yu and D. J. Birmingham (2004). "A human CR1-like

transcript containing sequence for a binding protein for iC4 is expressed in hematopoietic and

fetal lymphoid tissue." Mol Immunol 40(11): 831-40.

Longman-Jacobsen, N., J. F. Williamson, R. L. Dawkins and S. Gaudieri (2003). "In polymorphic

genomic regions indels cluster with nucleotide polymorphism: Quantum Genomics." Gene 312:

257-61.

Page 227: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 273 -

Lowell, C. A., L. B. Klickstein, R. H. Carter, J. A. Mitchell, D. T. Fearon and J. M. Ahearn (1989).

"Mapping of the Epstein-Barr virus and C3dg binding sites to a common domain on complement

receptor type 2." J Exp Med 170(6): 1931-46.

Lublin, D. M., M. K. Liszewski, T. W. Post, M. A. Arce, M. M. Le Beau, M. B. Rebentisch, L. S.

Lemons, T. Seya and J. P. Atkinson (1988). "Molecular cloning and chromosomal localization

of human membrane cofactor protein (MCP). Evidence for inclusion in the multigene family of

complement-regulatory proteins." J Exp Med 168(1): 181-94.

Manchester, M., A. Valsamakis, R. Kaufman, M. K. Liszewski, J. Alvarez, J. P. Atkinson, D. M. Lublin

and M. B. Oldstone (1995). "Measles virus and C3 binding sites are distinct on membrane

cofactor protein (CD46)." Proc Natl Acad Sci U S A 92(6): 2303-7.

Marschang, P., J. Sodroski, R. Wurzner and M. P. Dierich (1995). "Decay-accelerating factor (CD55)

protects human immunodeficiency virus type 1 from inactivation by human complement." Eur J

Immunol 25(1): 285-90.

Marshall, B., C. Leelayuwat, M. A. Degli-Esposti, M. Pinelli, L. J. Abraham and R. L. Dawkins (1993).

"New major histocompatibility complex genes." Hum Immunol 38(1): 24-9.

Martinez, O. P., N. Longman-Jacobsen, R. Davies, E. K. Chung, Y. Yang, S. Gaudieri, R. L. Dawkins

and C. Y. Yu (2001). "Genetics of human complement component C4 and evolution the central

MHC." Front Biosci 6: D904-13.

Martinez, O. P., C. S. Witt, G. Tay, F. T. Christiansen and R. L. Dawkins (1994). "Immunogenetic

analysis of successful and rejected bone marrow grafts within one family." European Journal of

Immunogenetics 21(5): 365-72.

Matsumoto, A. K., J. Kopicky-Burd, R. H. Carter, D. A. Tuveson, T. F. Tedder and D. T. Fearon (1991).

"Intersection of the complement and immune systems: a signal transduction complex of the B

lymphocyte-containing complement receptor type 2 and CD19." J Exp Med 173(1): 55-64.

McKenzie, R., G. J. Kotwal, B. Moss, C. H. Hammer and M. M. Frank (1992). "Regulation of

complement activity by vaccinia virus complement-control protein." J Infect Dis 166(6): 1245-

50.

McLure, C., P. Kesners, S. Lester, D. Male, C. Amadou, J. Dawkins, B. Stewart, J. Williamson and R.

Dawkins (Submitted-a). "Haplotyping of the canine MHC without the need for DLA typing."

McLure, C., J. Williamson, L. Smyth, S. Agrawal, S. Lester, J. Millman, P. Keating, B. Stewart and R.

Dawkins (Submitted-b). "Extensive genomic and functional polymorphism of Complement

Control Proteins."

McLure, C. A., R. L. Dawkins, J. F. Williamson, R. A. Davies, J. Berry, L. J. Natalie, R. Laird and S.

Gaudieri (2004a). "Amino acid patterns within short consensus repeats define conserved

duplicons shared by genes of the RCA complex." J Mol Evol 59(2): 143-57.

Page 228: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 274 -

McLure, C. A., J. F. Williamson, B. J. Stewart, P. J. Keating and R. L. Dawkins (2004b). "Genomic

analysis reveals a duplication of eight rather than seven short consensus repeats in primate CR1

and CR1L: evidence for an additional set shared between CR1 and CR2." Immunogenetics

56(9): 631-8.

McLure, C. A., J. F. Williamson, B. J. Stewart, P. J. Keating and R. L. Dawkins (2005). "Indels and

imperfect duplication have driven the evolution of human Complement Receptor 1 (CR1) and

CR1-like from their precursor CR1 alpha: Importance of functional sets." Hum Immunol 66(3):

258-73.

Medof, M. E., T. Kinoshita and V. Nussenzweig (1984). "Inhibition of complement activation on the

surface of cells after incorporation of decay-accelerating factor (DAF) into their membranes." J

Exp Med 160(5): 1558-78.

Medof, M. E., D. M. Lublin, V. M. Holers, D. J. Ayers, R. R. Getty, J. F. Leykam, J. P. Atkinson and M.

L. Tykocinski (1987). "Cloning and characterization of cDNAs encoding the complete sequence

of decay-accelerating factor of human complement." Proc Natl Acad Sci U S A 84(7): 2007-11.

Mikata, S., S. Miyagawa, K. Iwata, S. Nagasawa, M. Hatanaka, M. Matsumoto, W. Kamiike, H. Matsuda,

R. Shirakura and T. Seya (1998). "Regulation of complement-mediated swine endothelial cell

lysis by a surface-bound form of human C4b binding protein." Transplantation 65(3): 363-8.

Miller, C. G., S. N. Shchelkunov and G. J. Kotwal (1997). "The cowpox virus-encoded homolog of the

vaccinia virus complement control protein is an inflammation modulatory protein." Virology

229(1): 126-33.

Mold, C., B. M. Bradt, G. R. Nemerow and N. R. Cooper (1988). "Epstein-Barr virus regulates activation

and processing of the third component of complement." J Exp Med 168(3): 949-69.

Molina, H., C. Brenner, S. Jacobi, J. Gorka, J. C. Carel, T. Kinoshita and V. M. Holers (1991). "Analysis

of Epstein-Barr virus-binding sites on complement receptor 2 (CR2/CD21) using human-mouse

chimeras and peptides. At least two distinct sites are necessary for ligand-receptor interaction." J

Biol Chem 266(19): 12173-9.

Molina, H., T. Kinoshita, C. B. Webster and V. M. Holers (1994). "Analysis of C3b/C3d binding sites

and factor I cofactor regions within mouse complement receptors 1 and 2." J Immunol 153(2):

789-95.

Molina, H., S. J. Perkins, J. Guthridge, J. Gorka, T. Kinoshita and V. M. Holers (1995). "Characterization

of a complement receptor 2 (CR2, CD21) ligand binding site for C3. An initial model of ligand

interaction with two linked short consensus repeat modules." J Immunol 154(10): 5426-35.

Montefiori, D. C., R. J. Cornell, J. Y. Zhou, J. T. Zhou, V. M. Hirsch and P. R. Johnson (1994).

"Complement control proteins, CD46, CD55, and CD59, as common surface constituents of

human and simian immunodeficiency viruses and possible targets for vaccine protection."

Virology 205(1): 82-92.

Page 229: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 275 -

Moore, M. D., N. R. Cooper, B. F. Tack and G. R. Nemerow (1987). "Molecular cloning of the cDNA

encoding the Epstein-Barr virus/C3d receptor (complement receptor type 2) of human B

lymphocytes." Proc Natl Acad Sci U S A 84(24): 9194-8.

Moore, M. D., R. G. DiScipio, N. R. Cooper and G. R. Nemerow (1989). "Hydrodynamic, electron

microscopic, and ligand-binding analysis of the Epstein-Barr virus/C3dg receptor (CR2)." J Biol

Chem 264(34): 20576-82.

Moulds, J. M., J. D. Reveille and F. C. Arnett (1996). "Structural polymorphisms of complement receptor

1 (CR1) in systemic lupus erythematosus (SLE) patients and normal controls of three ethnic

groups." Clin Exp Immunol 105(2): 302-5.

Moulds, J. M., B. J. Thomas, O. Doumbo, D. A. Diallo, K. E. Lyke, C. V. Plowe, J. A. Rowe and D. J.

Birmingham (2004). "Identification of the Kna/Knb polymorphism and a method for Knops

genotyping." Transfusion 44(2): 164-9.

Moulds, J. M., P. A. Zimmerman, O. K. Doumbo, L. Kassambara, I. Sagara, D. A. Diallo, J. P. Atkinson,

M. Krych-Goldberg, R. E. Hauhart, D. E. Hourcade, D. T. McNamara, D. J. Birmingham, J. A.

Rowe, J. J. Moulds and L. H. Miller (2001). "Molecular identification of Knops blood group

polymorphisms found in long homologous region D of complement receptor 1." Blood 97(9):

2879-85.

Nagar, B., R. G. Jones, R. J. Diefenbach, D. E. Isenman and J. M. Rini (1998). "X-ray crystal structure of

C3d: a C3 fragment and ligand for complement receptor 2." Science 280(5367): 1277-81.

Naniche, D., G. Varior-Krishnan, F. Cervoni, T. F. Wild, B. Rossi, C. Rabourdin-Combe and D. Gerlier

(1993). "Human membrane cofactor protein (CD46) acts as a cellular receptor for measles

virus." J Virol 67(10): 6025-32.

Neeleman, C., S. P. Geelen, P. C. Aerts, M. R. Daha, T. E. Mollnes, J. J. Roord, G. Posthuma, H. van

Dijk and A. Fleer (1999). "Resistance to both complement activation and phagocytosis in type 3

pneumococci is mediated by the binding of complement regulatory protein factor H." Infect

Immun 67(9): 4517-24.

Nemerow, G. R., C. Mold, V. K. Schwend, V. Tollefson and N. R. Cooper (1987). "Identification of

gp350 as the viral glycoprotein mediating attachment of Epstein-Barr virus (EBV) to the

EBV/C3d receptor of B cells: sequence homology of gp350 and C3 complement fragment C3d."

J Virol 61(5): 1416-20.

Nicholson-Weller, A., J. Burge and K. F. Austen (1981). "Purification from guinea pig erythrocyte stroma

of a decay-accelerating factor for the classical c3 convertase, C4b,2a." J Immunol 127(5): 2035-

9.

Nicholson-Weller, A., J. Burge, D. T. Fearon, P. F. Weller and K. F. Austen (1982). "Isolation of a

human erythrocyte membrane glycoprotein with decay-accelerating activity for C3 convertases

of the complement system." J Immunol 129(1): 184-9.

Page 230: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 276 -

Nowicki, B., A. Hart, K. E. Coyne, D. M. Lublin and S. Nowicki (1993). "Short consensus repeat-3

domain of recombinant decay-accelerating factor is recognized by Escherichia coli recombinant

Dr adhesin in a model of a cell-cell interaction." J Exp Med 178(6): 2115-21.

Okada, N., M. K. Liszewski, J. P. Atkinson and M. Caparon (1995). "Membrane cofactor protein (CD46)

is a keratinocyte receptor for the M protein of the group A streptococcus." Proc Natl Acad Sci U

S A 92(7): 2489-93.

Ollier, W. E., L. J. Kennedy, W. Thomson, A. N. Barnes, S. C. Bell, D. Bennett, J. M. Angles, J. F. Innes

and S. D. Carter (2001). "Dog MHC alleles containing the human RA shared epitope confer

susceptibility to canine rheumatoid arthritis." Immunogenetics 53(8): 669-73.

Pahler, S., B. Blumbach, I. Muller and W. E. Muller (1998). "Putative multiadhesive protein from the

marine sponge Geodia cydonium: cloning of the cDNA encoding a fibronectin-, an SRCR-, and

a complement control protein module." J Exp Zool 282(3): 332-43.

Pardo-Manuel, F., J. Rey-Campos, A. Hillarp, B. Dahlback and S. Rodriguez de Cordoba (1990).

"Human genes for the alpha and beta chains of complement C4b-binding protein are closely

linked in a head-to-tail arrangement." Proc Natl Acad Sci U S A 87(12): 4529-32.

Peiffer, I., A. L. Servin and M. F. Bernet-Camard (1998). "Piracy of decay-accelerating factor (CD55)

signal transduction by the diffusely adhering strain Escherichia coli C1845 promotes

cytoskeletal F-actin rearrangements in cultured human intestinal INT407 cells." Infect Immun

66(9): 4036-42.

Pham, T., A. Kaul, A. Hart, P. Goluszko, J. Moulds, S. Nowicki, D. M. Lublin and B. J. Nowicki (1995).

"dra-related X adhesins of gestational pyelonephritis-associated Escherichia coli recognize SCR-

3 and SCR-4 domains of recombinant decay-accelerating factor." Infect Immun 63(5): 1663-8.

Prechl, J. and A. Erdei (2000). "Immunomodulatory functions of murine CR1/2." Immunopharmacology

49(1-2): 117-24.

Ram, S., F. G. Mackinnon, S. Gulati, D. P. McQuillen, U. Vogel, M. Frosch, C. Elkins, H. K.

Guttormsen, L. M. Wetzler, M. Oppermann, M. K. Pangburn and P. A. Rice (1999). "The

contrasting mechanisms of serum resistance of Neisseria gonorrhoeae and group B Neisseria

meningitidis." Mol Immunol 36(13-14): 915-28.

Ram, S., D. P. McQuillen, S. Gulati, C. Elkins, M. K. Pangburn and P. A. Rice (1998a). "Binding of

complement factor H to loop 5 of porin protein 1A: a molecular mechanism of serum resistance

of nonsialylated Neisseria gonorrhoeae." J Exp Med 188(4): 671-80.

Ram, S., A. K. Sharma, S. D. Simpson, S. Gulati, D. P. McQuillen, M. K. Pangburn and P. A. Rice

(1998b). "A novel sialic acid binding site on factor H mediates serum resistance of sialylated

Neisseria gonorrhoeae." J Exp Med 187(5): 743-52.

Reid, K. B. (1986). "Activation and control of the complement system." Essays Biochem 22: 27-68.

Page 231: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 277 -

Reid, K. B. and A. J. Day (1989). "Structure-function relationships of the complement components."

Immunol Today 10(6): 177-80.

Rey-Campos, J., D. Baeza-Sanz and S. Rodriguez de Cordoba (1990). "Physical linkage of the human

genes coding for complement factor H and coagulation factor XIII B subunit." Genomics 7(4):

644-6.

Rey-Campos, J., P. Rubinstein and S. Rodriguez de Cordoba (1987). "Decay-accelerating factor. Genetic

polymorphism and linkage to the RCA (regulator of complement activation) gene cluster in

humans." J Exp Med 166(1): 246-52.

Rey-Campos, J., P. Rubinstein and S. Rodriguez de Cordoba (1988). "A physical map of the human

regulator of complement activation gene cluster linking the complement genes CR1, CR2, DAF,

and C4BP." J Exp Med 167(2): 664-9.

Robinson, W. E., Jr., D. C. Montefiori and W. M. Mitchell (1988). "Antibody-dependent enhancement of

human immunodeficiency virus type 1 infection." Lancet 1(8589): 790-4.

Rodriguez de Cordoba, S., M. A. Diaz-Guillen and D. Heine-Suner (1999). "An integrated map of the

human regulator of complement activation (RCA) gene cluster on 1q32." Mol Immunol 36(13-

14): 803-8.

Rodriguez de Cordoba, S., D. M. Lublin, P. Rubinstein and J. P. Atkinson (1985). "Human genes for

three complement components that regulate the activation of C3 are tightly linked." J Exp Med

161(5): 1189-95.

Rodriguez de Cordoba, S. and P. Rubinstein (1987). "New alleles of C4-binding protein and factor H and

further linkage data in the regulator of complement activation (RCA) gene cluster in man."

Immunogenetics 25(4): 267-8.

Rosengard, A. M., Y. Liu, Z. Nie and R. Jimenez (2002). "Variola virus immune evasion design:

expression of a highly efficient inhibitor of human complement." Proc Natl Acad Sci U S A

99(13): 8808-13.

Rowe, J. A., J. M. Moulds, C. I. Newbold and L. H. Miller (1997). "P. falciparum rosetting mediated by a

parasite-variant erythrocyte membrane protein and complement-receptor 1." Nature 388(6639):

292-5.

Rowen, L., B. F. Koop and L. Hood (1996). "The complete 685-kilobase DNA sequence of the human

beta T cell receptor locus." Science 272(5269): 1755-62.

Rutherford, K., J. Parkhill, J. Crook, T. Horsnell, P. Rice, M. A. Rajandream and B. Barrell (2000).

"Artemis: sequence visualization and annotation." Bioinformatics 16(10): 944-5.

Sahu, A., S. N. Isaacs, A. M. Soulika and J. D. Lambris (1998). "Interaction of vaccinia virus

complement control protein with human complement proteins: factor I-mediated degradation of

C3b to iC3b1 inactivates the alternative complement pathway." J Immunol 160(11): 5596-604.

Page 232: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 278 -

Santoro, F., P. E. Kennedy, G. Locatelli, M. S. Malnati, E. A. Berger and P. Lusso (1999). "CD46 is a

cellular receptor for human herpesvirus 6." Cell 99(7): 817-27.

Sargiacomo, M., M. Sudol, Z. Tang and M. P. Lisanti (1993). "Signal transducing molecules and

glycosyl-phosphatidylinositol-linked proteins form a caveolin-rich insoluble complex in MDCK

cells." J Cell Biol 122(4): 789-807.

Satta, Y., Y. J. Li and N. Takahata (1998). "The neutral theory and natural selection in the HLA region."

Front Biosci 3: d459-67.

Sayer, D., R. Whidborne, B. Brestovac, F. Trimboli, C. Witt and F. Christiansen (2001). "HLA-DRB1

DNA sequencing based typing: an approach suitable for high throughput typing including

unrelated bone marrow registry donors." Tissue Antigens 57(1): 46-54.

Scharfstein, J., A. Ferreira, I. Gigli and V. Nussenzweig (1978). "Human C4-binding protein. I. Isolation

and characterization." J Exp Med 148(1): 207-22.

Schmitz, J., J. P. Zimmer, B. Kluxen, S. Aries, M. Bogel, I. Gigli and H. Schmitz (1995). "Antibody-

dependent complement-mediated cytotoxicity in sera from patients with HIV-1 infection is

controlled by CD55 and CD59." J Clin Invest 96(3): 1520-6.

Schwarzenbacher, R., K. Zeth, K. Diederichs, A. Gries, G. M. Kostner, P. Laggner and R. Prassl (1999).

"Crystal structure of human beta2-glycoprotein I: implications for phospholipid binding and the

antiphospholipid syndrome." Embo J 18(22): 6228-39.

Shafren, D. R., D. J. Dorahy, R. A. Ingham, G. F. Burns and R. D. Barry (1997). "Coxsackievirus A21

binds to decay-accelerating factor but requires intercellular adhesion molecule 1 for cell entry." J

Virol 71(6): 4736-43.

Shafren, D. R., D. J. Dorahy, R. F. Thorne, T. Kinoshita, R. D. Barry and G. F. Burns (1998). "Antibody

binding to individual short consensus repeats of decay-accelerating factor enhances enterovirus

cell attachment and infectivity." J Immunol 160(5): 2318-23.

Sharma, A. K. and M. K. Pangburn (1996). "Identification of three physically and functionally distinct

binding sites for C3b in human complement factor H by deletion mutagenesis." Proc Natl Acad

Sci U S A 93(20): 10996-1001.

Sharma, A. K. and M. K. Pangburn (1997). "Localization by site-directed mutagenesis of the site in

human complement factor H that binds to Streptococcus pyogenes M protein." Infect Immun

65(2): 484-7.

Smith, B. O., R. L. Mallin, M. Krych-Goldberg, X. Wang, R. E. Hauhart, K. Bromek, D. Uhrin, J. P.

Atkinson and P. N. Barlow (2002). "Structure of the C3b binding site of CR1 (CD35), the

immune adherence receptor." Cell 108(6): 769-80.

Sonnhammer, E. L. and R. Durbin (1995). "A dot-matrix program with dynamic threshold control suited

for genomic DNA and protein sequence analysis." Gene 167(1-2): GC1-10.

Page 233: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 279 -

Spear, G. T., M. Hart, G. G. Olinger, F. B. Hashemi and M. Saifuddin (2001). "The role of the

complement system in virus infections." Curr Top Microbiol Immunol 260: 229-45.

Spear, G. T., N. S. Lurain, C. J. Parker, M. Ghassemi, G. H. Payne and M. Saifuddin (1995). "Host cell-

derived complement control proteins CD55 and CD59 are incorporated into the virions of two

unrelated enveloped viruses. Human T cell leukemia/lymphoma virus type I (HTLV-I) and

human cytomegalovirus (HCMV)." J Immunol 155(9): 4376-81.

Spiller, O. B., M. Robinson, E. O'Donnell, S. Milligan, B. P. Morgan, A. J. Davison and D. J. Blackbourn

(2003). "Complement regulation by Kaposi's sarcoma-associated herpesvirus ORF4 protein." J

Virol 77(1): 592-9.

Stoiber, H., A. Clivio and M. P. Dierich (1997). "Role of complement in HIV infection." Annu Rev

Immunol 15: 649-74.

Stoiber, H., L. Kacani, C. Speth, R. Wurzner and M. P. Dierich (2001). "The supportive role of

complement in HIV pathogenesis." Immunol Rev 180: 168-76.

Stoiber, H., C. Pinter, A. G. Siccardi, A. Clivio and M. P. Dierich (1996). "Efficient destruction of human

immunodeficiency virus in human serum by inhibiting the protective action of complement

factor H and decay accelerating factor (DAF, CD55)." J Exp Med 183(1): 307-10.

Stoiber, H., R. Schneider, J. Janatova and M. P. Dierich (1995). "Human complement proteins C3b, C4b,

factor H and properdin react with specific sites in gp120 and gp41, the envelope proteins of

HIV-1." Immunobiology 193(1): 98-113.

Szakonyi, G., J. M. Guthridge, D. Li, K. Young, V. M. Holers and X. S. Chen (2001). "Structure of

complement receptor 2 in complex with its C3d ligand." Science 292(5522): 1725-8.

Tanner, J., J. Weis, D. Fearon, Y. Whang and E. Kieff (1987). "Epstein-Barr virus gp350/220 binding to

the B lymphocyte C3d receptor mediates adsorption, capping, and endocytosis." Cell 50(2): 203-

13.

Tay, G. K., C. S. Witt, F. T. Christiansen, D. Charron, D. Baker, R. Herrmann, L. K. Smith, D.

Diepeveen, S. Mallal, J. McCluskey and et al. (1995a). "Matching for MHC haplotypes results in

improved survival following unrelated bone marrow transplantation." Bone Marrow Transplant

15(3): 381-5.

Tay, G. K., C. S. Witt, F. T. Christiansen, J. M. Corbett and R. L. Dawkins (1995b). "The identification

of MHC identical siblings without HLA typing." Exp Hematol 23(14): 1655-60.

Thern, A., L. Stenberg, B. Dahlback and G. Lindahl (1995). "Ig-binding surface proteins of Streptococcus

pyogenes also bind human C4b-binding protein (C4BP), a regulatory component of the

complement system." J Immunol 154(1): 375-86.

Thern, A., M. Wastfelt and G. Lindahl (1998). "Expression of two different antiphagocytic M proteins by

Streptococcus pyogenes of the OF+ lineage." J Immunol 160(2): 860-9.

Page 234: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 280 -

Tokunaga, K., G. Saueracker, P. H. Kay, F. T. Christiansen, R. Anand and R. L. Dawkins (1988).

"Extensive deletions and insertions in different MHC supratypes detected by pusled field gel

electrophoresis." J Exp Med 168(3): 933-40.

Uhrinova, S., F. Lin, G. Ball, K. Bromek, D. Uhrin, M. E. Medof and P. N. Barlow (2003). "Solution

structure of a functionally active fragment of decay-accelerating factor." Proc Natl Acad Sci U S

A 100(8): 4718-23.

Vanderplasschen, A., E. Mathew, M. Hollinshead, R. B. Sim and G. L. Smith (1998). "Extracellular

enveloped vaccinia virus is resistant to complement because of incorporation of host

complement control proteins into its envelope." Proc Natl Acad Sci U S A 95(13): 7544-9.

Wagner, J. L. (2003). "Molecular organization of the canine major histocompatibility complex." J Hered

94(1): 23-6.

Wagner, J. L., R. C. Burnett, S. A. DeRose, L. V. Francisco, R. Storb and E. A. Ostrander (1996).

"Histocompatibility testing of dog families with highly polymorphic microsatellite markers."

Transplantation 62(6): 876-7.

Walport, M. J., G. D. Ross, C. Mackworth-Young, J. V. Watson, N. Hogg and P. J. Lachmann (1985).

"Family studies of erythrocyte complement receptor type 1 levels: reduced levels in patients with

SLE are acquired, not inherited." Clin Exp Immunol 59(3): 547-54.

Wei, X., M. Orchardson, J. A. Gracie, B. P. Leung, B. Gao, H. Guan, W. Niedbala, G. K. Paterson, I. B.

McInnes and F. Y. Liew (2001). "The Sushi domain of soluble IL-15 receptor alpha is essential

for binding IL-15 and inhibiting inflammatory and allogenic responses in vitro and in vivo." J

Immunol 167(1): 277-82.

Weis, J. H., C. C. Morton, G. A. Bruns, J. J. Weis, L. B. Klickstein, W. W. Wong and D. T. Fearon

(1987). "A complement receptor locus: genes encoding C3b/C4b receptor and C3d/Epstein-Barr

virus receptor map to 1q32." J Immunol 138(1): 312-5.

Williamson, J. F., S. Gaudieri, N. Longman-Jacobsen and R. L. Dawkins (2003). The use and abuse of

microsatellites markers. HLA 2002 Immunobiology of the Human MHC, Seattle, IHWG Press,

in press.

Witt, C., D. Sayer, F. Trimboli, M. Saw, R. Herrmann, P. Cannell, D. Baker and F. Christiansen (2000).

"Unrelated donors selected prospectively by block-matching have superior bone marrow

transplant outcome." Hum Immunol 61(2): 85-91.

Wong, W. W. (1990). "Structural and functional correlation of the human complement receptor type 1." J

Invest Dermatol 94(6 Suppl): 64S-7S.

Wu, X., W. J. Zhang, C. S. Witt, L. J. Abraham, F. T. Christiansen and R. L. Dawkins (1992).

"Haplospecific polymorphism between HLA B and tumor necrosis factor." Hum Immunol 33(2):

89-97.

Page 235: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 281 -

Xiang, L., J. R. Rundles, D. R. Hamilton and J. G. Wilson (1999). "Quantitative alleles of CR1: coding

sequence analysis and comparison of haplotypes in two ethnic groups." J Immunol 163(9): 4939-

45.

Xu, C., D. Mao, V. M. Holers, B. Palanca, A. M. Cheng and H. Molina (2000). "A critical role for murine

complement regulator crry in fetomaternal tolerance." Science 287(5452): 498-501.

Yunis, E. J., Z. Awdeh, D. Raum and C. A. Alper (1983). "The MHC in human bone marrow

allotransplantation." Clin Haematol 12(3): 641-80.

Yunis, E. J., C. E. Larsen, M. Fernandez-Vina, Z. L. Awdeh, T. Romero, J. A. Hansen and C. A. Alper

(2003). "Inheritable variable sizes of DNA stretches in the human MHC: conserved extended

haplotypes and their fragments or blocks." Tissue Antigens 62(1): 1-20.

Zhang, W. J., M. A. Degli-Esposti, T. J. Cobain, P. U. Cameron, F. T. Christiansen and R. L. Dawkins

(1990). "Differences in gene copy number carried by different MHC ancestral haplotypes.

Quantitation after physical separation of haplotypes by pulsed field gel electrophoresis." J Exp

Med 171(6): 2101-14.

Page 236: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 282 -

Page 237: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 283 -

EXAMINERS REPORTS

Page 238: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 284 -

Page 239: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 285 -

Examiner 1: Professor Sir Peter Lachmann Report for the University of Western Australia on the thesis "Duplication and Polymorphism with particular reference to Regulators of Complement Activation" submitted for the degree of Doctor of Philosophy by Craig Anthony McLure Mr McLure comes to his work on the RCA locus from a background of MHC genetics. He is well versed in this subject and particularly in the approaches pioneered by Roger Dawkins and his colleagues. In this thesis he describes the application of this same approach to the locus on chromosome Clq coding for the so-called "regulators of complement activation" (RCA). This is an intriguing idea even though there maybe significant differences between these two loci. The long held theory for explaining the maintenance of "extended haplotypes" within the MHC is that there may be selective advantage in holding together on a single haplotype alleles at MHC Class 1 loci and MHC Class 2 loci whose combined presence is advantageous for resistance to particular pathogens. There is, to the best of my knowledge, no evidence of this kind for the RCA locus. I am not aware of, nor does Mr. McLure quote, any studies which suggest that combining particular alleles on different molecules of the RCA cluster carries any selective advantage. For this reason one would not be surprised if the evolution of these two loci had followed rather different paths. In his Introduction, he first discusses the mechanisms by which genes and gene loci evolve describing the particular importance of duplications and of insertions and deletions (indels in the local terminology) basing this on work on the MHC. He also defines the blocks of genes in which recombination seems to be suppressed which are locally called "polymorphic frozen blocks". This part of the review focuses quite largely on the work for which the Dawkins group are well known and is well done. I do have one reservation which I think it would be interesting to raise with the candidate. In another complement locus, the one that codes for C6 and C7 and, at a greater distance, C9 - components that also gene duplicates - there is strong evidence from the mapping of intron/exon boundaries in these genes that, at any rate in "recent" evolution, the progression has been from larger molecules to smaller molecules, i.e. that the build-up of the large molecule is an early event and that the smaller molecules are derived from it by duplication and trimming rather than by being built up from a smaller precursor. However such evolutionary models are all, to some extent, speculative; and the arguments raised by Mr McLure are quite plausible. The RCA proteins are made up of linear sequences of a particular protein domain which is usually called the "short consensus repeat" (or SCR). As Mr McLure notes, this is of ancient origin and can be found even in invertebrates that have no functioning complement system. It is also found in man in a number of complement proteins that are not regulators of complement activation. Thus there are SCRs in the sequentially acting, duplicated proteins Clr and Cls and in the sequentially acting, duplicated proteins C6 and C7; as well as in the duplicated proteins C2 and Factor B which are homologues in the classical and alternative

Page 240: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 286 -

complement activation pathways. These SCRs are given little attention and only those in C2 and Factor B are even mentioned. Linear oligomers of SCRs - four or more - are characteristic of the proteins of the RCA locus. The background review of complement and of the complement control proteins and their functions show some lack of depth. Most of the discussion is based on reviews rather than on the original papers, something which I think is not wholly desirable in a PhD thesis. The NMR studies of the structure of CCPs (carried out, for example, by Campbell and his group in Oxford) which give some explanation why it is groups of SCRs rather than any individual SCR that carry function are not mentioned In Table 5 there are some errors of fact. It is stated that C4bp competes with Factor B for binding to C4b. Factor B does not bind to C4b - it is C2 that binds to C4b in the classical pathway. Mr McLure may not appreciate that CR2 is not a complement control protein at all, having neither co-factor activity for Factor I nor "Decay Accelerating" activity. Its role is as a complement receptor on B cells which is involved in cell triggering. Again, he says that it binds C4dg. I don't believe that C4dg is a described fragment, and it is C3dg that is intended on Figure 5. On the same table, he does not appreciate that CR1 is also a co-factor, and indeed the sole co-factor, for the Factor I mediated proteolysis of iC3b to C3dg, and that this is a very important function of CR1. It would also be worth pointing out that the function of CR1 is rather different in primates where it is present on erythrocytes and plays a major role in carrying immune complexes round the body, than in other mammals where CR1 is not found on red cells. It may be found on platelets as in rabbits, or it may be found on neither platelets nor red cells, as is the case in mice. Although mice do have CR1 and DAF and MCP, the major CCP is the molecule Crry, which is not seen in primates. On page 39 he comments on diverse functions of CCPs. One is their role as viral receptors, which is usually independent of the presence of complement. There can be few cell surface molecules that some virus does not use as a receptor! In other cases, however, HIV being an example, it is C3 fragments fixed on the virus that bind CR2 on B cells or follicular dendritic cells and lead to their infection. Maintaining feto-maternal "tolerance" depends on the prevention complement mediated damage in the placenta. The double "knock-out" of C3 as well as of Crry is normally fertile. It is only the animals that have a functioning complement system but lack Crry that lose their fetuses. The section on the RCA disease associations is really rather inadequate. The evidence that genetic polymorphism in CR1 is associated with lupus is weak. CR1 number polymorphism (detected by the associated DNA pattern) does not appear to be associated with disease nor does its size polymorphism. The substantial decrease in CR1 number on the red cells of patients with immune complex diseases is certainly acquired, and this is no longer a controversy. The decrease is seen not only in diseases like lupus but, very markedly, in cold autoantibody haemolytic anaemia where there is extensive complement fixation on the surface of red cells. There are more interesting associations which are not discussed. These largely concern Factor H where deficiency or mutations (often of the C-terminal part of the molecule) are associated with familial haemolytic uraemic syndrome (HUS). Similar associations occur with mutations in MCP and in factor I itself, suggesting

Page 241: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 287 -

that failure of control of the alternative complement pathway does play a real part in the genesis of familial HUS. Much more recently, (and probably too late for comment in this thesis), are the observations that there is a strong associations between a common polymorphism in Factor H and macular degeneration. Finally, in this section Mr McLure reviews the role of CCPs in microbial virulence and cites a long list of observations. Viruses and bacteria use CCPs quite differently. Bacteria fix soluble mammalian CCPs, (Factor H or C4 binding protein) from the plasma and bind them to "receptors" on their surfaces thereby achieving protection from complement attack. Viruses do not do this. They either incorporate host CCPs during budding, (this is a property largely of retroviruses but it is also seen in one form of budding vaccinia); or - in the large DNA viruses such as pox viruses and herpes viruses - frequently encode their own CCPs. Sometimes these are based on the SCR domain, (Variola, vaccinia, Herpes virus saimiri, HHV 8) and probably represent gene capture from the host. On the other hand herpes simplex virus makes a CCP that is not based on the SCR although it has rather similar functional properties. This probably represents convergent evolution. The Chapters describing the work done. The first chapter is devoted to classifying the SCRs found in the RCA locus into families on the basis of sequence analysis and is presented as a paper in the Journal of Molecular Evolution. They were able to define eleven sub-families of SCR and to show that these occur in defined blocks in the different proteins. This is a valuable contribution and also helps to clarify the organisation of the membrane bound RCA proteins. Mr. McLure has produced a dendrogram suggesting how the different SCR sub-families are inter-related in CR1, the largest of the RCA proteins. The paper also describes comparisons of RCA proteins in different species on the basis of the SCR sub-families which shows conservation in primates and that mouse Crry seems to be the true homologue of human CR1 rather than the molecule known as mouse CR1. This accords with their relative functional importance in the two species. The second chapter is presented in the form of a paper in Immunogenetics, in which the authors demonstrate that the long homologous repeat in CR1 is made out of eight rather than seven SCRs in the gene. This also is a satisfying conclusion in that the oligomer sequence in the smaller RCA proteins is made up of four SCRs. The CR1 pattern is also found in the so-called CR1-like molecule and in chimpanzee CR1. The eighth SCR, which they describe as g-like, (i.e. like the g sub-family), is generally not expressed, although the reason for this is not immediately apparent. The third chapter, which is presented as a paper in Human Immunology, concerns the evolution of human CR1 and the CR1-like gene which they regard as both being derived from a precursor, which they call CRlalpha. They show that in human CR1 and CR1-like genes, and the corresponding syntenic genes in chimpanzees, baboons, rats and mice, the order of the sub-families of SCR are maintained. The results of these elaborate sequence analyses are complex and the authors point out that it is difficult to define the unit of duplication on account of changes due to degeneracy, deletions and imperfect duplication. In the fourth chapter, Mr. McLure goes on to study polymorphisms in the RCA locus (particularly in CR1) with the intention of defining ancestral haplotypes within the RCA

Page 242: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 288 -

locus. This has been done using the "genomic matching technique" (GMT) which they have developed for the MHC and which is based on the observation that high levels of polymorphism are "packaged" within duplicated sequences and that these packages are stable and inherited over long periods of time. This paper contains experimental work looking, by PCR, at a panel of human subjects varying in ethnic group and in disease status. Some twenty ancestral haplotypes have been identified including fifteen which comprising about three-quarters of those in the population., Some data is presented to suggest that haplotype frequencies differ from normal in a group of patients with recurrent spontaneous abortion or with psoriasis. These findings are preliminary and difficult to evaluate partly because of the lack of detail about how the groups are defined and how well they are matched to the controls and partly because of a lack of statistical evaluation. It is likely that these are hypothesis-generating findings rather than ones that allow the hypothesis to be tested. The final chapters are on a rather different topic, which is the application of the GMT technique to dogs in Chapter 5, and to cattle in Chapter 6. They are both concerned with the MHC rather than the RCA locus and are both quite brief. The principal conclusion of these studies is that the GMT technique is an effective alternative to more conventional methods to detect polymorphisms and identify haplotypes in highly polymorphic regions. Their future plans are to exploit their new data on the polymorphism of the a-block of the regulators of complement activation, i.e. the portion that contains the cell-bound activators to see whether they can detect important associations with auto-immune diseases. It is not quite clear to me why they are particularly interested in auto-immune diseases rather than looking at different infections in this connection; since it seems more likely that infection is the driver of such polymorphisms, as, indeed, it is in the MHC. However it is not self-evident that the induction of auto-immunity will form the negative aspect of this as it is does in the MHC. They also propose to try to find CCPs that have lost their capacity to act as receptors for individual viruses such as measles in the case of MCP, or Epstein-Barr virus in the case of CR2. I will watch the progress of these studies with great interest and if they succeed in showing a significant clinical utility for these polymorphisms, this will be a great advance. In summary, this is essentially a study of genomes using, in large part, published sequence data to identify family relationships within a particular genetic locus that codes essentially for a single protein domain in multiple duplications and to use techniques that have been used in the MHC for studying evolution and polymorphism at this locus. The work has been productive and has added significantly to knowledge of how the RCA is organised and may have evolved. There are also some leads that may point to clinical utility with a limited amount of experimental work, largely based on PCR, looking at the distribution of polymorphisms in different disease groups - work which is still at a preliminary stage. I would have preferred to discuss the issues that I have raised in this report with him face to face; and, because this is not possible, I have perhaps been rather more critical at certain points, particularly concerning the Introduction, than I would normally be where there is a viva. I think that the Introduction needs amending to correct some errors that have been pointed out.

Page 243: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 289 -

Subject to this being done, I am happy to recommend that Mr. McLure be awarded the degree of Ph.D. Peter Lachmann

Page 244: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 290 -

Examiner 2: Professor Edmond Yunis Thesis: "Duplication and polymorphism with particular reference to regulators of complement activation" By Craig Anthony McLure Thesis for degree of Doctor of Philosophy University of Western Australia. Comments:

The cluster of genes known as the Regulators of Complement Activation is located in chromosome 1 (1q32). This region contains several genes related to the complement system. The RCA genes are phylogenetically related with the complement control proteins (CCP) family, that prevent the activation of complement. The CCP family has been conserved in invertebrates and apparently have evolved in parallel with the complement family located in the major histocompatibility complex region. Complement activation is regulated by several mechanisms mediated by different RCA genes (FH, C4BP, DAP, CR2, CR1, MCP) and different SCRs and sub-families. Previous studies have reported the existence of DNA conserved or frozen blocks in this region, each block containing a specific set of linked genes. Also in this region several studies have identified SNPs in coding and non coding sequences of RCA cluster. Several studies have described a potential relevance of the RCA genes in the pathogenesis of autoimmune and infectious diseases. This doctoral thesis represents an important approach to analyze the structure of ancestral haplotypes using extensive genomic analysis methods to define the mechanisms involved in the evolution of RCA cluster.

The aims of this study were 1) the characterization of the relationship between the genes that code for the complement control-proteins (CCPs) and domains of the regulators of complement activation (RCA), 2) Analysis of the duplicated elements in the RCA region, 3) analysis of divergence in duplicated genes, 4) applicability of the Genomic Matching Technique (GMT) to asses the presence of conserved haplotypes, 4) evaluate the potential relevance of RCA conserved haplotypes with CCP implicated diseases, 6) applicability of the analysis of GMT in other species. In this study, the author demonstrates that duplication and divergence of particular sets of SCRs have contribute with the evolution of RCA cluster. It is important to mention that the published studies derived from this thesis shows that all SCRs can be classified in 11 subfamilies (a-k), in this context, events of duplication and deletion have contributed to the specialization or selection of SCRs sequences. Primate complement receptor 1 (CR1) and its. ligand (CR1L) contain a duplication of eight rather than seven SCRs, these differences can be explained by duplications, retroviral insertions and deletions. On the other hand, the GMT was used to identify polymorphisms in the RCA. With this method at least 20 conserved haplotypes for the RCA alpha block, were detected. Interestingly, the distribution of some haplotypes such as CR1.02 and CRX.08 differ in Psoriasis VuIgaris and Recurrent Spontaneous Abortion. Finally this analysis can be used in the detection of conserved haplotypes in other species genome.

This is a very good thesis and contributes significantly in the knowledge of this

field and the approach is innovative for mapping regions in the RCA cluster that could be relevant in the pathogenesis of autoimmune diseases and could help to understand the role of genome evolution in the regulation of immune system mechanisms. The

Page 245: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 291 -

author of this thesis (Craig Anthony McLure) is a very well qualified to conduct this research projects and the proposed research. I strongly recommend that Craig Anthony McLure receive the PhD degree by the University of Western Australia with Honors.

Professor Edmond J Yunis

Page 246: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 292 -

Examiner 3: Dr Daniel J Birmingham Thesis Report Thesis Title: Duplication and Polymorphism with Particular Reference to Regulators of Complement Activation Ph.D. Candidate: Craig A. McLure Reference number: 0179071 Examiner: Daniel J. Birmingham, Ph.D. Evaluation: The thesis entitled "Duplication and Polymorphism with Particular Reference to Regulators of Complement Activation" details the research performed by Craig A. McLure in fulfillment of the degree of Doctor of Philosophy. The following is an evaluation of this thesis. The overall goal of this thesis is to identify, annotate, and detect genomic duplication and polymorphism within large genomic regions. For this purpose, Mr. McLure selected the genomic region known as the regulators of complement activation (RCA) gene cluster. The RCA gene cluster covers over 2 Mb of sequence and encompasses over 60 genes or gene-like regions, 15 of which are closely related binding proteins for fragments of the third and fourth components of complement (C3 and C4). Mr. McLure also introduces brief preliminary analyses of canine and cattle DNA. The organization of this thesis is by chapter (six total), with each chapter addressing a separate aim of the thesis. An Introduction and a Conclusion/Future Directions section are also included. This report will comment on each section before providing a summary evaluation. Introduction In general, this is a well-written introduction to the thesis in which Mr. McLure provides a competent overview of the terms and approaches that he utilizes in the remaining sections. This includes explanations of polymorphic frozen blocks (PFB), ancestral/extended haplotypes (AH), and the genomic matching technique (GMT). This section also introduces the RCA cluster, and provides a good overview of the general structure of the RCA proteins, of the short consensus repeats (SCRs), and of the evolution and divergence of RCA proteins. Of note is the use of the term CCP, which is defined in the thesis (subheading 4.1) as a "complement control protein", ie. an RCA protein. However, most if not all other investigators use the term CCP as a synonym for SCR (eg. Biochemistry, 30:2847, 1991; Immunological Reviews, 180:112, 2001). Thus, using the term CCP to refer to an RCA protein can be confusing when considering other published work. It is suggested that this difference be noted to minimize confusion. One area possibly lacking sufficient attention is the presentation of the functions of the RCA proteins. The RCA proteins are characterized in this section as complement regulators that function either through acceleration of convertase decay or through cofactor function, and some important differences in ligand specificity among the RCA proteins are listed (Table, p 38). However, there is no consideration of the apparent redundancy in function among these proteins, or of their unique physiological roles. Other important non-complement regulatory functions, such as immunomodulation

Page 247: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 293 -

(CR2, likely DAF) and immune complex clearance (CR1), are not mentioned. This reviewer realizes that RCA function is not the subject of the thesis, and emphasizing function in the Introduction might detract from the focus of the thesis. Nevertheless, when comparing RCA genes across species (eg. mouse Crry vs. human CR1), it is helpful to have an understanding of the physiological roles played by these various RCA proteins. Minor comments include: 1. The tables in Introduction should be numbered. 2. In stating the complement regulatory role of RCA proteins (p 36), point (i)

should read "acting as a cofactor for the factor I-mediated cleavage of C3b..." 3. Table on page 38:

a) For DAF function - it's unclear what convertase "activation" is. If intended "inactivation", how is this different from DAA?

b) For CR2 function - CR2 does not bind C4d (g?). Chapter 1 This chapter is a reprint of the paper published by Mr. McLure and his colleagues in 2004 in J Mol Evol (impact factor 3.1). The focus of this chapter is an analysis of RCA genomic and amino acid sequences of different species. Each SCR was analyzed and classified into 11 different SCR subfamilies (a to k). Thus, for each analyzed RCA, SCRs could be classified as one of these 11 types. This appears to provide information about the evolutionary route taken by each RCA gene, the relevant orthologs across species (eg. Crry and CR1), and the SCRs important for RCA protein function (eg. aje). One note of interest should be mentioned. While most SCRs are encoded by single exons, a number of them are encoded by "split exons" (two exons/SCR) or by "fused exons" (one exon/2 SCRs). Though not discussed by Mr. McLure, assessing these types of SCRs across genes and species would seem to be particularly informative. For instance, are the "e/f' SCRs, which are fused exons in primate CR1 (SCRs 3/4, 10/11, etc), also fused in Crry? In RCA genes containing only the "e" (DAF) or "f' (CR2) SCR, is there evidence of remnants of the other SCR? Chapter 2 Chapter 2 is a reprint describing the research, published by Mr. McLure and colleagues (R.L. Dawkins as the corresponding author) in 2004 in Immunogenetics (impact factor 2.7). The work presented by Mr. McLure in this chapter describes the discovery of genomic sequence for another previously unrecognized SCR in primate CR1 and CR1L. This SCR is closest in homology to the "g" family of SCRs that are expressed in CR2 and MCP. It is not expressed in CR1/CR1L, and has thus been designated as "g-like". The existence of the "g-like" SCR suggests that the ancestral repeating series of SCRs in CR1 (the long homologous repeat) was 8 SCRs rather than 7 SCRs. Two cormments/questions become evident to this reviewer. First, the position of the "g-like" SCR is quite proximal to SCR "d" (< 30 nucleotides away), suggesting that at one time these represented fused exons. The "g-like" SCR also demonstrates homology to the "f' SCR, which is encoded by a fused exon in CR1 (eg. SCR 4). Is it known if the dig exons are fused in MCP or CR2? Second, it is concluded that "d+g-like" SCRs likely represent a functional set, based on its widespread existence. This reasoning does not appear convincing. Can this be supported by postulating what functions would have

Page 248: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 294 -

been exhibited by the "d+g-like" SCRs, based for instance on similarities to other SCRs? Chapter 3 This chapter represents the most recent of the three publications that have resulted from this thesis (R.L. Dawkins corresponding author). It was published earlier this year (2005) in Human Immunology (impact factor 2.6). Chapter 3 describes the genomic analysis of CR1 and CR1L, and some minimal analysis of the proximal MCP regions, aimed at understanding the evolution of this region. This analysis has revealed that a critical set of SCRs (the "ajef' combination) is maintained within CR1/CR1L/Crry across species. In addition, this analysis has led to a plausible paradigm explaining the evolution of both CR1 and CR1L from a primordial unit (CR1a). This section appears to be a logical extension of the studies described in Chapters 1 and 2, and as such is a relevant and important contribution to the thesis. One question that remains unaddressed is what were the evolutionary forces and biological advantages that drove the expansion of CR1 and the divergence of CR1L. Although a definitive answer is likely not achievable, especially with the current lack of knowledge of human CR1L function, this portion of the thesis would be strengthened through educated theories. Similarly, it would be interesting to postulate why CR1/CR1L/Crry have undergone substantial indel/duplication across species, while MCP remains highly conserved. Chapter 4 The aim of the work presented in this chapter was to use the GMT to identify haplotypes that define CR1/CR1L polymorphisms. Using this technique, numerous haplotypes (ancestral) were identified based on indel differences. These haplotypes were verified as transmissible, and were determined in a population (n = 322) composed of normal controls and patients with recurrent spontaneous abortion, hemochromatosis, psoriasis vulgaris, systemic lupus erythematosus, and Sjogren's syndrome. Certain haplotypes appeared to be disproportionately represented in some of the disease states, suggesting that these regions are linked to disease susceptibility. Perhaps more important, these studies show the potential power of the GMT in determine the role of any given polymorphic region in disease susceptibility. The research described in Chapter 4, which has been submitted for publication, is based in part on the knowledge that this RCA region behaves as a polymorphic frozen block, and builds on the understanding of the specific repetitive nature of this region. As such, this work utilizes the findings of Chapters 1-3 to devise a practical approach to understanding the role of RCA genetic diversity in disease. This culmination of Mr. McLure's research efforts represents a form of translational research, and is one of the stronger aspects of the thesis. This research also represents perhaps the most complex portion of the thesis. Some sections appear to require more detailed description or additional information to clarify some of the results and fully appreciate the applicability of the GMT. For instance: 1. In presenting the data of Fig 2 (manuscript p. 3), it wasn't clear that the two sets of potential products resulting from the GMT amplification also discriminated between CR1 (Fig 2; peaks 1-8) and CR1L (peaks 9-19). Although this is stated later (p. 5), it would facilitate the reader's understanding to point this out when first discussing these peaks.

Page 249: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 295 -

2. It wasn't clear why some of the CR1L haplotypes, resulting from an observed single band, were determined to be null values as opposed to homozygotes (as is mentioned for some of the CR1 haplotypes). If this is an issue of relative band intensity, this should be directly stated, and the basis for determining this should be described (eg. how were intensities standardized). 3. The frequency ranges shown in Table 1 need explained (eg. 156-165 for the 01 AH). Do these ranges reflect that some samples were undeterminable? 4. Though Fig 3 suggests that GMT is highly reproducible under different amplification conditions, the limitations or disadvantages of the GMT were not discussed. For example, what is the maximum amplicon size for which robust amplification can be expected, and how could this affect the interpretations of GMT profiles. 5. It is mentioned in the Fig 2 legend that some of the amplicons are race-specific, as would be expected. It would be informative to show these differences, or indicate the race-specific haplotypes, in the 322 subjects tested. 6. Was a statistical analysis performed on data shown in Table 2 and Fig 5 (incorrectly referred to as "Fig 6" on p. 6)? Is race accounted for as a potential confounder in the apparent differences? 7. It is mentioned on p. 5 that "defective or deficient CR1 might be expected to be associated with poor control of C4 activation and therefore disease". This requires further explanation, as CR1 has not been shown to control C4 activation, but rather controls activation of C3, C5, and beyond through DAA and cofactor functions. Chapter 5 and 6 Both of these chapters explore the commercial use of GMT in other species. As such, they can be viewed as pilot studies for future research, though manuscripts for both are cited as "in preparation". Chapter 5 deals with the use of GMT in an Australian canine population (Blue Heelers). Specifically, using the dog leukocyte antigen region as the target for the GMT primers, specific heritable GMT haplotypes were found, they identified DQ alleles, and they appeared more diverse than the DQ haplotypes. This work represents, in the words of Mr. McLure, a proof of principle that the GMT approach is feasible across species for measuring genetic variability in targeted regions. Chapter 6 is basically a proposal to use GMT to genetically characterize cattle haplotypes associated with polling or horning. The preliminary data include the identification of 920 cattle, covering 4 or more generations, all with various phenotypes. DNA has been isolated from many of these (an N of 250 is stated in the Prelude). Current efforts are focusing on identifying the appropriate genomic region in which to devise the GMT primers. Summary Evaluation The thesis entitled "Duplication and Polymorphism with Particular Reference to Regulators of Complement Activation" is a well organized, logical, and extensive analysis of the RCA gene cluster as a model of genomic divergence and duplication, and as a proving ground for the use of GMT. The first three chapters of this thesis have resulted in three publications in reputable journals, and as such have undergone the rigors of peer-review, which is the gold standard for judging the scientific quality of research. The fourth chapter builds on these accomplishments to fully characterize the genetic diversity of the CR1/CR1L

Page 250: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 296 -

region within the RCA gene cluster. The results reveal the practical aspects of this research, and implicate associations between genetic variability of this RCA region and certain diseases. This aspect of the thesis will likely result in a fourth publication. This reviewer has presented suggestions and questions for each of the sections. For the three sections that are composed of reprints (Chapters 1-3), it is not necessary to address the suggestions/questions, as these studies have already undergone peer-review. Alternatively, they can be addressed in the Prelude of these chapters at the discretion of Mr. McLure's advisor. For the other sections, it is anticipated that these suggestions/questions will receive some attention, though again at the discretion of the advisor. In conclusion, the thesis entitled "Duplication and Polymorphism with Particular Reference to Regulators of Complement Activation" exceeds typical expectations for a doctoral dissertation, and represents an important and original contribution in this area of science. As such, this reviewer judges the thesis as passed, subject to minor revision. Daniel J Birmingham

Page 251: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 297 -

RESPONSE TO EXAMINERS REPORTS

Page 252: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 298 -

Page 253: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 299 -

Response to examiners reports Response to questions by Reviewer 1 (PJ Lachmann) Question 1. “In Table 5 (Introduction table 1) there are some errors of fact. It is stated that C4bp competes with Factor B for binding to C4b. Factor B does not bind to C4b - it is C2 that binds to C4b in the classical pathway.”

I have changed “Factor B” to “C2” to correct this Question 2. “Again, he says that it binds C4dg. I don't believe that C4dg is a described fragment, and it is C3dg that is intended on Figure 5.”

This has been corrected to “C3dg” Question 3. “On the same table, he does not appreciate that CR1 is also a co-factor, and indeed the sole co-factor, for the Factor I mediated proteolysis of iC3b to C3dg, and that this is a very important function of CR1.”

The table reads “cofactor for the factor I mediated proteolysis of C3b/C4b”. This statement has been amended and now reads “cofactor for the factor I mediated proteolysis of iC3b/C4b”

Question 4. “It would also be worth pointing out that the function of CR1 is rather different in primates where it is present on erythrocytes and plays a major role in carrying immune complexes around the body, than in other mammals where CR1 is not found on red cells. It may be found on platelets as in rabbits, or it may be found on neither platelets nor red cells, as is the case in mice. Although mice do have CR1 and DAF and MCP, the major CCP is the molecule Crry, which is not seen in primates.”

From the work presented in chapters 1 and 3, we show evidence that Crry (which is expressed on the surface of rodent RBCs) is in fact the ortholog of Human CR1/CR1L. The accumulation of SCRs and LHRs has occurred as a process of evolution. Ideally the nomenclature in different mammals should be based on evolutionary analysis.

Page 254: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 300 -

Question 5. “The section on the RCA disease associations is really rather inadequate. The evidence that genetic polymorphism in CR1 is associated with lupus is weak. CR1 number polymorphism (detected by the associated DNA pattern) does not appear to be associated with disease nor does its size polymorphism. The substantial decrease in CR1 number on the red cells of patients with immune complex diseases is certainly acquired, and this is no longer a controversy. The decrease is seen not only in diseases like lupus but, very markedly, in cold autoantibody haemolytic anaemia where there is extensive complement fixation on the surface of red cells.”

We believe the association of CR1 with Lupus will be revealed through the genomic polymorphisms that we define in chapter 4, rather than the RFLP and SNP polymorphisms. If correct, the controversy may continue.

Question 6. “There are more interesting associations which are not discussed. These largely concern Factor H where deficiency or mutations (often of the C-terminal part of the molecule) are associated with familial haemolytic uraemic syndrome (HUS). Similar associations occur with mutations in MCP and in factor I itself, suggesting that failure of control of the alternative complement pathway does play a real part in the genesis of familial HUS. Much more recently, (and probably too late for comment in this thesis), are the observations that there is a strong associations between a common polymorphism in Factor H and macular degeneration.”

The initial focus was on CR1 and MCP but we have recently extended the GMT to define haplotypes within the RCA PFB containing Factor H and its duplicates. Using the data from this analysis, we shall complete studies of the HF related diseases/disorders. I have updated the disease table to include HUS and Age-related Macular Degeneration (AMD).

Question 7. “Finally, in this section Mr McLure reviews the role of CCPs in microbial virulence and cites a long list of observations. Viruses and bacteria use CCPs quite differently. Bacteria fix soluble mammalian CCPs (Factor H or C4 binding protein) from the plasma and bind them to "receptors" on their surfaces thereby achieving protection from complement attack. Viruses do not do this.”

It is not stated that viruses do bind soluble CCPs. Rather I report that FH may bind to the envelope proteins of HIV-1. This possibility is supported by a number of references.

Page 255: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 301 -

Question 8. “These findings are preliminary and difficult to evaluate partly because of the lack of detail about how the groups are defined and how well they are matched to the controls and partly because of a lack of statistical evaluation.”

The samples were provided by Professor Agrawal and have been described in a previous publication by the Agrawal group. It has been subsequently noted that the relevant reference in the manuscript was missing and this has since been rectified. Statistical analyses of the data in table 2 have been ongoing. The latest results are included as an amendment in the prelude to chapter 4. The point to emphasise is that I have discovered new haplotypes and need to test many disease groups as they come to hand. I acknowledge the valuable contributions of Ms Lester and Dr Agrawal.

Question 9. “It is likely that these are hypothesis-generating findings rather than ones that allow the hypothesis to be tested.”

A number of pre-defined hypotheses were made prior to commencement of this study. Obvious targets were Recurrent Spontaneos Abortion, Lupus and Sjogren’s disease and their CCP interactions. Due to the inherent difficulties in defining control groups for autoimmune conditions, multiple disease groups were tested, including those where an association was not expected. The initial strategy was to compare and contrast multiple disease groups.

Response to questions by Reviewer 2 (EJ Yunis) Reviewer 2 had no questions or amendments Response to questions by Reviewer 3 (DJ Birmingham) Question 1 “Of note is the use of the term CCP, which is defined in the thesis (subheading 4.1) as a "complement control protein", ie. an RCA protein. However, most if not all other investigators use the term CCP as a synonym for SCR (eg. Biochemistry, 30:2847, 1991; Immunological Reviews, 180:112, 2001). Thus, using the term CCP to refer to an RCA protein can be confusing when considering other published work. It is suggested that this difference be noted to minimize confusion.”

A brief note has been included directly following section 4.1 to address this suggestion

Question 2. “The tables in Introduction should be numbered.”

Table numbers have been included

Page 256: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 302 -

Question 3. “In stating the complement regulatory role of RCA proteins (p 36), point (i) should read "acting as a cofactor for the factor I-mediated cleavage of C3b..."

This has been corrected

Question 4. “For DAF function - it's unclear what convertase "activation" is. If intended "inactivation", how is this different from DAA?”

The word “activation” has been replaced with “regulation”. Question 5. “For CR2 function - CR2 does not bind C4d (g?).”

See reviewer 1, question 2. Question 6. “One note of interest should be mentioned. While most SCRs are encoded by single exons, a number of them are encoded by "split exons" (two exons/SCR) or by "fused exons" (one exon/2 SCRs). Though not discussed by Mr. McLure, assessing these types of SCRs across genes and species would seem to be particularly informative. For instance, are the "e/f' SCRs, which are fused exons in primate CR1 (SCRs 3/4, 10/11, etc), also fused in Crry?”

The observation of and comparison between the “split” and “fused” exons was a critical factor in defining the subfamilies. Therefore the observation of “j/k” and “ef/dg” has been observed throughout each species analysed.

Question 7. “In RCA genes containing only the "e" (DAF) or " f ' (CR2) SCR, is there evidence of remnants of the other SCR?”

Since publication of the manuscript presented in chapter 1, our knowledge of this region has increased significantly. The observation of common ancestors between the SCR subfamilies was mentioned in chapter 1. This stated that the a,j,e,f,b,k,d,g subfamilies had likely arisen through the duplication of a/b, j/k, e/d and f/g ancestors. This point is discussed further in the subsequent chapters. In relation to CR2, the patterns (ajef/bkdg) fall out by classifying each SCR as the ancestral subfamily listed above (a/b. j/k, e/d, f/g)

Question 8. “First, the position of the "g-like" SCR is quite proximal to SCR "d" (< 30 nucleotides away), suggesting that at one time these represented fused exons. The "g-like" SCR also demonstrates homology to the " f ' SCR, which is encoded by a fused exon in CR1 (eg. SCR 4). Is it known if the d/g exons are fused in MCP or CR2?”

The d/g exons, like the e/f exons (see above) do indeed form fused exons

Page 257: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 303 -

Question 9. “Second, it is concluded that "d+g-like" SCRs likely represent a functional set, based on its widespread existence. This reasoning does not appear convincing. Can this be supported by postulating what functions would have been exhibited by the "d+g-like" SCRs, based for instance on similarities to other SCRs?”

It is suggested that the “d/g” fused exon is likely to represent a functional set for two reasons: (i) it is conserved throughout multiple genes and species; and (ii) the evolutionary relationship to the “e/f” SCRs, which have been proven to be important in ligand recognition and binding.

Question 10. “It wasn't clear why some of the CR1L haplotypes, resulting from an observed single band, were determined to be null values as opposed to homozygotes (as is mentioned for some of the CR1 haplotypes). If this is an issue of relative band intensity, this should be directly stated, and the basis for determining this should be described (eg. how were intensities standardized).”

The haplotypes for those individuals with a single CR1L product were classified by either family analysis and segregation or in the situations where we could not make a definitive call from the intensity of the amplified products, both possible haplotypes were included in the analysis. This is why ranges are presented..

Question 11. “The frequency ranges shown in Table 1 need explained (eg. 156-165 for the 01 AH). Do these ranges reflect that some samples were undeterminable?”

See above. Question 12 & 13. “It is mentioned in the Fig 2 legend that some of the amplicons are race-specific, as would be expected. It would be informative to show these differences, or indicate the race-specific haplotypes, in the 322 subjects tested. Was a statistical analysis performed on data shown in Table 2 and Fig 5 (incorrectly referred to as "Fig 6" on p. 6)? Is race accounted for as a potential confounder in the apparent differences?”

See Reviewer 1, Question 8

I would also like to bring to the attention of the reader, that since submission of the thesis, the manuscript presented in chapter 5, which describes the Genomic Matching Technique within the canine MHC has been accepted for publication in the International Journal of Immunogenetics, and its status changed from “submitted” to “In Press”. In addition to this, the manuscript presented in chapter 4 has been reviewed, revised and resubmitted to Immunogenetics and it is expected that this will be accepted shortly. The status of this manuscript may therefore change from “submitted” to “In Press”.

Page 258: DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE … · DUPLICATION AND POLYMORPHISM WITH PARTICULAR REFERENCE TO REGULATORS OF COMPLEMENT ACTIVATION By Craig Anthony McLure

- 304 -