29
Lecture 5: Recombination, IBD distributions and linkage Magnus Dehli Vigeland Statistical methods in genetic relatedness and pedigree analysis NORBIS course, 6 th – 10 th of January 2020, Oslo

Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Lecture 5:Recombination, IBD distributions

and linkage

Magnus Dehli Vigeland

Statistical methods in genetic relatedness and pedigree analysis

NORBIS course, 6th – 10th of January 2020, Oslo

Page 2: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Outline

• Review from yesterday:

– Meiotic recombination

– IBD segments

• Consequences of the discrete nature of recombination

• Measuring distance in the genome1) Physical distance

2) Genetic map distance ( = crossover rate)

3) Recombination rate

• Map functions: Translating between 2) and 3)

• Using linked markers for relatedness

Statistical methods in genetic relatedness and pedigree analysis

Page 3: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Recombination and IBD

Statistical methods in genetic relatedness and pedigree analysis

Page 4: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

IBD segments

Statistical methods in genetic relatedness and pedigree analysis

1

0

1

2

1

IBD status

Page 5: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Consequences of the discrete nature of recombination

• Distant relatives: Possible to have no IBD sharing!

• 100 % inbred?

• Indistinguishable relationships - or not?

– Half siblings

– Grandparent/grandchild

– Aunt/nephew

Statistical methods in genetic relatedness and pedigree analysis

Page 6: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance
Page 7: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

11

10

9

5

4

3

2

1

13

15

10

8

4

3

2

1

13

14

9

(Picture: Mail Online)

13th cousins once removed

P(any IBD) ≈ 0

Page 8: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Is 100 % inbreeding possible?

• Full sibs mating scheme

• Easy to show:

– inbreeding coefficient f → 1

• But never f = 1! (in a finite pedigree)

Statistical methods in genetic relatedness and pedigree analysis

Realised inbreedingAfter ~30 generations,

usually freal = 1.

f1 = 0

f2 = 0.25

f3 = 0.375

f4 = 0.5

f5 = 0.59

Page 9: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Indistinguishable relationships?

Statistical methods in genetic relatedness and pedigree analysis

Page 10: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Distributions of IBD segments

Statistical methods in genetic relatedness and pedigree analysis

Clear difference between GP and the other two!

Page 11: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Outline

• Review from yesterday:

– Meiotic recombination

– IBD segments

• Consequences of the discrete nature of recombination

• Measuring distance in the genome1) Physical distance

2) Genetic map distance ( = crossover rate)

3) Recombination rate

• Map functions: Translating between 2) and 3)

• Using linked markers for relatedness

Statistical methods in genetic relatedness and pedigree analysis

Page 12: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

• The physical distance between two loci

= number of base pairs between them

• Units:

– 1 bp (base pair)

– 1 kb = 1000 bp ("1 kilobase")

– 1 Mb = 1 000 000 bp ("1 megabase")

• The physical distance/position is often the ultimate goal, but rarely accessible by experiments

Physical distance

Statistical methods in genetic relatedness and pedigree analysis

4 bp

ATCCGCACCAAAATG.............................................GTATCGGCGTCCATAGGCGTGGTTTTAC.............................................CATAGCCGCAGGT

100 Mb

Page 13: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Map distance

• Chromosomal crossovers:

• The genetic map distance between two loci

= average number of crossovers between them

• Units:

– 1 Morgan (M) = average 1 crossover per meiosis

– 1 centiMorgan (cM) = 0.01 M

• The entire human genome: Ca 30 Morgan

Statistical methods in genetic relatedness and pedigree analysis

Page 14: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Map distance

• Rule of thumb:

1 cM ≈ 1 Mb

• But: crossover rates vary

– across the genome

– males vs. females

Statistical methods in genetic relatedness and pedigree analysis

Page 15: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Females have a much longer genome!

Statistical methods in genetic relatedness and pedigree analysis

Page 16: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Consequences

Statistical methods in genetic relatedness and pedigree analysis

Page 17: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Statistical methods in genetic relatedness and pedigree analysis

Napoleon Bonaparte (1769 - 1821) Jane Austen (1775 - 1817)

10 gen.paternal line

P(any IBD sharing) ≈ 19 %

you

P(IBD sharing) ≈ 33 %

you

10 gen.maternal line

Page 18: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Can we separate these??

Statistical methods in genetic relatedness and pedigree analysis

Page 19: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Yes!

Statistical methods in genetic relatedness and pedigree analysis

Page 20: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

• So far:

– crossover rates: meiotic process

– IBD distributions

– has nothing to do with markers

• Now: Change focus

– marker genotypes

– what can we observe?

Statistical methods in genetic relatedness and pedigree analysis

Page 21: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Map distance

• Crossovers (and thus map distances) are not directly observable with markers

• Reason: multiple crossovers between markers

• Remember: We don't observe the meioses themselves, only their outcome (the gametes)

• What can we observe directly?

– Answer: Recombination rates between markers

Statistical methods in genetic relatedness and pedigree analysis

A

a

B

b

BA

a b

Page 22: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Recombination rate

Statistical methods in genetic relatedness and pedigree analysis

• The recombination rate between two loci

= average number of recombinant gametes

non-recombinant

recombinant

non-recombinant (!)

A b

A B

A B

a Brecombinant

A

a

B

b

A

a b

b

non-recombinant

recombinant

Page 23: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Recombination rate

Statistical methods in genetic relatedness and pedigree analysis

• Loci on different chromosomes: θ = 0.5

• Loci far apart on the same chromosome: θ ≈ 0.5

• Loci right next to each other: θ = 0

• Definition: Two loci are linked θ < 0.5

(In plain language: "On the same chromosome, not too far apart")

Page 24: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Crossover rate vs. recombination rate

Map distance: Crossover rate

d = E[#crossovers]

• Based on a fundamental property of the meiosis

• Very natural measure of distance

• But:

Hard to observe directly

Recombination rate:

θ = E[#recombinant gametes]

• Perhaps not as intuitive

• Relative to markers

• But:

Easy to estimate using genotyping

Statistical methods in genetic relatedness and pedigree analysis

Can we relate these in some way?

Page 25: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Haldane's map function

• Given observations on recombinations, we would like the compute the crossover rate.

• Requires a statistical model of the crossover process

• Haldane's model:

– Crossover events occur completely at random, with fixed rate 1.

– Events are independent (in reality not true: cannot be too close)

– Poisson process!

• From this easy to compute the recombination rate, using that

θ = P(odd number of crossovers)

Statistical methods in genetic relatedness and pedigree analysis

θ = ½(1 - e-2d)Haldane's map function:

Page 26: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Statistical methods in genetic relatedness and pedigree analysis

Page 27: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Likelihood with linked markers

Statistical methods in genetic relatedness and pedigree analysis

Unlinked markers:

L = L1 x L2

This does not hold if the markers are linked!

Ignoring linkage can lead to serious errors

Page 28: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Advantages of linked markers

Statistical methods in genetic relatedness and pedigree analysis

Cannot be distinguishedwith unlinked markers.

But they CAN with linkedmarkers!

Page 29: Lecture 5: Recombination, IBD distributions and linkage€¦ · – Meiotic recombination – IBD segments • Consequences of the discrete nature of recombination • Measuring distance

Summary

• Distributions of IBD segments

• Measures of genomic distance

– physical

– genetic distance (= crossover rate):

centiMorgan

– recombination rate

• Haldanes map function

• Marker linkage in relatedness analysis

– bad (if ignored)

– good (esp. for distinguished some relationships)

Statistical methods in genetic relatedness and pedigree analysis