ALL YOU NEED TO KNOW ABOUT STATISTICS
In 15 minutes
Roberto A. Vitillo



A 95% confidence interval means that if you took repeated random samples from a population and calculated the statistic and its CI for each sample, then about 95% of those CIs would include the true value of the population parameter being estimated.
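The repeated-sampling reading above can be checked by simulation. This is a minimal sketch, not something from the slides: the population (normal, mean 10, standard deviation 2), the sample size and the use of the normal-approximation interval for the mean are all assumptions chosen for illustration, and `coverage` is a hypothetical helper name.

```python
import random
import statistics

def coverage(true_mean=10.0, sigma=2.0, n=50, trials=2000, seed=0):
    """Fraction of ~95% CIs for the mean that contain the true mean."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Draw one random sample from the (assumed) normal population.
        sample = [rng.gauss(true_mean, sigma) for _ in range(n)]
        m = statistics.fmean(sample)
        se = statistics.stdev(sample) / n ** 0.5
        # Normal-approximation interval: mean +/- 1.96 standard errors.
        lo, hi = m - 1.96 * se, m + 1.96 * se
        hits += lo <= true_mean <= hi
    return hits / trials

print(coverage())  # expected to land close to 0.95
```

The printed fraction should hover around 0.95; it will not be exactly 0.95 because the number of trials is finite and the 1.96 multiplier is itself an approximation for small samples.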


Central Limit Theorem

For means it’s easy: the histogram of averages tends to look normal even when the histogram of the individuals doesn’t!

aka sampling distribution of the mean
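A quick way to see this effect is to compare the skewness of individual values with the skewness of their averages. The sketch below assumes an exponential population and a sample size of 30, neither of which comes from the slides; `skewness` is a small helper written for this example.

```python
import random
import statistics

def skewness(xs):
    """Population skewness: mean cubed z-score (about 0 for a symmetric shape)."""
    m = statistics.fmean(xs)
    s = statistics.pstdev(xs)
    return sum(((x - m) / s) ** 3 for x in xs) / len(xs)

rng = random.Random(1)

# Individuals: a strongly right-skewed exponential population (assumed).
individuals = [rng.expovariate(1.0) for _ in range(20000)]

# Averages: the mean of 30 individuals, repeated 5000 times.
averages = [
    statistics.fmean([rng.expovariate(1.0) for _ in range(30)])
    for _ in range(5000)
]

print("skewness of individuals:", skewness(individuals))
print("skewness of averages:   ", skewness(averages))
```

The averages should come out far less skewed than the individuals, which is the histogram-looks-normal effect in numeric form.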


It’s easy to derive a confidence interval once we know what the theoretical sampling distribution looks like.


~95% confidence interval
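For the mean, this interval can be written out explicitly, assuming the sampling distribution is approximately normal. The symbols are not on the slides: $\bar{x}$ is the sample mean, $s$ the sample standard deviation, $n$ the sample size, and 1.96 the 97.5% quantile of the standard normal distribution.

```latex
\bar{x} \pm 1.96 \cdot \frac{s}{\sqrt{n}}
```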


But I don’t care about means…


What now?

• Call this guy if you live in the early 20th century: Henry Berthold Mann, known for the Mann-Whitney nonparametric test

• Throw some (virtual) dice on your laptop


not only compilers can be bootstrapped…

n bootstrap samples, each of size k, are generated by sampling with replacement from the original sample A


[Figure: the original sample A is resampled into bootstrap samples X1*, X2*, X3*]
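The resampling step can be sketched in a few lines. The function name `bootstrap_samples` and the example data are made up for illustration; `k` defaults to the size of A, which is the usual choice but an assumption here, since the slides only say "each of size k".

```python
import random

def bootstrap_samples(A, n, k=None, seed=None):
    """Return n bootstrap samples, each of size k, drawn with replacement from A."""
    rng = random.Random(seed)
    k = len(A) if k is None else k
    # random.choices samples WITH replacement, so values of A can repeat.
    return [rng.choices(A, k=k) for _ in range(n)]

A = [3, 7, 1, 9, 4]  # made-up original sample
for sample in bootstrap_samples(A, n=3, seed=0):
    print(sample)
```

Each printed sample has the same size as A, and some values of A typically appear more than once while others are missing.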


In the next phase, the statistic of interest is calculated for each bootstrap sample; the resulting collection of values is the bootstrap distribution.

bootstrap distribution

The bootstrap distribution is an approximation of the sampling distribution.
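As a sketch of this phase, the function below resamples A and applies a statistic to every bootstrap sample. The median is used only because it is a statistic without a convenient textbook sampling distribution; the data, the number of resamples and the name `bootstrap_distribution` are assumptions for the example.

```python
import random
import statistics

def bootstrap_distribution(A, statistic, n=2000, seed=None):
    """Apply `statistic` to n bootstrap samples of A and return the n values."""
    rng = random.Random(seed)
    return [statistic(rng.choices(A, k=len(A))) for _ in range(n)]

A = [12, 15, 9, 22, 17, 14, 30, 11, 16, 13, 19]  # made-up original sample
dist = bootstrap_distribution(A, statistics.median, n=2000, seed=0)

print("sample median:              ", statistics.median(A))
print("mean of bootstrap medians:  ", statistics.fmean(dist))
print("spread of bootstrap medians:", statistics.stdev(dist))
```

The spread of `dist` serves as an estimate of how much the median would vary from sample to sample, without needing a formula for its sampling distribution.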


~95% confidence interval
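One common way to read a ~95% interval off the bootstrap distribution is the percentile method: take its 2.5% and 97.5% quantiles. The slides do not say which method they use, so treat this as one possible sketch; the data and the name `percentile_ci` are made up.

```python
import random
import statistics

def percentile_ci(A, statistic, n=2000, seed=None):
    """~95% percentile bootstrap CI for `statistic`, from n resamples of A."""
    rng = random.Random(seed)
    dist = [statistic(rng.choices(A, k=len(A))) for _ in range(n)]
    # quantiles(n=40) returns 39 cut points in 2.5% steps:
    # the first is the 2.5% quantile, the last is the 97.5% quantile.
    cuts = statistics.quantiles(dist, n=40)
    return cuts[0], cuts[-1]

data = [12, 15, 9, 22, 17, 14, 30, 11, 16, 13, 19, 21, 10, 18, 25]  # made-up sample
low, high = percentile_ci(data, statistics.median, seed=0)
print("median:", statistics.median(data), "CI:", (low, high))
```

Other bootstrap intervals exist (for example bias-corrected variants); the percentile method is simply the most direct to read off the distribution.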


• Resampling methods are powerful tools

• A similar procedure can be applied for A/B tests

• Check out montecarlino
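For the A/B case, one possible sketch is to resample each group independently and bootstrap the difference of the statistic between them. This uses only the Python standard library and is not montecarlino's API; the groups are simulated, and `bootstrap_diff_ci` is a hypothetical name.

```python
import random
import statistics

def bootstrap_diff_ci(a, b, statistic=statistics.median, n=2000, seed=None):
    """~95% percentile bootstrap CI for statistic(b) - statistic(a)."""
    rng = random.Random(seed)
    diffs = [
        statistic(rng.choices(b, k=len(b))) - statistic(rng.choices(a, k=len(a)))
        for _ in range(n)
    ]
    cuts = statistics.quantiles(diffs, n=40)  # 2.5% steps
    return cuts[0], cuts[-1]

# Simulated groups (assumption): B is shifted upwards by 2 relative to A.
gen = random.Random(42)
group_a = [gen.gauss(10.0, 1.0) for _ in range(200)]
group_b = [gen.gauss(12.0, 1.0) for _ in range(200)]

low, high = bootstrap_diff_ci(group_a, group_b, seed=0)
print("CI for median(B) - median(A):", (low, high))
```

If the interval excludes 0, the data suggest a real difference between the groups; if it straddles 0, the experiment is inconclusive at this level.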