Beruflich Dokumente
Kultur Dokumente
Dr Lorenzo Moneta
CERN PH-SFT
CH-1211 Geneva 23
sftweb.cern.ch
root.cern.ch
Outline
Probability definition
Probability Density Functions
Some typical distributions
Bayes Theorem
Parameter Estimation
Hypothesis Testing
Lorenzo Moneta
CERN PH-SFT
References
A lot of the material for this introduction to statistical
methods is extracted from a course:
Statistical Methods for Data Analysis
(Luca Lista, INFN Napoli)
Definition Of Probability
Two main different definitions:
Frequentist
Probability is the ratio of the number of occurrences of an event to
the total number of experiments, in the limit of very large number of
repeatable experiments.
Can only be applied to a specific classes of events (repeatable
experiments)
Meaningless to state: probability that the lightest SuSy particles
mass is less tha 1 TeV
Bayesian
Probability measures someones the degree of belief that
something is or will be true: would you bet?
Probability measures someones the degree of belief that
something is or will be true: would you bet?
Probability that Barcelona will win the next Champion League
Lorenzo Moneta
CERN PH-SFT
Classical Probability
Assume all accessible cases are equally probable
Valid on discrete cases only
Problem in continuous cases (definition of metrics)
Lorenzo Moneta
CERN PH-SFT
Binomial Distribution
Distribution of number of successes on N trials
e.g. spinning a coin or a dice N times
Average: <n> = Np
Variance: <n2>-<n>2 = Np(1-p)
Used for efficiency
In ROOT is available as
ROOT::Math::binomial_pdf(n,p,N)
Lorenzo Moneta
CERN PH-SFT
Frequentist Probability
Law of large numbers
Conditional Probability
Probability of A, given B : P(A|B)
probability that an event known to belong to set B is also
member of set A
P(A | B) = P(A B) / P(B)
A is independent of B if
the conditional probability
of A given B is equal to the
probability of A:
P(A | B) = P(A)
Hence, if A is independent on B
P(A | B) = P(A) P(B)
If A is independent on B, B is independent on A
Lorenzo Moneta
CERN PH-SFT
Lorenzo Moneta
CERN PH-SFT
Average =
Variance = 2
Widely used
because of the
central limit theorem
TMath::Gaus(x, , ,true)
ROOT::Math::normal_pdf( x, , )
TF1 f(f,gausn,xmin,xmax);
x = gRandom->Gaus(, );
PDF(x)
Gaussian PDF
=0 =0.3
=0 =1
1.2
=0 =3
=-2 =1
0.8
0.6
0.4
0.2
0
5
5
x
10
2 / ndf = 422.9 / 97
220
Constant
200
Mean
180
190.8 2.3
2 / ndf
300
Constant
4.989 0.022
250
Sigma
160
87.47 / 83
306.4 3.7
Mean
5.011 0.013
Sigma
1.293 0.009
2.031 0.015
140
200
120
150
100
80
100
60
40
n=2
20
0
0
Lorenzo Moneta
CERN PH-SFT
50
10
0
0
n=5
1
10
11
Standard Deviation
Lorenzo Moneta
CERN PH-SFT
12
Cumulative Distribution
Given a PDF f(x) the cumulative is defined as
13
0.35
normal_pdf
0.3
0.25
0.2
0.15
0.1
0.05
05
ROOT::Math::normal_quantile(p,)
ROOT::Math::normal_quantile_c(p,)
x5
normal_cdf
0.6
normal_cdf_c
0.4
0.2
4
x5
0.8
0.9
p1
05
normal_quantile
normal_quantile_c
1
0
1
2
3
0
Lorenzo Moneta
CERN PH-SFT
0.8
ROOT::Math::normal_cdf_c(x,,)
0.1
0.2
0.3
0.4
0.5
0.6
0.7
14
Poisson Distribution
Probability to have n entries in x a subset of X >> x
15
Lorenzo Moneta
CERN PH-SFT
16
Lorenzo Moneta
CERN PH-SFT
ROOT::Math::crystalball_pdf(x,,n,,)
TF1 f(f,crystalballn,xmin,xmax)
17
Landau Distribution
Model the fluctuations in the energy loss of particles in
this layers
Lorenzo Moneta
CERN PH-SFT
ROOT::Math::landau_pdf(x,s,m)
TF1 f(f,landaun,xmin,xmax)
18
Bayes Theorem
Lorenzo Moneta
CERN PH-SFT
19
A concrete example
A person received a diagnosis of a serious illness
The probability to detect positively a ill person is
~100%
The probability to give a positive result on a healthy
person is 0.2%
Lorenzo Moneta
CERN PH-SFT
20
We need to know
P(ill) = probability that a random person is ill << 1
P(healthy) = 1-P(ill)
We have also
P(+) = P(+ | ill) P(ill) + P(+|healhy)P(healty)
~ P(ill) + P(+| healthy)
21
Lorenzo Moneta
CERN PH-SFT
22
Likelihood Function
Likelihood function:
given some observed events: x1, xn
Likelihood function is the PDF of the variables x1, xn
L (x1, xn | 1,n )
likelihood
likelihood function
prior
prior probability
normalisation term
normalisation term
Lorenzo Moneta
CERN PH-SFT
23
Lorenzo Moneta
CERN PH-SFT
24
Bayesian Inference
Posterior summarises all information on the unknown
parameters given the data
Lorenzo Moneta
CERN PH-SFT
25
Lorenzo Moneta
CERN PH-SFT
26
27
Lorenzo Moneta
CERN PH-SFT
28
Lorenzo Moneta
CERN PH-SFT
29
Parameter Estimation
Parameter estimate
Likelihood function
Maximum Likelihood method
Property of estimators
Dr Lorenzo Moneta
CERN PH-SFT
CH-1211 Geneva 23
sftweb.cern.ch
root.cern.ch
30
Statistical Inference
Lorenzo Moneta
CERN PH-SFT
31
Parameter estimators
Lorenzo Moneta
CERN PH-SFT
32
Likelihood Function
Lorenzo Moneta
CERN PH-SFT
33
Lorenzo Moneta
CERN PH-SFT
34
Gaussian approximation
Lorenzo Moneta
CERN PH-SFT
35
Estimator properties
Consistency
Bias
Efficiency
Robustness
Lorenzo Moneta
CERN PH-SFT
36
Estimator consistency
Lorenzo Moneta
CERN PH-SFT
37
Bias
Lorenzo Moneta
CERN PH-SFT
38
Efficiency
Lorenzo Moneta
CERN PH-SFT
39
Robustness
Lorenzo Moneta
CERN PH-SFT
40
Lorenzo Moneta
CERN PH-SFT
41
Error Determination
Lorenzo Moneta
CERN PH-SFT
42
Hypothesis Testing
Definition of hypothesis testing
Neyman-Pearson lemma and
Likelihood ratio
Dr Lorenzo Moneta
CERN PH-SFT
CH-1211 Geneva 23
sftweb.cern.ch
root.cern.ch
43
Hypothesis Tests
Lorenzo Moneta
CERN PH-SFT
44
Hypothesis Test
H0 : null hypothesis
the hypothesis we want to prove that is false
e.g. the data contains only background (no Higgs signal)
H1 : alternate hypothesis
e.g. the data contains signal (Higgs) and background
45
Lorenzo Moneta
CERN PH-SFT
46
Likelihood Ratio
Lorenzo Moneta
CERN PH-SFT
47
Lorenzo Moneta
CERN PH-SFT
48
Summary
We will look next lectures on how to
how to use multivariate (machine learning) methods to do
classification and more
estimate the parameter uncertainty (errors) in maximum
likelihood fits
estimate confidence intervals
use hypothesis tests for estimate the discovery
significance of new particles
Lorenzo Moneta
CERN PH-SFT
49