Beruflich Dokumente
Kultur Dokumente
xi /n.
n i=1 (xi
s2 . x
x)/sx )3 .
3(n1)2 . (n2)(n3)
Negative skewness means longer left tail, while positive skewness means longer right tail. Excess kurtosis =
n(n+1) (n1)(n2)(n3) n i=1 ((xi
x)/sx )4
High kurtosis results in exceptional values that are called fat tails or heavy tails. Fat/heavy tails indicate a higher percentage of very low and very high returns than would be expected under a bell shape (normal) distribution. Covariance and correlation: sxy =
n i=1 (xi
The closer rxy is to 1 the stronger the linear relationship is with a positive slope. When one goes up, the other tends to go up. The closer rxy is to -1 the stronger the linear relationship is with a negative slope. When one goes up, the other tends to go down. If p = ax+by+cz, then s2 = a2 s2 +b2 s2 +c2 s2 +2 (absxy + acsxz + bcsyz ) and p = a+b+c. x y z p x y z An example of the above linear combination is portfolio allocation where x, y and z are the rms (assets) and a,b and c are their weights with a + b + c = 1. Boxplot: Quartiles split the whole data into four quarters. Q1=Median of the rst half of the data. Q2=Median of the whole data. Q3=Median of the second half of the data.
Q11.5*(Q3Q1)
Q3+1.5*(Q3Q1)
Q1
Q2
Q3
Basic probability
Let p(x, y) denote the joint distribution of discrete random variables X and Y . Then, the marginal distribution of X can be obtained from the joint as p(x) =
y
p(x, y)
while the conditional distribution of X given that Y = y is p(x|y) = joint p(x, y) = . p(y) marginal
The table below is one example: the probability that Y=happy is equal to 0.46, the probability that X=17.5 (high salary) is equal to 0.24, while the probability that Y=happy AND X=17.5 is equal to 0.14. Mathematically, P(Y=happy)=0.46, P(X=17.5)=0.24 and Pr(Y=happy,X=17.5)=0.14. Happiness (Y) unhappy ok happy P(X) 0.03 0.12 0.07 0.22 0.02 0.13 0.11 0.26 0.01 0.13 0.14 0.28 0.01 0.09 0.14 0.24 0.07 0.47 0.46 1.00
Conditional on X=17.5, i.e. conditional on the subset whose salary is 17.5, we can completely ignore the top three rows (corresponding to salaries of 2.5, 7.5 and 12.5) from the joint distribution. Y unhappy P (Y |X = 17.5) 0.01 ok happy 0.09 0.14
However, the probabilities on the new table do not add up to 1.0 and should be normalized by 0.24, its total. Therefore, the new table with the conditional probability distribution of Y conditional on X = 17.5 is given by Y unhappy ok happy P (Y |X = 17.5) 0.0417 0.3750 0.5833
If X1 , . . . , Xn are independent and identically distributed (i.i.d.) Bernoulli(p), then X = n i=1 Xi Binomial(n, p). Standard normal distribution: Z N (0, 1). The Z-table is the standard normal table. Also, P r(Z (1, 1)) 0.68 and P r(Z (2, 2)) 0.95. Other normal distributions: X N (, 2 ). Here, P r(X ( , + )) 0.68 and P r(Z ( 2, + 2)) 0.95. Cumulative distribution: F (x) = P r(X x); so P r(a < X b) = F (b) F (a).
xi p(xi ) and 2 =
i (xi
i (xi
)2 p(xi )
If X and Y are independent then xy = xy = 0. If xy = xy = 0, then X and Y are not necessarily independent.
Approximate 95% C.I. for the population proportion p: p 2se(), where p is the sample p proportion of successes and se() = p(1 p)/n. p
P-value: For point null hypothesis (as above), the P-value is twice the probability of obtaining a test statistic (or x or p) at least as extreme as the one observed.