[Figure: kernel density estimate; x-axis: Wilcoxon score (700-1300), y-axis: density]
Introduction
[Figure: distribution of the p-values on [0, 1]]
We consider the estimation of functions such as regression functions or probability density functions. Kernel-based methods are among the most popular non-parametric estimators. They can uncover structural features in the data that a parametric approach might not reveal.
Given a random sample X_1, ..., X_n from a continuous, univariate density f, the kernel density estimator is

\hat{f}(x; h) = \frac{1}{nh} \sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right)

with kernel K and bandwidth h. Under mild conditions, the estimator converges in probability to the true density.
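The estimator above can be coded directly. A minimal sketch with a Gaussian kernel; the sample and the bandwidth are made up for illustration:

```python
import numpy as np

def kde(x, data, h):
    """Kernel density estimate at points x with a Gaussian kernel K
    and bandwidth h: (1/(n*h)) * sum_i K((x - X_i)/h)."""
    u = (x[:, None] - data[None, :]) / h
    K = np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)
    return K.sum(axis=1) / (len(data) * h)

rng = np.random.default_rng(1)
data = rng.normal(size=200)           # made-up sample from N(0, 1)
grid = np.linspace(-5.0, 5.0, 1001)
fhat = kde(grid, data, h=0.4)

# a density estimate should integrate to ~1 (trapezoidal rule)
area = float(np.sum((fhat[1:] + fhat[:-1]) * np.diff(grid)) / 2.0)
```

Since the Gaussian kernel is itself a pdf, the estimate is non-negative and integrates to one up to discretization error.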
The kernel K
The kernel K can be any proper pdf and is usually chosen to be unimodal and symmetric about zero. The center of the kernel is placed over each data point, so the influence of each point is spread over its neighborhood; the contributions from all points are summed to form the overall estimate.
The bandwidth h
The bandwidth h is a scaling factor: it controls how widely the probability mass is spread around each point and thus the smoothness or roughness of the density estimate. Bandwidth selection bears the danger of under- or oversmoothing.
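The effect can be sketched with a made-up sample of two well-separated clusters: a small h preserves the gap between the modes, while a large h smooths it away.

```python
import numpy as np

def kde_at(x, data, h):
    # Gaussian-kernel density estimate at a single point x
    u = (x - data) / h
    return float(np.mean(np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)) / h)

# made-up data: two tight clusters at 0 and 5
data = np.repeat([0.0, 5.0], 50)

# small bandwidth: rough estimate, keeps the gap between the modes
f_gap_small = kde_at(2.5, data, h=0.3)   # midpoint between the clusters
f_mode_small = kde_at(0.0, data, h=0.3)  # at a mode

# large bandwidth: oversmoothed estimate, fills in the gap
f_gap_large = kde_at(2.5, data, h=3.0)
f_mode_large = kde_at(0.0, data, h=3.0)
```

With h = 0.3 the estimate is near zero between the clusters and sharply peaked at the modes; with h = 3.0 the two modes blur into one broad hump.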
[Figures: KDEs of the p-values on [0, 1] for several bandwidths, illustrating under- and oversmoothing]
Some kernels
[Figure: the Uniform, Triangular, Epanechnikov, Biweight and Gauss kernels]
Kernel efficiency
The performance of a kernel is measured by the MISE (mean integrated squared error) or the AMISE (asymptotic MISE). The Epanechnikov kernel minimizes the AMISE and is therefore optimal. Kernel efficiency is measured relative to the Epanechnikov kernel.
Kernel        Efficiency
Epanechnikov  1.000
Biweight      0.994
Triangular    0.986
Normal        0.951
Uniform       0.930
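These efficiencies can be reproduced numerically: with C(K) = sigma_K * int K^2, where sigma_K^2 is the kernel variance, efficiency is C(Epanechnikov)/C(K). A sketch; the integration grids and step sizes are arbitrary choices:

```python
import numpy as np

def canonical_constant(K, grid):
    """C(K) = sigma_K * R(K), with sigma_K^2 = int u^2 K(u) du
    and R(K) = int K(u)^2 du (trapezoidal integration)."""
    w = K(grid)
    dx = np.diff(grid)
    trap = lambda y: float(np.sum((y[1:] + y[:-1]) * dx) / 2.0)
    sigma = np.sqrt(trap(grid**2 * w))
    R = trap(w**2)
    return sigma * R

u1 = np.linspace(-1.0, 1.0, 20001)     # compact-support kernels
ug = np.linspace(-12.0, 12.0, 48001)   # Gaussian kernel

C = {
    "Epanechnikov": canonical_constant(lambda u: 0.75 * (1 - u**2), u1),
    "Biweight":     canonical_constant(lambda u: 15/16 * (1 - u**2)**2, u1),
    "Triangular":   canonical_constant(lambda u: 1 - np.abs(u), u1),
    "Normal":       canonical_constant(lambda u: np.exp(-u**2/2) / np.sqrt(2*np.pi), ug),
    "Uniform":      canonical_constant(lambda u: np.full_like(u, 0.5), u1),
}
eff = {name: C["Epanechnikov"] / c for name, c in C.items()}
```

Rounded to three decimals, the computed ratios match the table above.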
Modified KDEs
Local KDE: the bandwidth depends on x. Variable KDE: smooths out the influence of points in sparse regions. Transformation KDE: if f is difficult to estimate (highly skewed, high kurtosis), transform the data to obtain a pdf that is easier to estimate.
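The transformation idea can be sketched for right-skewed data: estimate the density of Y = log X, which is closer to normal, then back-transform via f_X(x) = f_Y(log x) / x. A minimal illustration; the sample and bandwidth are made up:

```python
import numpy as np

def kde(x, data, h):
    # Gaussian-kernel density estimate at points x
    u = (x[:, None] - data[None, :]) / h
    return np.exp(-0.5 * u**2).sum(axis=1) / (len(data) * h * np.sqrt(2.0 * np.pi))

rng = np.random.default_rng(7)
x_data = rng.lognormal(mean=0.0, sigma=1.0, size=300)  # made-up skewed sample

# estimate on the log scale, where the density is easy to estimate
y_data = np.log(x_data)
x_grid = np.exp(np.linspace(-7.0, 7.0, 4001))          # log-spaced grid in x
f_x = kde(np.log(x_grid), y_data, h=0.3) / x_grid      # back-transformed density

# the back-transformed estimate is still a density: it integrates to ~1
area = float(np.sum((f_x[1:] + f_x[:-1]) * np.diff(x_grid)) / 2.0)
```

The change of variables guarantees that the back-transformed estimate remains a proper density.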
Bandwidth selection
Simple versus high-tech selection rules. Objective function: MISE/AMISE. The R function density offers several selection rules.
bw.nrd0, bw.nrd
Normal scale rule. Assumes f to be normal and calculates the AMISE-optimal bandwidth in this setting. A reasonable first guess, but it oversmooths if f is multimodal or otherwise non-normal.
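bw.nrd0 implements Silverman's rule of thumb, h = 0.9 * min(sd, IQR/1.34) * n^(-1/5), which is easy to reproduce. A sketch in Python rather than R (R's bw.nrd0 additionally guards against a zero spread, omitted here); the sample is made up:

```python
import numpy as np

def bw_nrd0(data):
    """Silverman's rule of thumb, as in R's bw.nrd0:
    h = 0.9 * min(sd, IQR/1.34) * n^(-1/5)."""
    n = len(data)
    sd = np.std(data, ddof=1)
    # numpy's default (linear) percentile matches R's quantile type 7
    iqr = np.percentile(data, 75) - np.percentile(data, 25)
    return 0.9 * min(sd, iqr / 1.34) * n ** (-0.2)

data = np.arange(1.0, 101.0)   # made-up sample: 1, 2, ..., 100
h = bw_nrd0(data)
```

For this sample, sd ~ 29.01 is smaller than IQR/1.34 ~ 36.94, so the standard deviation drives the bandwidth.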
bw.ucv
Unbiased (or least-squares) cross-validation. Estimates part of the MISE by a leave-one-out KDE and minimizes this estimate with respect to h. Problems: several local minima, high variability.
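The least-squares CV criterion, LSCV(h) = int fhat^2 - (2/n) * sum_i fhat_{-i}(X_i), can be sketched and minimized over a grid of bandwidths. A Gaussian-kernel sketch; the data, bandwidth grid, and integration grid are made up:

```python
import numpy as np

def lscv(h, data):
    """Least-squares CV score: int fhat^2 - (2/n) * sum_i fhat_{-i}(X_i),
    with a Gaussian kernel; int fhat^2 is evaluated numerically."""
    n = len(data)
    grid = np.linspace(data.min() - 4*h, data.max() + 4*h, 1024)
    u = (grid[:, None] - data[None, :]) / h
    fhat = np.exp(-0.5 * u**2).sum(axis=1) / (n * h * np.sqrt(2*np.pi))
    int_f2 = float(np.sum((fhat[1:]**2 + fhat[:-1]**2) * np.diff(grid)) / 2.0)
    # leave-one-out estimates at the data points: zero the diagonal
    v = (data[:, None] - data[None, :]) / h
    K = np.exp(-0.5 * v**2) / np.sqrt(2*np.pi)
    np.fill_diagonal(K, 0.0)
    loo = K.sum(axis=1) / ((n - 1) * h)
    return int_f2 - 2.0 * float(loo.mean())

rng = np.random.default_rng(3)
data = np.concatenate([rng.normal(0, 1, 100), rng.normal(6, 1, 100)])

h_grid = np.linspace(0.05, 2.0, 40)
scores = np.array([lscv(h, data) for h in h_grid])
h_best = float(h_grid[scores.argmin()])
```

In practice the score surface can have several local minima, which is why a grid search (or R's bw.ucv with its own search interval) is used rather than a single local optimizer.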
bw.bcv
Biased cross-validation. The estimation is based on optimizing the AMISE instead of the MISE (which bw.ucv uses). Lower variance than UCV, at the cost of some bias.
bw.SJ(method = c("ste", "dpi"))
The AMISE optimization involves estimating density functionals such as integrated squared density derivatives. dpi: direct plug-in rule; estimates the needed functionals by KDE. Problem: the choice of the pilot bandwidth. ste: solve-the-equation rule; the pilot bandwidth depends on h.
Simulation results depend on the selected true densities. Selectors with pilot bandwidths perform quite well but rely on asymptotics, which are less accurate for densities with sharp features (e.g. multiple modes). UCV has high variance but does not depend on asymptotics. BCV performs poorly in several simulations. The authors' recommendation: DPI or STE rather than UCV or BCV.
Stefanie Scheid - Introduction to Kernel Smoothing - January 5, 2004