Beruflich Dokumente
Kultur Dokumente
I.TOPIC
III.LESSON PROPER
IV.SUMMARY
V.REFLECTION
VI.REFERENCE
I. TOPIC:
the sum of all measurements divided by the number of observations in the data set.
Median
the middle value that separates the higher half from the lower half of the data set. The
median and the mode are the only measures of central tendency that can be used
for ordinal data, in which values are ranked relative to each other but are not measured
absolutely.
Mode
the most frequent value in the data set. This is the only central tendency measure that
can be used with nominal data, which have purely qualitative category assignments.
Geometric mean
The nth root of the product of the data values, where there are n of these. This
measure is valid only for data that are measured absolutely on a strictly positive scale.
Harmonic mean
the reciprocal of the arithmetic mean of the reciprocals of the data values. This measure
too is valid only for data that are measured absolutely on a strictly positive scale.
Weighted arithmetic mean
an arithmetic mean that incorporates weighting to certain data elements.
Midrange
the arithmetic mean of the maximum and minimum values of a data set.
Midhinge
the arithmetic mean of the first and third quartiles.
Trimean
the weighted arithmetic mean of the median and two quartiles.
Winsorized mean
an arithmetic mean in which extreme values are replaced by values closer to the
median.
Any of the above may be applied to each dimension
of multi-dimensional data, but the results may not
be invariant to rotations of the multi-dimensional
space. In addition, there are the
Geometric median
which minimizes the sum of distances to the data points. This is the same as the median
when applied to one-dimensional data, but it is not the same as taking the median of
dimensions.
useful in engineering, but not often used in statistics. This is because it is not a good
indicator of the center of the distribution when the distribution includes negative values.
Simplicial depth
the probability that a randomly chosen simplex with vertices from the given distribution
points
the sense of the calculus of variations, namely minimizing variation from the center. That is,
given a measure of statistical dispersion, one asks for a measure of central tendency that
minimizes variation: such that variation from the center is minimal among all choices of center.
In a quip, "dispersion precedes location". This center may or may not be unique. In the sense
of spaces, the correspondence is:
The associated functions are called p-norms: respectively 0-"norm", 1-norm, 2-norm, and ∞
-norm. The function corresponding to the L0 space is not a norm, and is thus often referred to
in quotes: 0-"norm".
In equations, for a given (finite) data set X, thought of as a vector , the dispersion about a
point c is the "distance" from x to the constant vector in the p-norm (normalized by the
number of points n):
For the largest number dominates, and thus the ∞-norm is the maximum difference.
Uniquenes
The mean (L2 center) and midrange (L∞ center) are unique (when they exist), while the
median (L1 center) and mode (L0 center) are not in general unique. This can be understood
The 2-norm and ∞-norm are strictly convex, and thus (by convex optimization) the
minimizer is unique (if it exists), and exists for bounded distributions. Thus standard
deviation about the mean is lower than standard deviation about any other point, and the
maximum deviation about the midrange is lower than the maximum deviation about any
other point.
The 1-norm is not strictly convex, whereas strict convexity is needed to ensure uniqueness
of the minimizer. Correspondingly, the median (in this sense of minimizing) is not in general
unique, and in fact any point between the two central points of a discrete distribution
Information geometry
The notion of a "center" as minimizing variation can be generalized in information
geometry as a distribution that minimizes divergence (a generalized distance) from a data
set. The most common case is maximum likelihood estimation, where the maximum
likelihood estimate (MLE) maximizes likelihood (minimizes expected surprisal), which can be
interpreted geometrically by using entropy to measure variation: the MLE minimizes cross
A simple example of this is for the center of nominal data: instead of using the mode (the
only single-valued "center"), one often uses the empirical measure (the frequency
distribution divided by the sample size) as a "center". For example, given binary data, say
heads or tails, if a data set consists of 2 heads and 1 tails, then the mode is "heads", but
the empirical measure is 2/3 heads, 1/3 tails, which minimizes the cross-entropy (total
surprisal) from the data set. This perspective is also used in regression analysis, where least
squares finds the solution that minimizes the distances from it, and analogously in logistic
center of the data distribution. Along with the variability (dispersion) of a dataset, central
The central tendency is one of the most quintessential concepts in statistics. Although it does
not provide information regarding the individual values in the dataset, it delivers a
Mean (Average): Represents the sum of all values in a dataset divided by the total
number of the values.
Median: The middle value in a dataset that is arranged in ascending order (from the
smallest value to the largest value). If a dataset contains an even number of values, the
median of the dataset is the mean of the two middle values.
Mode: Defines the most frequently occurring value in a dataset. In some cases, a
dataset may contain multiple modes while some datasets may not have any mode at all.
Even though the measures above are the most commonly used to define central tendency,
there are some other central tendency measures, including, but not limited to, geometric mean,
The selection of central tendency as a measure depends on the properties of a dataset. For
instance, mode is the only central tendency measure of categorical data while a median works
Although mean is regarded as the best measure of central tendency for quantitative data, it is
not always the case. For example, mean may not work well with quantitative datasets that
contain extremely large or extremely small values. The extreme values may distort the mean.
The measures of central tendency can be found using a formula or definition. Also, they can be
identified using a frequency distribution graph. Note that for the datasets that follow a normal
distribution, the mean, median, and mode are located on the same spot on the graph.
IV.SUMMARY
A measure of central tendency is a summary statistic that represents the center point or
typical value of a dataset. ... In statistics, the three most common measures of central
tendency are the mean, median, and mode. Each of these measures calculates the location of
the central point using a different method. Each statistic returns different values. The mean
and median provide relatively similar values therefore it would be important to understand
whether the data set represents a sample or whether the data set represents the population. If
the data is from a sample or could become more skewed by extreme values, the median is
probably a better representation of the central tendency in this case. If the population is evenly
distributed about the mean (or is normally distributed) the mean is a better representation of
central tendency. In the sample data set the mode provides little explanative power.
Understanding which measure of central tendency allows change agents to better target
changes and if your contract uses metrics to determine performance, which measure of central
measure you can have an impact. Changing or arguing over which to use smacks of poor
V.REFLECTION
This is a discovery lesson to deepen the understanding of central tendency (mean, median) by
posing relevant scenarios that students must examine and explore. It is the exploration of the
salary negotiations for the Los Angeles Lakers and the use of a see-saw to physically model
what the algorithm for an average truly finds. This will lead students to understand the pairing
of the measures of central tendency and spread dictated by the shape of the distribution. A
detailed explanation of the answers is provided with the guided discovery questions so that the
teacher will be able to deepen student knowledge by eliciting the nuances of the information
presented. “Why do We need to Measure Central Tendency of Any Data?”... Also we notice
that it is not reflecting typical on average sales and correct measure ... this article interesting
then stay tuned for more posts on interesting topics.
VI.REFERENCE
References Weisberg H.F (1992) Central Tendency and Variability, Sage University Paper Series
on Quantitative Applications in the Social Sciences, ISBN 0-8039-4007-6 p.2 Upton, G.; Cook, I.
(2008) Oxford Dictionary of Statistics, OUP ISBN 978-0-19-954145-4 (entry for "central
tendency") Dodge, Y. (2003) The Oxford Dictionary of Statistical Terms, OUP for International
Statistical Institute. ISBN 0-19-920613-9 (entry for "central tendency") Johnson NL, Rogers CA
(1951) "The moment problem for unimodal distributions". Annals of Mathematical Statistics, 22
(3) 433–439 Hotelling H, Solomons LM (1932) The limits of a measure of skewness. Annals
Math Stat 3, 141–114 Garver (1932) Concerning the limits of a mesuare of skewness. Ann
Math Stats 3(4) 141–142