Table of Contents
Introduction
Objective
Concept
MEAN
MEDIAN
MODE
STANDARD DEVIATION
COEFFICIENT OF VARIATION
Methodology
INTERPRETATION
Table 1 : Comparison Of High School, Intermediate, Graduation Marks
Analysis:-
High School
Intermediate
Graduation
CONCLUSION
Introduction
The data set covers the 63 students of ABC Institute. For each student it records the marks scored in
high school, intermediate and graduation; the subject history, that is, whether the student came from a
science, arts and humanities, commerce or professional background; the gender; and the CAT
percentile scored.
Objective
Comparison of high school, intermediate and graduation marks
Comparison of CAT percentile on the basis of gender
Comparison of CAT percentile on the basis of subject history
Calculation of the coefficient of variation to assess consistency
Concept
MEAN
The mean (or average) is the most popular and best-known measure of central tendency. It can be
used with both discrete and continuous data, although it is most often used with continuous data.
The mean is equal to the sum of all the values in the data set divided by the number of values in the
data set. So, if a data set has n values x1, x2, ..., xn, the sample mean, usually denoted by
$\bar{x}$, is

$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n}$$
MEDIAN
The median is the middle score for a set of data that has been arranged in order of magnitude. The
median is less affected by outliers and skewed data.
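To illustrate why the median is less affected by outliers than the mean, here is a minimal sketch using Python's standard `statistics` module. The marks below are hypothetical and are not taken from the ABC Institute data.

```python
from statistics import mean, median

# Hypothetical marks, not taken from the ABC Institute data.
marks = [62, 65, 68, 70, 71]

# The same marks with one unusually low score added as an outlier.
marks_with_outlier = marks + [5]

# Without the outlier the mean and median are close to each other.
print("mean:", mean(marks), "median:", median(marks))

# With the outlier the mean drops sharply, while the median moves only a little.
print("mean:", mean(marks_with_outlier), "median:", median(marks_with_outlier))
```

A large gap between mean and median is therefore a hint, not a proof, that outliers or skewness are present.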
MODE
The mode is the most frequent score in our data set. On a bar chart or histogram it corresponds to
the highest bar. You can, therefore, sometimes consider the mode as being the most
popular option.
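As a small sketch of how the mode can be found, the example below uses `mode` and `multimode` from Python's standard `statistics` module (Python 3.8 or later). The lists are hypothetical and only loosely modelled on the subject backgrounds described in the introduction.

```python
from statistics import mode, multimode

# Hypothetical subject backgrounds for a handful of students.
backgrounds = ["Science", "Commerce", "Science", "Arts and Humanities",
               "Professional", "Science", "Commerce"]

# The mode works on categories as well as numbers.
most_common_background = mode(backgrounds)
print(most_common_background)

# Hypothetical marks in which two values are equally frequent.
marks = [60, 72, 72, 65, 60]

# multimode returns every value that shares the highest frequency.
tied_modes = multimode(marks)
print(tied_modes)
```

When several values tie for the highest frequency, the data set has more than one mode, so reporting a single mode can be misleading.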
STANDARD DEVIATION
Standard deviation (SD, also represented by the Greek letter sigma or the Latin letter s) is a
measure that is used to quantify the amount of variation or dispersion of a set of data values. A
low standard deviation indicates that the data points tend to be close to the mean (also called the
expected value) of the set, while a high standard deviation indicates that the data points are
spread out over a wider range of values.
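$$\sigma = \sqrt{\sigma^2} = \sqrt{\frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}}$$

where $\mu$ is the mean of the data set and $N$ is the number of values.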
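The definition above divides the sum of squared deviations by N, which gives the population standard deviation. The sketch below computes it by hand and compares it with `pstdev` (population) and `stdev` (sample, dividing by N - 1) from Python's standard library. The marks are hypothetical. Spreadsheet summary tools typically report the sample version, so small differences from the formula above are to be expected.

```python
import math
from statistics import pstdev, stdev

def population_sd(values):
    """Square root of the mean squared deviation from the mean (divides by N)."""
    n = len(values)
    mu = sum(values) / n
    return math.sqrt(sum((x - mu) ** 2 for x in values) / n)

# Hypothetical marks, not taken from the ABC Institute data.
marks = [60, 65, 70, 75, 80]

print("by hand (divide by N):      ", population_sd(marks))
print("statistics.pstdev (N):      ", pstdev(marks))
print("statistics.stdev (N - 1):   ", stdev(marks))
```

The sample value is always slightly larger than the population value for the same data, and the gap shrinks as the number of values grows.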
COEFFICIENT OF VARIATION
The coefficient of variation (CV) is the ratio of the standard deviation to the mean (average). For
example, the statement "the standard deviation is 15% of the mean" expresses a CV of 15%. The CV is
particularly useful when you want to compare results from two different surveys or tests that
have different measures or values.
FORMULA
$$CV = \frac{\text{standard deviation}}{\text{mean}} \times 100\%$$
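A minimal sketch of the CV calculation follows, assuming the sample standard deviation is used and the result is expressed as a percentage. The function name `coefficient_of_variation` and the two data sets are illustrative, not part of the original analysis.

```python
from statistics import mean, stdev

def coefficient_of_variation(values):
    """Return the CV as a percentage: 100 * sample standard deviation / mean."""
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return 100 * stdev(values) / m

# Hypothetical scores on a 100-point scale.
scores_out_of_100 = [60, 65, 70, 75, 80]

# The same scores expressed on a 10-point scale.
scores_out_of_10 = [6.0, 6.5, 7.0, 7.5, 8.0]

cv_100 = coefficient_of_variation(scores_out_of_100)
cv_10 = coefficient_of_variation(scores_out_of_10)
print(round(cv_100, 2), round(cv_10, 2))
```

Both data sets give the same CV even though their standard deviations differ by a factor of ten, which is why the CV is suited to comparing consistency across different scales.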
Methodology
To achieve the first objective, that is, the comparison of high school, intermediate and
graduation marks:
Select Data Analysis → Descriptive Statistics → select Input Range (marks of all three years) →
select Labels in First Row and Summary Statistics → select OK
By following the above procedure we get the statistical summary for marks of all three years.
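For readers without a spreadsheet at hand, the summary produced by this procedure can be approximated in Python. The sketch below covers only a subset of the usual summary fields; the function names, the choice of the adjusted sample skewness formula and the marks are all assumptions made for illustration.

```python
import math
from statistics import mean, median, multimode, stdev, variance

def sample_skewness(values):
    """Adjusted sample skewness: n / ((n-1)(n-2)) * sum(((x - mean) / s) ** 3)."""
    n = len(values)
    if n < 3:
        raise ValueError("skewness needs at least three values")
    m = mean(values)
    s = stdev(values)  # sample standard deviation; must be non-zero
    return n / ((n - 1) * (n - 2)) * sum(((x - m) / s) ** 3 for x in values)

def summary_statistics(values):
    """Return a dictionary of common descriptive statistics for one column."""
    s = stdev(values)
    return {
        "mean": mean(values),
        "standard_error": s / math.sqrt(len(values)),
        "median": median(values),
        "mode": multimode(values),
        "standard_deviation": s,
        "sample_variance": variance(values),
        "skewness": sample_skewness(values),
        "range": max(values) - min(values),
        "minimum": min(values),
        "maximum": max(values),
        "sum": sum(values),
        "count": len(values),
    }

# Hypothetical high school marks, not taken from the ABC Institute data.
high_school = [58, 62, 66, 69, 70, 71, 75, 80, 84]

summary = summary_statistics(high_school)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Calling `summary_statistics` once per column (high school, intermediate, graduation) gives three summaries that can be compared side by side.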
To achieve the second objective, that is, the comparison of CAT percentile on the basis of gender,
the data was first sorted by gender and then the following procedure was followed:
Select Data Analysis → Descriptive Statistics → select Input Range (male percentile) → select Labels
in First Row and Summary Statistics → select OK
To get the statistical summary for females, the female percentiles were selected instead of the
male percentiles:
Select Data Analysis → Descriptive Statistics → select Input Range (female percentile) → select
Labels in First Row and Summary Statistics → select OK
To achieve the third objective, that is, the comparison of CAT percentile on the basis of subject
history, the data was first sorted by subject history and then the following procedure was followed:
Select Data Analysis → Descriptive Statistics → select Input Range (CAT percentile of students with
professional background) → select Labels in First Row and Summary Statistics → select OK
The same procedure was followed for science, commerce, and arts and humanities students, giving
comparable summaries for all four backgrounds.
The coefficient of variation was calculated for the high school, intermediate and graduation
marks.
The coefficient of variation was also calculated for each subject history and each gender.
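The sort-then-summarize steps above amount to grouping the records by a category and summarizing each group. A minimal sketch of that idea follows; the record layout, the field names (`gender`, `subject`, `cat_percentile`), the helper `summarize_by` and all values are hypothetical.

```python
from collections import defaultdict
from statistics import mean, median, stdev

# Hypothetical student records; field names and values are illustrative only.
students = [
    {"gender": "Male", "subject": "Science", "cat_percentile": 72},
    {"gender": "Male", "subject": "Commerce", "cat_percentile": 80},
    {"gender": "Male", "subject": "Professional", "cat_percentile": 65},
    {"gender": "Female", "subject": "Science", "cat_percentile": 68},
    {"gender": "Female", "subject": "Commerce", "cat_percentile": 74},
    {"gender": "Female", "subject": "Professional", "cat_percentile": 71},
]

def summarize_by(records, key, value="cat_percentile"):
    """Group records by `key` and summarize the `value` field of each group."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key]].append(record[value])
    # Each group needs at least two values for the sample standard deviation.
    return {
        group: {
            "count": len(vals),
            "mean": mean(vals),
            "median": median(vals),
            "cv_percent": 100 * stdev(vals) / mean(vals),
        }
        for group, vals in groups.items()
    }

by_gender = summarize_by(students, "gender")
by_subject = summarize_by(students, "subject")

for group, stats in by_gender.items():
    print(group, stats)
for group, stats in by_subject.items():
    print(group, stats)
```

Grouping in code avoids re-sorting the sheet and re-selecting ranges by hand for every category.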
INTERPRETATION
Table 1 : Comparison Of High School, Intermediate, Graduation Marks
Analysis:-
High School
The high school marks have a mean of 69.9 and a median of 69.5. Because the two are close, the
data does not appear to be distorted by extreme outliers.
The skewness of the high school marks is 0.17, so the distribution is approximately
symmetric.
The coefficient of variation is 13.1% (approx.) for High School.
Intermediate
The intermediate marks have a mean of 71.2 and a median of 72. Because the two are close, the
data does not appear to be distorted by extreme outliers.
The skewness of the intermediate marks is -0.04, so the data is very slightly negatively skewed,
i.e., it has a marginally longer left tail, and is close to symmetric.
The coefficient of variation is 12.4% (approx.) for Intermediate.
Graduation
The graduation marks have a mean of 64.2 and a median of 63. Because the two are close, the
data does not appear to be distorted by extreme outliers.
The skewness of the graduation marks is 0.49, so the data is mildly positively skewed, i.e., it
has a somewhat longer right tail.
The coefficient of variation is 8.3% (approx.) for Graduation.
According to the coefficient of variation, the graduation marks are the most consistent, while the
high school marks are the least consistent.
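The ranking above follows directly from the reported figures: a lower CV means more consistent marks. The short sketch below sorts the reported CVs and, as a cross-check, derives the standard deviation implied by each mean and CV. Only the variable names are introduced here; the numbers are those stated in the analysis.

```python
# Means and coefficients of variation (in percent) as reported in the analysis.
reported_mean = {"High School": 69.9, "Intermediate": 71.2, "Graduation": 64.2}
reported_cv = {"High School": 13.1, "Intermediate": 12.4, "Graduation": 8.3}

# Sort from the lowest CV (most consistent) to the highest (least consistent).
ranked = sorted(reported_cv.items(), key=lambda item: item[1])
most_consistent = ranked[0][0]
least_consistent = ranked[-1][0]
print("most consistent:", most_consistent)
print("least consistent:", least_consistent)

# Since CV = 100 * SD / mean, the implied SD is CV * mean / 100.
implied_sd = {
    level: reported_cv[level] * reported_mean[level] / 100
    for level in reported_cv
}
for level, sd in implied_sd.items():
    print(f"{level}: implied standard deviation of about {sd:.2f} marks")
```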
Table 2: Comparison Of CAT Percentile On The Basis Of Subject History
Professional    Science    Commerce    Arts And Humanities
Analysis:-
Professional
The professional data has a mean of 71.5 and a median of 72. Because the two are close, the data
does not appear to be distorted by extreme outliers.
The skewness of the data is 0.44, so the data is only mildly positively skewed.
The coefficient of variation is 15.1% (approx.) for professional background students.
Science
The science data has a mean of 69.1 and a median of 68.5. Because the two are close, the data
does not appear to be distorted by extreme outliers.
The skewness of the data is 0.22, so the distribution is approximately symmetric.
The coefficient of variation is 19.3% (approx.) for science background students.
Commerce
The commerce data has a mean of 71.5 and a median of 70. Because the two are fairly close, the
data does not appear to be distorted by extreme outliers.
The skewness of the data is 0.63, so the data is moderately positively skewed, i.e., it has a
longer right tail.
The coefficient of variation is 13.5% (approx.) for commerce background students.
Male    Female
Analysis:-
Male
The male data has a mean of 72.4 and a median of 72. Because the two are close, the data does
not appear to be distorted by extreme outliers.
The skewness of the data is -0.05, so the data is very slightly negatively skewed, i.e., it has
a marginally longer left tail, and is close to symmetric.
The coefficient of variation is 15.8% (approx.) for males.
Female
The female data has a mean of 69.7 and a median of 68. Because the two are fairly close, the
data does not appear to be distorted by extreme outliers.
The skewness of the data is 0.55, so the data is moderately positively skewed, i.e., it has a
longer right tail.
The coefficient of variation is 15.2% (approx.) for females.
According to the coefficient of variation, the female data (15.2%) is slightly more consistent
than the male data (15.8%).
CONCLUSION
The analysis shows that the performance of females is more consistent than that of males. The scores
of students from Arts and Humanities are more consistent than those of students from science,
commerce or professional courses. Graduation marks are more consistent than high school and
intermediate marks. The coefficient of variation (C.V.) was used to compare the consistency of the
students' scores on each of these bases.