Beruflich Dokumente
Kultur Dokumente
I. INTRODUCTION
Big Data is a pool of data which becomes useful only when patterns are drawn and knowledge and information is
extracted. Student’s lifestyle has different aspects which affect their grades. This is an area of utmost interest. It
can be used for the study of evolution in crowd and changes that can be made in course curriculum to make
students more efficient and interactive. Big data is a vast topic of research; I have proposed a way for the analysis
of big data of students of college in secondary education of two Portuguese schools. The data collected various
grades classified as G1 , G2 half yearly grade and G3 Final grade and students personal devotion of time into
various day-to-day activities.G1,G2 are strongly co-related with G3 as the final grade is cumulative outcome of
these two. Other aspects of students are graded and analysed which help in Multiple regression analysis. Some
aspects show a strong relation with the Grades which I have explained in my paper. Regression technique is being
used which helps in verifying the relation of different attributes with each other .It consist of independent and
several dependent variable whose correlation is test for the data analysis .It is the basic technique of data mining
and implemented by various tools like Excel , R , Python. The data sets can be further studied by the help of graphs
specifically Scattered Graphs.
II. REGRESSION ANALYSIS
Regression Analysis is used to find trends in data. It is specifically used in statistics. For example If you want to
check how much was your profit within 3 years on the basis of cost of raw material then you can predict it by
sketching a linear regression curve and determine the value of slope to analyse the rate of cost. Since regression
can be analysed by curve, it is easy to draw conclusion and predict the nature of trend.
__________________________________________________________________________________________________
IRJCS: Impact Factor Value – SJIF: Innospace, Morocco (2016): 4.281
Indexcopernicus: (ICV 2016): 88.80
© 2014- 17, IRJCS- All Rights Reserved Page -21
International Research Journal of Computer Science (IRJCS) ISSN: 2393-9842
Issue 12, Volume 4 (December 2017) www.irjcs.com
health
absences
traveltime
goout
Dalc
In this independent variable is Y=G3 (Final Grade score) and dependent variables X is from 2 – 11. All these
attributes have contributed in the analysis and have various different coefficients of correlation- test define the
probability of luck of an even or it’s true occurrence and t-test is obtained with the help of p- value which helps in
determining the significance. The Regression test has the following output which was conducted on the data set
with use of Excel data analysis technique. The below result is not used for analysis as the p-value greater than 1.5
does not help to predict the outcome i.e. not of significance. Therefore we run the regression again to interpret the
significant answer.
__________________________________________________________________________________________________
IRJCS: Impact Factor Value – SJIF: Innospace, Morocco (2016): 4.281
Indexcopernicus: (ICV 2016): 88.80
© 2014- 17, IRJCS- All Rights Reserved Page -25
International Research Journal of Computer Science (IRJCS) ISSN: 2393-9842
Issue 12, Volume 4 (December 2017) www.irjcs.com
traveltime
4.5
4
3.5
3
2.5
2
1.5
1
0.5 y = -0.0178x + 1.6338
0 R² = 0.0137
0 5 10 15 20 25
freetime
6
1
y = 3E-05x + 3.2287
0 R² = 2E-05
0 100 200 300 400 500
Fig 10 Graph of Free Time
Going out time
goout
6
1 y = -0.0323x + 3.4449
R² = 0.0176
0
0 5 10 15 20 25
__________________________________________________________________________________________________
IRJCS: Impact Factor Value – SJIF: Innospace, Morocco (2016): 4.281
Indexcopernicus: (ICV 2016): 88.80
© 2014- 17, IRJCS- All Rights Reserved Page -26
International Research Journal of Computer Science (IRJCS) ISSN: 2393-9842
Issue 12, Volume 4 (December 2017) www.irjcs.com
VI. CONCLUSION
As we have seen in the analysis of regression test on the Data Set the Final Grade of student decrease by increase
in one unit of travelling time and going out with friends but there is slight increase by 0.23 in grade if the attribute
free time increase as it lets the student do some recreational activities and helps in clearing his /her mind.
Therefore introduction of recess is not only good for health but also for the refreshment of brain and removal of
saturation caused by studying. By this research we can conclude that students should be given free time to make
them more efficient and energetic .In future more attributes of students life could be judged to analyse their scores
as they are the future of every nation and more powerful tools can also be used for analysis like R programming or
most upcoming language Python.
VII. LITERATURE REVIEW
Few research papers were found that discuss the behaviour of students. Abdul RaufBaiga ,HajiraJabeenb wrote a
paper entitled "analytics and expands it from the limited realm of websites and Ecommerce. They argue that
enough data is available in a university environment Big data analytics for behaviour monitoring of students", In
this paper, he explains a new meaning to behavioural ornament that can be harnessed with the help of Big Data
model and accompanying technologies to monitor and predict deviant behavior in students. Sarah Mohamed
Hassanaimed and Muna S. Al-Razgan in their research paper “Pre-University Exams Effect on Students GPA: A case
Study in IT Department", analyze the Many students have a very high score in the high school, but they did not
enter the college they want because of the scores in competition exams. GhadaBadra,b*, Afnan Algobaila, Hanadi
Almutairia, ManalAlmutery in their paper “Predicting Students’ Performance in University Courses: A Case Study
and Tool in KSU Mathematics Department” tells the performance of students in programming courses based on
their performance in English and Mathematics course .An application is designed based on CBA rule generation
algorithm .It shows that English course has Significant effect on programming course. Predicting Critical Courses
Affecting Students Performance: A Case Study by Yasmeen Altujjar, Wejdan Altamimi, Isra Al-Turaiki∗, Muna Al-
Razgan explain the effect of various subjects of IT department in the future selection of courses by the student and
their effectiveness. They have used the technique of Educational Data Mining to draw patterns and predict the data
accurately.
REFERENCES
1. GhadaBadra,b, AfnanAlgobaila, HanadiAlmutairia, ManalAlmutery use eeducational data mining technique for
analysis on” Predicting Students’ Performance in University Courses: A Case Study and Tool in KSU
Mathematics Department”
2. Abdul RaufBaiga,, HajiraJabeen analyse the deviation of students towards terrorism in their paper “Big data
analytics for behavior monitoring of students”
3. Sarah Mohamed Hassana,,Muna S. Al-Razganb on their paper “Pre-University Exams Effect on Students GPA:
A case Study in IT Department” study on gpa and hig school marks and draw conclusions.
4. HatemAbdulKadera, EmadElAbdb, WaleedEadc in their paper “Protecting online social networks profiles by
hiding sensitive data attributes” study about the todays generation interaction with social media and it’s
sensitive data.
5. LinahAburahmaha, Hajar AlRawib, Yamamah Izzc, Liyaka thunisa Syedd study on online social media and
gaming impacts on current generation on their paper” Online Social Gaming and Social Networking Sites”
6. Yasmeen Altujjar, Wejdan Altamimi, Isra Al-Turaiki , Muna Al-Razgan in their paper “Predicting Critical
Courses Affecting Students Performance: A Case Study” study about difficult subjects.
7. Extraction of Data Set from:- P. Cortez and A. Silva. Using Data Mining to Predict Secondary School Student
Performance. In A. Brito and J. Teixeira Eds., Proceedings of 5th Future Business Technology Conference
(FUBUTEC 2008) pp. 5-12, Porto, Portugal, April, 2008, EUROSIS, ISBN 978-9077381-39-7.
__________________________________________________________________________________________________
IRJCS: Impact Factor Value – SJIF: Innospace, Morocco (2016): 4.281
Indexcopernicus: (ICV 2016): 88.80
© 2014- 17, IRJCS- All Rights Reserved Page -27