Sie sind auf Seite 1von 4

International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056

Volume: 06 Issue: 04 | Apr 2019 www.irjet.net p-ISSN: 2395-0072

STUDENT PLACEMENT PREDICTION USING MACHINE LEARNING


Aksha Prasad1, Shreyas Harinath2, Suma H S3, Suraksha A4, Tojo Mathew 5

1
BE, Department of CSE, NIE Mysore, Karnataka, India
2
BE, Department of CSE, NIE Mysore, Karnataka, India
3
BE, Department of CSE, NIE Mysore, Karnataka, India
4
BE, Department of CSE, NIE Mysore, Karnataka, India
5
Assistant Professor, Department of CSE, NIE Mysore, Karnataka, India

---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract – Placement of Students is one of the very the managerial system. With the machine learning
important activities in educational institutions. Admission techniques the knowledge can be extracted from
and reputation of institutions mainly depends on operational and historical data that resides within
placements. Hence all institutions strive to Strengthen the educational organization’s databases using. The
placement department. The main Objective of this paper is dataset for system implementation contains
to analyze previous year’s student’s historical data and information about past data of students. These data
predict placement possibilities of current students and aids are used for training the model for rule identification
to increase the placement percentage of the institutions.
and for testing the model for classification. The
prediction of placement status that students are most
This paper presents a recommendation system that predicts
likely to achieve will help students to put in more
whether the current student will be placed or not, if the
hard work to make appropriate progress in stepping
student is placed the company is also predicted based on the
into a career in various technical fields. It will also
data of previously placed students. Here we use two
help the teachers as well as placement cell in an
different machine learning classification algorithms, namely institution to provide proper care towards the
Naive Bayes Classifier and K- Nearest Neighbors [KNN] improvement of students in the duration of course. A
algorithm. These algorithms independently predict the high placement rate is a key entity in building the
results and we then compare the efficiency of the reputation of an educational institution. Hence such a
algorithms, which is based on the dataset. This model helps system has a significant place in the educational
the placement cell within an organization to identify the system of any higher learning institution. We use
prospective students and pay attention to and improve their Naive Bayes and K- Nearest neighbors [KNN]
technical and interpersonal skills. machine learning module to provide efficient and
accurate results.
Key Words: Machine Learning, Naive Bayes, K-
Nearest Neighbors (KNN), Database.
1.1 Prediction System

In this paper the focus on machine learning technique to


1. INTRODUCTION
predict placement status of the student provided through
text input. The placement prediction is done by machine
Nowadays educational institutes are growing in high
numbers. Aim of every higher educational institute is learning using Naïïve Bayes and K-nearest neighbor (KNN)
to get their students a well-paid job through their algorithm. The algorithm considers the parameters such
placement cell. One of the biggest challenges that as USN, Tenth and PUC/Diploma results, CGPA,Technical
higher learning institutions face today is to improve and Aptitude Skills.
the placement performance of students. The
placement prediction is more complex when the 1.2 Naive Bayes Classifier
complexity of educational entities increase. One of
The Naive Bayes classifier has proved to be very effective
the effective ways to address the challenges for
on many real data applications. Its performance usually
improving the quality is to provide new knowledge
related to the educational processes and entities to

© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 04 | Apr 2019 www.irjet.net p-ISSN: 2395-0072

benefits from an accurate estimation of univariate


conditional probabilities and from variable selection.

1.3 K-nearest neighbor (KNN) 3. METHODOLOGY

KNN is a simple algorithm which uses entire dataset Naive Bayes


during its training phase whenever prediction is required
for unseen data. It searches through entire training dataset Naive Bayes is a simple technique for constructing
for k- most similar instances and data with most similar classifiers: models that assign class labels to problem
instance are returned. instances, represented as vectors of feature values, where
the class labels are drawn from some finite set. There is
1.2.1 Features of KNN not a single algorithm for training such classifiers, but a
family of algorithms based on a common principle: all
 KNN stores the entire training dataset which it naive Bayes classifiers assume that the value of a
uses as its representation. particular feature is independent of the value of any other
feature, given the class variable. For example, a fruit may
 It makes predictions just-in-time by calculating
be considered to be an apple if it is red, round, and about
the similarity between an input sample and each
10 cm in diameter. A naive Bayes classifier considers each
training instance. of these features to contribute independently to the
 It works on similarity measures. probability that this fruit is an apple, regardless of any
possible correlations between the color, roundness, and
diameter features.
2. RELATED WORK
Working Of Naive Bayes Algorithm
Senthil Kumar Thangavel, Divya Bharathi P and Abhijith
Shankar [1] conducted a study to predict student Step 1: Scan the dataset (storage servers) retrieval of
placement status using two attributes, areas and CGPA required data for mining from the servers such as
results. They made use of Decision Tree Learning, SCI-Kit database, cloud, excel sheet etc.
leaning in machine Learning here they use only two
parameters such as CGPA and arrears used algorithm takes Step 2: Calculate the probability of each attribute value. [n,
n_c, m, p] Here for each attribute we calculate the
more time for prediction not efficient .
probability of occurrence using the following formula.
(mentioned in the next step). For each class (Course) we
Wilton W.T. FOK, Y.S. He, H.H Au Yeung and K.Y. Law[2]
should apply the formulae.
Conducted a study to predict suitable course for the
students, based on their behavior using Neural Network Step3:P(attributevalue(ai)/subjectvaluevj)=(n_c+mp)/
Technique. A research on how to use TensorFlow Artificial (n+m) apply the above formulae
Intelligence engine for classifying students performance Where:
and forecasting their future universities degree program is n = the number of training examples for which v = vj
studied. TensorFlow engine includes number of nc = number of examples for which v = vj and a = ai
p = a priori estimate for P(aivj)
intermediate node and number of deep learning layers are
m = the equivalent sample size
adjusted and compared.
Step 4: Multiply the probabilities by p for each class, here
Machine Learning deals with the development, analysis we multiple the results of each attribute with p and final
and study of algorithms that can automatically detect results are used for classification.
patterns from data and use it to predict future data or
perform decision making [3]. Machine learning does its Step 5: Compare the values and classify the attribute
functionality by creating models out of it [4]. Machine values to one of the predefined set of class.
Learning has become widespread and has its applications
in the field of bioinformatics, computer vision, robot
locomotion, computational finance, search engine etc. K-nearest neighbors classifier (KNN)

K-Nearest neighbors (KNN) [3] is a simple, lazy and


nonparametric classifier. KNN is preferred when all the
features are continuous. KNN is also called as case-based

© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 2
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 04 | Apr 2019 www.irjet.net p-ISSN: 2395-0072

reasoning and has been used in many applications like


pattern recognition, statistical estimation. Classification is
obtained by identifying the nearest neighbors to
determine the class of an unknown sample. KNN is
preferred over other classification algorithms due to its
high convergence speed and simplicity. Figure below
shows nearest neighbors classification. KNN classification
has two stages

4. FUTURE ENHANMENTS
The future enhancements of the project is to focus
on extending it as a mobile based application and to
use advanced sophisticated algorithm for
improvising the voice input. We use KNN for
prediction. Other algorithms like ID3 or C4.5 or
Naive Bayes Algorithms can also be used. Future,
we may add more and different disease and related
1) Find the k number of instances in the dataset that is treatments to the dataset which makes it more
closest to instance S approachable to the people.

2) These k number of instances then vote to determine the


class of instance S
5. CONCLUSIONS
The Accuracy of KNN depends on distance metric and K
value. Various ways of measuring the distance between Student Placement Predictor is a system which
two instances are cosine, Euclidian distance. To evaluate predicts student placement status using machine
the new unknown sample, KNN computes its K nearest learning techniques. Many research papers are
neighbors and assign a class by majority voting. there related to educational sector, all these papers
mainly concentrates on student performance
Working of KNN Algorithm predictions. All these predictions help the institute
to improvise the student performance and can
Our proposed method aims to enhance the performance of come up with 100% results. Many of the previous
KNN classifier for disease prediction. Algorithm for our research papers concentrate on less number of
proposed method is shown below parameters such as CGPA and Arrears for
placement status prediction which leads to les
Step 1: Input the data set. accurate results, but proposed work contains many
educational parameters to predict placement
Step 2: Apply KNN algorithm for input parameters. status which will be more accurate.

Step 3: Euclidean distance between existing parameters


and newly entered is calculated.

Step 4: Based on similarity measures output is predicted.

ACKNOWLEDGEMENT

© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 3
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 04 | Apr 2019 www.irjet.net p-ISSN: 2395-0072

It gives us great pleasure in presenting this project report


titled “Student Placement Prediction Using Machine
Learning” and we wish to express our immense gratitude
to the people who provided invaluable knowledge and
support in the completion of this project. Their guidance
and motivation has helped in making this project a great
success. We express our gratitude to our project guide Mr.
Tojo Mathew, who provided us with all the guidance and
encouragement throughout the project development. All
the relevant and important details are included in this
report. At beginning we have given quite summary
regarding the project we are building and as we proceed
details about how project is going to be implemented is
mentioned using technologies.

REFERENCES

[1] “Student Placement Analyzer: A Recommendation


System Using Machine Learning” 2017 International
Conference on advanced computing and
communication systems(ICACCS-2017), Jan 06-
07,2017, Coimbatore, INDIsA.
[2] “Prediction Model for Students Future Development
by Deep Learning and TensorFlow Artificial
Intelligence Engine” 2018 4th IEEE International
Conference on Information Management.
[3] Kohavi, R and F, Provost (1998).Machine Learning
30:271-274.

© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4

Das könnte Ihnen auch gefallen