Smart-Hire Personality Prediction Using ML
Smart-Hire Personality Prediction Using ML
Abstract—Through technological changes, data science and alone. This quality involves being gregarious, enthusiastic,
artificial intelligence are altering the world. One of the most and self-assured.
significant uses of machine learning is the classification of
people based on their personality features. We can see many Agreeableness: A person's capacity for empathy and
applications of machine learning in our daily lives. Each cooperation, which are indicators of their ability to relate
individual on the world has a distinct personality type. with others. Inherent in this quality are tact, gentleness, and
Targeting particular demographics has made it possible to
make marketing campaigns more effective. This is made loyalty.
possible by the availability of high-dimensional data. Such
The tendency to have unfavorable personality traits,
personality-based promotions are quite effective at raising
brand awareness and enhancing the appeal of goods and unstable emotions, and damaging thoughts is known as
services. Using the Big Five personality traits, we created a neuroticism. Pessimism, worry, insecurity, and fearfulness
system for predicting personality. Every day, a large number of are characteristics of this trait.
students take competitive exams with a strong personality
component. These tests' primary goal is to evaluate the student's
talents and personality. Writing the personality test and
assessing the subject's personality are made easier by this
initiative. The person can view their personality type and make
improvements to their personality depending on the results of
the personality classification. In our paper, we tried to combine
phrase frequency algorithm to determine a person's talent and
personality prediction utilizing ML algorithms like KNN, CNN,
and Logistic regression to predict a person's personality. From
this model or system, users can quickly determine his
personality and level of technical proficiency.
I. INTRODUCTION
The "big five" personality traits are openness,
conscientiousness, extroversion, agreeableness, and
Fig 1. Big five factors represent individual’s personality
neuroticism. They are frequently referred to as "OCEAN" and
occasionally "CANOE." These five personality traits cover a Objective of the System: Automating the selection of
wide range of analysis of individual nature is responsible for
candidates is the key goal. The idea is to create a system that
decision-making as well as personality variations. The model
is being used by HR specialists to evaluate new hires and by will make it easier to recognize the personality traits
marketers to understand the target consumers for their displayed by the applicant and learn more about them
products. In this paper, the OCEAN Model is being used to without actually meeting them. The company will have a
construct the algorithm. better understanding of the candidate and be in a better
Receptivity to new things: This personality quality, position to choose the best applicant for the open position.
sometimes known as intelligence and inventiveness,
advocates for the openness to explore new stuff and to think II. LITERATURE SURVEY
outside the box. Insightfulness, inventiveness, and curiosity “Multiple Social Networking source of data for Text-
are characteristics of this feature. based Character Prediction”
Conscientiousness: The urge to exercise self-control, be In order to extract features for a personality prediction
cautious, and work hard. This quality involves determination, system, using pre-trained language models BERT, RoBERTa,
self-control, reliability, and consistency. and XLNet along with additional NLP features, this work
Extroversion: The tendency to seek up human proposed a multi-modal deep learning architecture (sentiment
communication or connection rather than spending time analysis, TF-IGM, and NRC emotion lexicon database). In
382
Authorized licensed use limited to: Zhejiang University. Downloaded on November 24,2024 at 04:49:50 UTC from IEEE Xplore. Restrictions apply.
2023 International Conference on Disruptive Technologies (ICDT)
Manage Jobs & Options: The administrator can control the D- Design Algorithm
selections based on the specifications of the employment
position and the available job postings. Machine learning (ML) algorithms are used to deliver the
outcomes of the entire process prior to the interview stage. It
View Candidates: The administrator has access to all of requires some input training data, most of which is made up
the candidate's data. of earlier candidate selection judgments made by subject-
matter experts. The eligibility ratings from the analysis
View Results: The evaluation results for the shortlisted determine a candidate's suitability for the position. The
candidates are visible to the admin. feedback on the candidate's claimed attributes is passed along
to a learning algorithm, which uses the data to generate the
evaluation. The system then creates the final ranked list to
choose the applicants. In the proposed approach, a set of input
employee selection patterns is used to represent the training
data set. The KNN (K-Nearest Neighbors) and Logistic
Regression understanding algorithm ranking used in the
prediction model and evaluation tests do the job well.
a- KNN (K-Nearest Neighbors)
It is possible to use the KNN method for classification and
Fig.5. Admin System Workflow regression issues. Based on "feature similarity," the KNN
b- Section-2 Candidate Page algorithm predicts fresh data points' values. This suggests that
the value assigned to the new point depends on how much it
resembles the points in the train dataset. [20]
Predictions are produced by looking over the entire
training set for the K instances (neighbors) that match the new
instance (x) the most. Following that, the output variable for
these K instances is added. In classification, this might be the
modal (or most common) class value, and in regression, it
might be the mean output variable. A distance measure is used
Fig.6. Data flow diagram of Candidate Page
to identify which of the K samples in the train dataset is most
similar to the new input. [20]
Registration: To access the following sections, the
b- Logistic Regression
candidate must first complete the registration form and
create their login credentials. A CV form must be Using the supervised learning classification process
completed and submitted by the applicant as part of the known as logistic regression, the likelihood of a target
registration process. variable is predicted. There are only two feasible classes
because of the dichotomous character of the dependent
Login: By providing the necessary information, the
variable. [19]
candidate can access the sub-sections.
The dependent variable is, to put it simply, a binary
Test: A personality and aptitude test can be taken online
variable, with data recorded as either 1 (which represents
following a successful login. If the applicant meets the
success/yes) or 0 (which represents failure/no). [19]
requirements established by the candidate Admin, they will
be able to view the job specifics and select the relevant A logistic regression model forecasts the value of P(Y=1)
position. as a function of X mathematically. One of the simplest ML
techniques, it can be used to solve a variety of categorization
View Results: The test taker can see the results after
problems. [19]
finishing it.
Logout: The candidate may exit the portal after viewing
the results.
383
Authorized licensed use limited to: Zhejiang University. Downloaded on November 24,2024 at 04:49:50 UTC from IEEE Xplore. Restrictions apply.
2023 International Conference on Disruptive Technologies (ICDT)
Fig.10. Logistic Regression Result We discuss the analysis of the results in this part. Below,
we go over the findings as example of the output.
E- Implementation
The dataset is split into training and testing halves. With
the help of the Standard Sklearn Library, the dataset is scaled
further. 30% of the test is testing, and 70% is training. The
dataset has 972 rows and 8 columns, and each row contains
the candidate's age and gender as well as one of the OCEAN
Model's five personality qualities. The rows include the
participant data.
384
Authorized licensed use limited to: Zhejiang University. Downloaded on November 24,2024 at 04:49:50 UTC from IEEE Xplore. Restrictions apply.
2023 International Conference on Disruptive Technologies (ICDT)
385
Authorized licensed use limited to: Zhejiang University. Downloaded on November 24,2024 at 04:49:50 UTC from IEEE Xplore. Restrictions apply.