Ensemble Machine Learning Models in Predicting Personality Traits and Insights using Myers-Briggs Dataset
Prasanna Kumar R, Bharathi Mohan G, Gudivada Dhyana sai*

Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Chennai, India

E-mail: r_prasannakumar@ch.amrita.edu, g_bharathimohan@ch.amrita.edu, dhyanasaigudivada@gmail.com

Abstract- Personality prediction refers to the use of machine learning techniques to predict an individual's personality traits based on various sources of data, such as text, images, and social media usage. Personality traits refer to persistent patterns of behaviors, thoughts, and feelings that differentiate one individual from another. This work predicts people's personality traits from their social media posts using various machine learning models. With the help of this model, a person's personality can be classified based on the 16 categories of Myers-Briggs personality types. With the availability of a huge amount of data on human behavior and personality traits, it is possible to train a machine-learning model and predict the personality trait of a person. The ML model assesses the person based on their social media posts. The data consists of the posts from social media and the personality type to which a person belongs. The model uses the NLTK library to assess and pre-process the data. Four machine learning models were built: logistic regression, support vector machines (SVM), Naive Bayes, and random forest. Finally, we compare the machine learning model results to determine which one is best based on evaluation metrics (accuracy score, geometric mean score, ROC-AUC score). Furthermore, this can be used in the personalization of online advertisements and campaigns. Also, it can be used by social media companies to attract users based on their personality traits and preferences.

Keywords- Personality prediction, Myers-Briggs personality types, NLTK, logistic regression, SVM, Naive Bayes, and Random Forest.

I. INTRODUCTION

Personality is defined as the unique set of thinking, feeling, and behavioral patterns that vary between individuals. In modern times, it is common for people to have multiple social media accounts and post a large number of messages daily. Social media is often used as a platform for expressing one's views on personal matters such as family, psychological well-being, financial issues, interactions with society and the environment, and politics. In this work, an attempt is made to predict the personality of a person based on social media content. For this, we use the Myers-Briggs Type Indicator (MBTI) to classify people's personalities under various categories. The MBTI is one of the most popular personality tests in the world. The dataset contains 16 personality types across 4 characteristic traits, namely, introversion (I) and extraversion (E), intuition (N) and sensing (S), thinking (T) and feeling (F), and judging (J) and perceiving (P). When a person is found to be introverted, intuitive, thinking, and judging, he or she will be labelled as having an INTJ personality type.

The data pre-processing section is the most important part of this entire project since we are required to find important features from the huge dataset and train the machine learning models with it. We need to pre-process the data and extract important features from the social media posts given in the dataset. The data pre-processing step includes tokenization, lemmatization, sentiment analysis, Parts of Speech (POS) tagging, and vectorization. Furthermore, a comparison is made between different ML models, including logistic regression, SVM, random forest, and Naive Bayes. Then we find the best-performing model and use it on the testing data.

This model can be implemented for various applications. Some of them are the personalization of advertisement campaigns on the products sold by companies; use in recommender systems since personality traits are closely linked with user preferences; and use by social media companies to attract users based on their personality traits and

Authorized licensed use limited to: St. Petersburg State University. Downloaded on January 31,2025 at 19:41:03 UTC from IEEE Xplore. Restrictions apply.
preferences. In the upcoming sections, we discuss the implementation of the ML model, along with the challenges faced and how they were solved.

II. LITERATURE REVIEW

One study investigates the potential of using machine learning methods to predict a person's personality traits through their digital footprints, that is, the data collected from social media and online forums. The Random Forest algorithm demonstrated the highest performance, with an average accuracy of 62.5% across all five personality traits[1]. To enhance the precision of personality prediction, a deep learning model has been used that considers both textual and visual characteristics of social media data. The study employed a dataset comprising self-reported personality traits of 861 participants and a deep neural network model known as "Convolutional Recurrent Neural Network" (CRNN) to forecast the personality traits[2]. The analysis of personality through social media data can transform the domain of personality psychology by providing customized content suggestions and precise advertising, as well as inform further investigations into personality prediction based on social media data. It can also aid in the formulation of ethical standards for the utilization of personal data in both research and commercial applications[3]. It is possible to forecast certain aspects of an individual's personality using smartphone data. The research, based on information gathered from 123 participants, indicates that behavioral patterns concerning activity level, social rhythm, and mobility demonstrate a noteworthy correlation with personality traits like conscientiousness, openness, and extraversion. The study suggests that smartphone data can be utilized effectively to make predictions about an individual's personality[4]. Certain specific personality traits are more effective in forecasting job performance than general personality traits. This finding matters for the design of selection assessments and training programs in professional settings. By identifying the specific personality traits that correlate with job performance, organizations can tailor their recruitment processes and training initiatives accordingly[5]. The language patterns of 2,800 Twitter users were used to predict their personality traits using the Big Five personality traits model. The results showed that linguistic features predicted personality traits with moderate accuracy, with extraversion being the most strongly predicted trait[6]. Another study examined the potential for predicting personality traits of individuals by analyzing their language patterns in email messages through text mining. A sample of 379 individuals was used, and the Big Five personality traits model was applied to the collected email data. Machine learning algorithms were then used to predict personality traits based on email features. Results showed that personality traits could be predicted with moderate accuracy using email text data[7]. Data from social media, blogs, and essays have been used to predict personality traits through a combination of linguistic and machine learning methods. The authors employed a dimensional model of language, which allowed for a more nuanced analysis of language use beyond just the frequency of specific words[8]. Convolutional neural networks (CNNs) have been used to predict personality traits based on images posted on Instagram. The authors collected data from 80 Instagram users and applied the Big Five personality traits model to analyze the content of their images. The CNN-based approach yielded good prediction accuracy for the extraversion and openness traits, with accuracies of 69.9% and 74.2%, respectively[9]. Facebook status updates have been used to predict specific aspects of personality with higher accuracy than previous studies conducted on other digital media. The algorithm was able to predict scores on all five personality traits accurately, based solely on Facebook status updates[10]. Deep learning techniques have been used to predict personality traits from short texts, using a convolutional neural network (CNN) to learn representations of words and phrases and a Long Short-Term Memory (LSTM) network to model the sequences of these learned representations. The model was tested on two datasets: a set of tweets and a set of restaurant reviews. Results demonstrated that the deep learning model outperformed traditional machine learning models, such as Support Vector Machines (SVMs), in predicting personality traits[11]. It is possible to predict some aspects of personality from Twitter language use, although the accuracy varies depending on the trait. Using a regression model to predict each of the Big Five traits, Twitter usage data could predict personality traits with moderate accuracy[12]. The statistical model of topic modeling is utilized to identify concealed topics or keywords in a document
collection. Techniques like LSA, pLSA, and LDA are used, with LDA commonly used in multi-document summarization, providing better results than LSA when the number of features for sentence selection is increased[13]. The strategy of text document analysis shows potential for content summarization and involves two stages: text abstraction and text summarization. Automated text summarization uses NLP to extract significant information from related documents, and the cited study proposes a novel technique of ensemble topic vector clustering using SA for efficient processing and summarization[14]. The automation of machine learning model development is achieved through AutoML, which aims to increase productivity and reduce time. The cited study proposes a genetic algorithm-based AutoML model for network architecture search, with an evaluation in scenarios of binary classification and regression resulting in 98% accuracy[15]. While machine learning has shown increased accuracy in classification, the quality of features used has a significant impact on the predictive model outcomes. One study explores the impact of feature quality on heart disease prediction by employing RFECV with SVM, LR, DT, and RF algorithms. It was found that RF outperformed the other models, achieving a predictive accuracy of 99.7%[16].

III. METHODOLOGY

Data pre-processing is the most important part of our project, as it involves the conversion of unclean textual data into cleaned numerical data that can be used to train the ML model. We divided the data pre-processing step into four sub-sections: tokenization, lemmatization, sentiment analysis, and POS tagging. To perform these steps, we make use of the modules provided by the NLTK library. The dataset named "MTBI_1.csv" has been downloaded from Kaggle. The dataset consists of two columns, "type" and "posts". The "posts" column consists of various social media posts by different people in each row. For each person, the different posts they have made are separated by "|||". The dataset contains a total of 8675 rows, and there are no null values present.

A. Class Imbalance
We took a count of each type present in the dataset and found that there is an imbalance in classes, i.e., a few personality types have far fewer occurrences than others. If the class imbalance is left unaddressed, it will lead to biased results since the data is skewed. This class imbalance can be visualized in Fig. 1, which clearly shows that the class "INFP" has about 1832 rows of data, but the class "ESTJ" has about 39 rows of data.

Fig. 1. Visualization of class imbalance

To overcome this problem, we separate these 16 personality types into 4 binary class types, namely introversion (I), sensing (S), thinking (T), and judging (J). Assume a person is either an introvert or an extrovert; this is represented in the binary class "introversion" as 1 if the person is an introvert and 0 if the person is an extrovert. The same follows for the remaining three classes. We can then classify along these 4 binary classes and reduce the class imbalance to some extent.

B. Tokenization and Lemmatization
Tokenization refers to the procedure of dividing a text document, paragraph, sentence, or phrase into smaller and more straightforward components such as individual words or concepts, each of which is identified as a token. Meanwhile, lemmatization is an NLP technique that aims to reduce each word to its basic and meaningful form. We make sure that the stop words are removed in the tokenization step before lemmatizing. The posts contain web links, email IDs, symbols, etc., which are not required for sentiment scoring. So, we remove them using regular expressions (regex), and the words are also converted to lowercase. We also removed the words that were of length 1 or 2. Considering the size of the dataset, the lemmatization step took almost 30 minutes to complete. So, after completion, we saved the lemmatized and cleaned data as an additional column of the dataset in a separate CSV file.
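
The split of the 16 MBTI types into four binary targets described in subsection A can be sketched as follows. This is a minimal illustration rather than the authors' code: the function name `mbti_to_binary` and the dictionary keys are assumptions, and the encoding follows the text (1 for introversion, sensing, thinking, and judging; 0 for the opposite pole).

```python
# Hypothetical helper: map a four-letter MBTI type to four binary targets.
# Encoding follows subsection A: 1 = I, S, T, J; 0 = E, N, F, P.
POSITIVE_POLES = {
    "introversion": (0, "I", "E"),
    "sensing": (1, "S", "N"),
    "thinking": (2, "T", "F"),
    "judging": (3, "J", "P"),
}


def mbti_to_binary(mbti_type: str) -> dict:
    """Return one 0/1 label per binary class for a type such as 'INFP'."""
    code = mbti_type.strip().upper()
    if len(code) != 4:
        raise ValueError(f"expected a four-letter type, got {mbti_type!r}")
    labels = {}
    for name, (position, positive, negative) in POSITIVE_POLES.items():
        letter = code[position]
        if letter not in (positive, negative):
            raise ValueError(f"unexpected letter {letter!r} in {mbti_type!r}")
        labels[name] = 1 if letter == positive else 0
    return labels


print(mbti_to_binary("INFP"))
# {'introversion': 1, 'sensing': 0, 'thinking': 0, 'judging': 0}
```

Each of the four targets can then be modelled by its own binary classifier, which is how the per-dimension scores in the results tables are organized.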

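
The cleaning steps of subsection B (splitting on "|||", removing links and e-mail IDs, lowercasing, dropping stop words and words of length 1 or 2) can be sketched with the Python standard library alone. The regular expressions, the tiny `STOP_WORDS` set, and the `lemmatize` hook are illustrative assumptions; the paper relies on NLTK for these steps, and a lemmatizer such as NLTK's `WordNetLemmatizer().lemmatize` could be passed in as `lemmatize`.

```python
import re

URL_RE = re.compile(r"https?://\S+|www\.\S+")
EMAIL_RE = re.compile(r"\S+@\S+\.\S+")
NON_LETTER_RE = re.compile(r"[^a-z\s]")

# Tiny illustrative stop-word list (assumption); NLTK's English list is longer.
STOP_WORDS = {"the", "and", "are", "was", "for", "that", "this", "with", "you"}


def clean_posts(raw, lemmatize=lambda word: word):
    """Clean one row of the "posts" column and return a list of tokens."""
    tokens = []
    for post in raw.split("|||"):        # posts of one person are "|||"-separated
        text = URL_RE.sub(" ", post)     # drop web links
        text = EMAIL_RE.sub(" ", text)   # drop e-mail IDs
        text = NON_LETTER_RE.sub(" ", text.lower())  # lowercase, drop symbols and digits
        for word in text.split():
            if len(word) <= 2 or word in STOP_WORDS:  # skip short words and stop words
                continue
            tokens.append(lemmatize(word))  # identity unless a lemmatizer is supplied
    return tokens


row = "Check this https://example.com/video|||I am SO happy, mail me at a.b@mail.com!"
print(clean_posts(row))
# ['check', 'happy', 'mail']
```

Because lemmatization is the slow step, caching the cleaned tokens to disk after one pass, as the text describes, avoids repeating it for every experiment.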
C. Sentiment Scoring and Analysis
Sentiment analysis, which is sometimes called "opinion mining," is an NLP technique used to identify the emotional tone of a piece of text. Companies frequently use this technique to analyze customer opinions and classify them according to a particular product, service, or concept. Sentiment analysis involves utilizing machine learning (ML), data mining, and artificial intelligence (AI) to extract subjective information and analyze text for sentiment. We loaded the cleaned dataset obtained from the lemmatization step and proceeded with sentiment scoring. We used NLTK's VADER module to find the sentiment scores. We received four distinct scores: a compound score, a positive score, a negative score, and a neutral score. Since we were also using the Naive Bayes model, which cannot handle negative values, we had to rescale and normalize the data with a min-max scaler.

Fig. 2. The distribution of the 16 personality types vs. each of the 4 sentiment scores.

D. Parts of Speech (POS) tagging
Tagging is a kind of categorization, defined as the automated assignment of a descriptor to each token. The descriptor is called a tag, and it may represent several things, including parts of speech, semantic information, and so on. Part-of-speech (POS) tagging is a technique that involves assigning a specific part of speech to each word in a sentence or phrase. This process is crucial in natural language processing, as it helps to accurately identify the function and meaning of each word. POS tagging involves assigning one of the well-known parts of speech, such as nouns, verbs, adverbs, adjectives, pronouns, conjunctions, and their subclasses, to each word in the text. After POS tagging, the tagged words are grouped based on the 12 important categories of the Stanford list. For each row in the dataset, we calculate the average value for these 12 POS tags.

E. Vectorizing
As we are dealing with textual data, we need to convert it into a numerical data format for the ML model to train with the dataset. To achieve this, we make use of two well-known vectorizing methods, "TF-IDF vectorizer" and "Count vectorizer." The reason for choosing both is to find the best vectorizing method for the ML model to perform well. Both vectorizers are provided by the scikit-learn library.

Fig. 3. The box plot of word count vs. personality types.

TF-IDF stands for Term Frequency-Inverse Document Frequency, a statistical measure that evaluates the importance of a word to a document in a corpus: the score rises with the word's frequency in the document and falls with the number of documents that contain it. Count vectorizing is a method of representing a set of strings as frequencies or counts. TfidfVectorizer() is similar to CountVectorizer(), but it returns floats instead of integers. The significant difference between the two is that CountVectorizer() simply counts the frequency of words, while TfidfVectorizer() assigns each term a score based on its frequency and its significance across the corpus. Before vectorization, we also take a count of question marks, exclamation marks, colons, images, URL links, emojis, and unique words for each post.
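
The rescaling step at the end of subsection C can be illustrated with a small worked example. VADER's compound score lies in [-1, 1], so it has to be mapped to a non-negative range before a multinomial Naive Bayes model can use it. The sketch below applies the min-max formula x' = (x - min) / (max - min) directly; the function name `min_max_scale` and the sample scores are made up for illustration, and in the paper's setup a library min-max scaler plays this role.

```python
def min_max_scale(values):
    """Rescale a list of numbers to [0, 1] with x' = (x - min) / (max - min)."""
    lo, hi = min(values), max(values)
    if hi == lo:                      # constant column: avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Made-up compound sentiment scores for five rows (VADER's range is [-1, 1]).
compound = [-0.5, 0.0, 0.25, 0.75, 1.0]
scaled = min_max_scale(compound)
print(scaled)  # every value now lies between 0 and 1
```

The ordering of the rows is preserved; only the range changes, which is what makes the feature acceptable to a model that requires non-negative inputs.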

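
The per-row POS averages of subsection D can be sketched as below, under clearly stated assumptions. The input is taken to be already tagged (a list of posts, each a list of (word, tag) pairs with Penn Treebank tags, as a tagger such as NLTK's would produce). The paper does not list its 12 categories, so `TAG_GROUPS` is a hypothetical grouping covering only five of them, and "average" is read here as the mean count per post.

```python
from collections import Counter

# Illustrative grouping of Penn Treebank tags. The paper's 12 categories are
# not listed, so this mapping is an assumption covering only a few groups.
TAG_GROUPS = {
    "noun": {"NN", "NNS", "NNP", "NNPS"},
    "verb": {"VB", "VBD", "VBG", "VBN", "VBP", "VBZ"},
    "adjective": {"JJ", "JJR", "JJS"},
    "adverb": {"RB", "RBR", "RBS"},
    "pronoun": {"PRP", "PRP$", "WP", "WP$"},
}


def average_pos_counts(tagged_posts):
    """Mean number of words per post that fall into each POS group."""
    totals = Counter()
    for post in tagged_posts:
        for _word, tag in post:
            for group, tags in TAG_GROUPS.items():
                if tag in tags:
                    totals[group] += 1
    n_posts = max(len(tagged_posts), 1)   # guard against an empty row
    return {group: totals[group] / n_posts for group in TAG_GROUPS}


row = [
    [("she", "PRP"), ("writes", "VBZ"), ("long", "JJ"), ("posts", "NNS")],
    [("cats", "NNS"), ("sleep", "VBP"), ("quietly", "RB")],
]
print(average_pos_counts(row))
# {'noun': 1.0, 'verb': 1.0, 'adjective': 0.5, 'adverb': 0.5, 'pronoun': 0.5}
```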
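
The difference between count and TF-IDF vectorizing described in subsection E can be made concrete with a small standard-library sketch. It is not scikit-learn's implementation: tokenization is plain whitespace splitting, and the smoothed idf, ln((1 + n) / (1 + df)) + 1, followed by L2 normalisation is chosen to mirror what scikit-learn documents as its default behaviour, on the assumption that default settings were used. The function names are hypothetical.

```python
import math
from collections import Counter


def count_vectorize(docs):
    """Return (vocabulary, rows), where each row holds raw term counts."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for doc in tokenized for tok in doc})
    rows = []
    for doc in tokenized:
        counts = Counter(doc)
        rows.append([counts[term] for term in vocab])
    return vocab, rows


def tfidf_vectorize(docs):
    """Return (vocabulary, rows) with smoothed, L2-normalised TF-IDF weights."""
    vocab, count_rows = count_vectorize(docs)
    n_docs = len(docs)
    # Document frequency: the number of documents in which each term occurs.
    df = [sum(1 for row in count_rows if row[j] > 0) for j in range(len(vocab))]
    # Smoothed inverse document frequency.
    idf = [math.log((1 + n_docs) / (1 + d)) + 1 for d in df]
    rows = []
    for row in count_rows:
        weighted = [tf * w for tf, w in zip(row, idf)]
        norm = math.sqrt(sum(x * x for x in weighted)) or 1.0
        rows.append([x / norm for x in weighted])
    return vocab, rows


docs = ["love love music", "love quiet evenings", "hate loud music"]
vocab, counts = count_vectorize(docs)
_, tfidf = tfidf_vectorize(docs)
print(vocab)      # ['evenings', 'hate', 'loud', 'love', 'music', 'quiet']
print(counts[0])  # [0, 0, 0, 2, 1, 0]
print([round(x, 3) for x in tfidf[0]])
```

In the second document, "quiet" occurs in one document only while "love" occurs in two, so "quiet" receives the larger TF-IDF weight even though both have a raw count of 1. That is the sense in which TF-IDF scores terms rather than merely counting them.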
IV. IMPLEMENTATION

In the implementation phase, we discuss how we train and test the different machine learning models. We use four algorithms: logistic regression, support vector machine (SVM), Naive Bayes, and random forest. Our first step is to split the cleaned dataset into training and testing sets. We allocate 90% of the data for training and 10% for testing. As discussed earlier, the dataset suffers from class imbalance, which needs to be reduced to a certain extent before training the models. Imbalanced datasets are those where there is a severe skew in the class distribution, such as a ratio of 1:100 or 1:1000 between examples in the minority class and the majority class. This is a problem, as it is typically the minority class for which predictions are most important. One approach to addressing the problem of class imbalance is to randomly resample the training dataset. The Myers-Briggs Twitter dataset has undergone data pre-processing, including various text pre-processing techniques for feature extraction. Feature selection was carried out using k-best feature selection. The resulting features are used to train several machine learning models, including Logistic Regression, SVM, Multinomial Naive Bayes, and Random Forest Classifier. The ultimate goal is to use these models to predict personality traits, as shown in Fig. 4.

Fig. 4. Flow diagram of the proposed method

F. Random Under Sampling and k-Best Feature Selection
There are two primary methods for randomly resampling an imbalanced dataset: undersampling, which involves deleting examples from the majority class, and oversampling, which involves duplicating examples from the minority class. In our implementation, we have utilized the random under sampler provided by the imblearn library to reduce the class imbalance.

We have a lot of features and columns in the dataset after all these pre-processing steps. As a result, we use the SelectKBest module to select the k best features, where k is set to 10 by default. Finally, with the help of make_pipeline from scikit-learn, we build an ML pipeline consisting of the MinMaxScaler and SelectKBest functions. Applying data transformations, such as scaling or vectorizing, is a simple process when all input variables are of the same type. However, it can become challenging when the dataset contains mixed types and we need to selectively apply data transformations to certain input features but not all.

V. RESULT AND DISCUSSION

The accuracy scores for each class were obtained on the cross-validation dataset, which helps us choose the best of the eight pipelines (four classifiers, each combined with the two vectorizers); the accuracy scores are shown in Table 1. We found that the ML pipelines with TfidfVectorizer performed better than those with CountVectorizer, based on the classification metrics, accuracy scores, and ROC-AUC scores obtained, which are shown in Table 2. Of the four TfidfVectorizer pipelines, the one with the logistic regression classifier performed best overall. The accuracy scores of the logistic regression on testing data and the coefficient values of the model are displayed in Table 3 and Fig. 5.

TABLE 1. ACCURACY SCORES OF EACH CLASSIFIER ON TRAINING DATA

Personality type         Logistic Regression   SVM    Multinomial Naive Bayes   Random Forest Classifier
Introvert vs Extrovert   0.67                  0.66   0.63                      0.62
Intuition vs Sensing     0.68                  0.67   0.72                      0.61
Thinking vs Feeling      0.80                  0.80   0.76                      0.72
Judging vs Perceiving    0.64                  0.64   0.60                      0.56

TABLE 2. ROC-AUC SCORES OF EACH CLASSIFIER ON TRAINING DATA

Personality type         Logistic Regression   SVM    Multinomial Naive Bayes   Random Forest Classifier
Introvert vs Extrovert   0.73                  0.70   0.70                      0.67
Intuition vs Sensing     0.71                  0.71   0.72                      0.61
Thinking vs Feeling      0.89                  0.89   0.85                      0.80
Judging vs Perceiving    0.68                  0.68   0.67                      0.58

TABLE 3. ACCURACY SCORES OF LOGISTIC REGRESSION ON TESTING DATA

Personality type         Accuracy Score
Introvert vs Extrovert   0.6820
Intuition vs Sensing     0.6831
Thinking vs Feeling      0.7811
Judging vs Perceiving    0.6359

Fig. 5. Words with the highest coefficient values in each personality class type.

VI. CONCLUSION AND DISCUSSION

The results of these studies suggest that machine learning models can accurately predict certain aspects of personality from MBTI data, such as extraversion, openness, and agreeableness. However, predicting other aspects of personality, such as neuroticism, may be more challenging. Additionally, the accuracy of these models may be affected by factors such as the size and quality of the dataset, the choice of features and algorithms, and individual differences in personality expression. For training and testing, the logistic regression classification model is chosen, which is also later used for predicting the MBTI personality types in the web app. The class imbalance problem has been handled by turning the 16 classes into 4 binary classes and by using random under-sampling. Performance and accuracy scores can be improved further with deep learning models. The use of machine learning techniques for personality prediction using the MBTI dataset has the potential to provide valuable insights into individuals' personality traits, which can be useful for a variety of applications such as career counselling, team building, and personal development. This model can be implemented and used in various applications. Social media companies can use it to attract users based on their personality traits and preferences. Companies can advertise their products through personalized ads based on users' personality traits and behavioural preferences. Further research is needed to improve the accuracy and generalizability of these models and to explore their potential for real-world applications.

REFERENCES
[1] Wu, C., Lu, H., & Zhu, Y. (2021). Predicting personality traits from digital footprints using machine learning.
[2] Choi, Y., Jo, J., & Choi, S. (2019). Deep learning-based personality prediction using social media data.
[3] Farnadi, G., Sitaraman, G., & Moens, M. (2016). Personality prediction using Facebook data: A comprehensive review.
[4] Saeb, S., Lonini, L., Jayaraman, A., Mohr, D. C., & Kording, K. P. (2016). Predicting personality from patterns of behaviour collected with smartphones.
[5] Tett, R. P., Jackson, D. N., & Rothstein, M. (1991). Personality and job performance: The importance of narrow traits.
[6] Golbeck, J., Robles, C., & Turner, K. (2011). Predicting personality from Twitter.
[7] Quercia, D., Kosinski, M., Stillwell, D., & Crowcroft, J. (2011). Personality prediction based on text mining of email messages.
[8] Park, G., & Schwartz, H. A. (2015). Predicting personality from text using dimensional models of language. Journal of Personality, 83(3), 243-256.
[9] You, Q., Jin, H., & Luo, J. (2015). Predicting personality traits from Instagram images using convolutional neural networks. Proceedings of the ACM International Conference on Multimedia, 131-140.
[10] Kosinski, M., Stillwell, D., & Graepel, T. (2013). Predicting the Big 5 personality traits using Facebook status updates. Psychological Science, 24(4), 1-8.
[11] Dhingra, B., & Cohen, W. W. (2016). Deep learning for personality trait extraction from short texts. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 1572-1583.
[12] Golbeck, J., Robles, C., & Turner, K. (2011). Predicting personality traits from Twitter usage. Proceedings of the International Conference on Weblogs and Social Media, 1-10.
[13] Bharathi Mohan, G., and R. Prasanna Kumar. "A comprehensive survey on topic modeling in text summarization." 5th International Conference on Micro-Electronics and Telecommunication Engineering, Springer book series "Lecture Notes in Networks and Systems", 2021.
[14] Bharathi Mohan, G., Prasanna Kumar, R. (2023). Survey of Text Document Summarization Based on Ensemble Topic Vector Clustering Model. In: Joby, P.P., Balas, V.E., Palanisamy, R. (eds) IoT Based Control Networks and Intelligent Systems. Lecture Notes in Networks and Systems, vol 528. Springer, Singapore. https://doi.org/10.1007/978-981-19-5845-8_60
[15] C. Spandana, I. V. Srisurya, S. Aasha Nandhini, R. P. Kumar, G. Bharathi Mohan and P. Srinivasan, "An Efficient Genetic Algorithm based Auto ML Approach for Classification and Regression," 2023 International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT), Bengaluru, India, 2023, pp. 371-376, doi:
10.1109/IDCIoT56793.2023.10053442.
[16] Tsehay Admassu Assegie, Prasanna Kumar Rangarajan, Napa
Komal Kumar, & Dhamodaran Vigneswari. (2022). An
empirical study on machine learning algorithms for heart
disease prediction. International Journal of Artificial
Intelligence (IJ-AI), 11(3), 1066–1073.
https://doi.org/10.11591/ijai.v11.i3.pp1066-1073.
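
As a supplement to the random under-sampling step of Section IV-F, the idea can be sketched with the standard library. This is an illustration of the technique, not the imblearn sampler the paper uses: the function name `random_under_sample`, the fixed seed, and the toy data are assumptions. In line with the text, resampling would be applied to the training split only.

```python
import random
from collections import defaultdict


def random_under_sample(rows, labels, seed=0):
    """Randomly drop rows until every class has as many rows as the smallest one."""
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    target = min(len(items) for items in by_class.values())
    rng = random.Random(seed)            # fixed seed so the sketch is repeatable
    out_rows, out_labels = [], []
    for label, items in by_class.items():
        for row in rng.sample(items, target):   # sample without replacement
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


# Toy binary target: 8 introverts (1) and 3 extroverts (0).
rows = [f"post_{i}" for i in range(11)]
labels = [1] * 8 + [0] * 3
X_res, y_res = random_under_sample(rows, labels)
print(sorted(y_res))
# [0, 0, 0, 1, 1, 1]
```

The price of balance is discarded data: five of the eight majority rows are dropped here, which is why the text speaks of reducing the imbalance only to a certain extent.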

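
As a supplement to the evaluation metrics named in the abstract, the sketch below contrasts plain accuracy with the geometric mean score on a skewed binary target. The G-mean is computed here as sqrt(sensitivity x specificity), the usual binary definition; the function names and toy labels are illustrative, and the paper itself does not spell out how its scores were computed.

```python
import math


def accuracy(y_true, y_pred):
    """Share of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def geometric_mean_score(y_true, y_pred):
    """Binary G-mean: sqrt(sensitivity * specificity)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    pos = sum(t == 1 for t in y_true)
    neg = len(y_true) - pos
    sensitivity = tp / pos if pos else 0.0   # recall on class 1
    specificity = tn / neg if neg else 0.0   # recall on class 0
    return math.sqrt(sensitivity * specificity)


# Toy imbalanced case: 8 positives, 2 negatives, and a model that always predicts 1.
y_true = [1] * 8 + [0] * 2
y_pred = [1] * 10
print(accuracy(y_true, y_pred), geometric_mean_score(y_true, y_pred))
# 0.8 0.0
```

A classifier that ignores the minority class still reaches 80% accuracy here, while its G-mean is zero, which is the reason for reporting both on imbalanced data.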

No ratings yet
Polarization in Online Social Networks
19 pages
The Development of An Airborne Information Management System For Flight Test
No ratings yet
The Development of An Airborne Information Management System For Flight Test
14 pages
A Framework For Personality Prediction For E-Recruitment Using
No ratings yet
A Framework For Personality Prediction For E-Recruitment Using
5 pages
Optical Measurements at The Combustor Exit of The HIFiRE
No ratings yet
Optical Measurements at The Combustor Exit of The HIFiRE
8 pages
Tanzania Freight Transport & Shipping Report - Q4 2020
100% (1)
Tanzania Freight Transport & Shipping Report - Q4 2020
48 pages
Accuracy Analysis of The Extensile Manipulator of Satellite Antenna
No ratings yet
Accuracy Analysis of The Extensile Manipulator of Satellite Antenna
5 pages
Power Grids With Renewable Energy
100% (2)
Power Grids With Renewable Energy
597 pages
Egypt Infrastructure Report - Q4 2020
No ratings yet
Egypt Infrastructure Report - Q4 2020
61 pages
Bahrain Freight Transport & Shipping Report - 2020
No ratings yet
Bahrain Freight Transport & Shipping Report - 2020
32 pages
Applications For The Archinaut in Space Manufacturing and
No ratings yet
Applications For The Archinaut in Space Manufacturing and
14 pages
Greece Freight Transport & Shipping Report - 2020
No ratings yet
Greece Freight Transport & Shipping Report - 2020
43 pages
UAV Threat Detection with Radar Systems
No ratings yet
UAV Threat Detection with Radar Systems
33 pages
CCNP Security Identity Management SISE 300-715 Official Cert Guide
67% (3)
CCNP Security Identity Management SISE 300-715 Official Cert Guide
1,190 pages
AI-generated SWOT Analysis of Emerging Technologies in Air Transportation
No ratings yet
AI-generated SWOT Analysis of Emerging Technologies in Air Transportation
10 pages
Development of A Stabilizing Adaptive Feedback Control System For
No ratings yet
Development of A Stabilizing Adaptive Feedback Control System For
25 pages
RF Circuits and Applications
100% (4)
RF Circuits and Applications
378 pages
Printed Antennas
No ratings yet
Printed Antennas
463 pages
Concise Encyclopedia of Coding Theory
100% (1)
Concise Encyclopedia of Coding Theory
998 pages
China Infrastructure Report Q4 2020 PDF
No ratings yet
China Infrastructure Report Q4 2020 PDF
83 pages
Big Data Surveillance and Security Intelligence
No ratings yet
Big Data Surveillance and Security Intelligence
303 pages
Isolation-Aware 5G RAN Slice Mapping Over WDM
No ratings yet
Isolation-Aware 5G RAN Slice Mapping Over WDM
13 pages
Electronic Scanned Array Design
100% (2)
Electronic Scanned Array Design
357 pages
5G Wireless Systems
No ratings yet
5G Wireless Systems
454 pages
412th Test Wing
No ratings yet
412th Test Wing
58 pages
Thesis Presentation Analysis and Interpretation of Data
100% (3)
Thesis Presentation Analysis and Interpretation of Data
5 pages
Business Analytics and Decision Making
No ratings yet
Business Analytics and Decision Making
47 pages
Product Description: Optix 155/622H (Metro1000) Stm-1/Stm-4/Stm-16 MSTP Optical Transmission System V300R006
No ratings yet
Product Description: Optix 155/622H (Metro1000) Stm-1/Stm-4/Stm-16 MSTP Optical Transmission System V300R006
140 pages
Original Programming Manual PDM360 NG 12" / Touch: Runtime System V02.03.xx Codesys V2.3
No ratings yet
Original Programming Manual PDM360 NG 12" / Touch: Runtime System V02.03.xx Codesys V2.3
385 pages
Komatsu - CMB Ac Compressors Application List
No ratings yet
Komatsu - CMB Ac Compressors Application List
1 page
Vaibhav
No ratings yet
Vaibhav
1 page
SignedBirthCertificate 8318846 335
No ratings yet
SignedBirthCertificate 8318846 335
1 page
COVID-19 Hospital Resource Forecasts
No ratings yet
COVID-19 Hospital Resource Forecasts
3 pages
Or QB 191ma503 CSBS V Sem
No ratings yet
Or QB 191ma503 CSBS V Sem
40 pages
LNL Iklcqd /: Employee Share Employer Share Employee Share Employer Share
No ratings yet
LNL Iklcqd /: Employee Share Employer Share Employee Share Employer Share
2 pages
Practical Ip (1) - 1
No ratings yet
Practical Ip (1) - 1
5 pages
OpenGL Cube Drawing & VTK Overview
No ratings yet
OpenGL Cube Drawing & VTK Overview
25 pages
Metavers Report Final
No ratings yet
Metavers Report Final
44 pages
Samsung 55 - Smart 4K UHD Flat TV (RU7100) Price & Specs - Samsung SG PDF
No ratings yet
Samsung 55 - Smart 4K UHD Flat TV (RU7100) Price & Specs - Samsung SG PDF
18 pages
Clearance Form
No ratings yet
Clearance Form
2 pages
Advanced C Programming Tasks
No ratings yet
Advanced C Programming Tasks
17 pages
Price List 03.07.2014
No ratings yet
Price List 03.07.2014
5 pages
Finite Element Method 1
No ratings yet
Finite Element Method 1
10 pages
Urea Prilling Tower Design
100% (8)
Urea Prilling Tower Design
5 pages
Kedziora, Damian - Penttinen, Esko - Governance Models For Robotic Process Automation - The Case of Nordea Bank
No ratings yet
Kedziora, Damian - Penttinen, Esko - Governance Models For Robotic Process Automation - The Case of Nordea Bank
10 pages
Publisher: Korea Nepal Polytechnic Institute (KNPI)
No ratings yet
Publisher: Korea Nepal Polytechnic Institute (KNPI)
136 pages
Analysing The Effects of Lean Manufacturing Using VSM Based Simulation Generator
No ratings yet
Analysing The Effects of Lean Manufacturing Using VSM Based Simulation Generator
23 pages
Đề Cương Tiếng Anh 9 Hk II
No ratings yet
Đề Cương Tiếng Anh 9 Hk II
30 pages
CMP A2f
No ratings yet
CMP A2f
2 pages
DEH-X1650UB - DEH-X1650UBG - Owner Manual - QRD3207A
No ratings yet
DEH-X1650UB - DEH-X1650UBG - Owner Manual - QRD3207A
52 pages
Practical UML: A Hands-On Introduction For Developers: Use Case Diagrams
No ratings yet
Practical UML: A Hands-On Introduction For Developers: Use Case Diagrams
10 pages
Problem: Reciprocating Pump Example 1
No ratings yet
Problem: Reciprocating Pump Example 1
8 pages
Unit 5 - Invoke Conversion Functions and Conditional Expressions
No ratings yet
Unit 5 - Invoke Conversion Functions and Conditional Expressions
15 pages
Mosfet Differential Amplifier
No ratings yet
Mosfet Differential Amplifier
5 pages
Microgrid Energy Management Optimization
No ratings yet
Microgrid Energy Management Optimization
10 pages


