resumes with job posts and calculate a match score. Based on the match score, the resumes are ranked, significantly reducing the time and labor required to screen hundreds or thousands of resumes.

PROBLEM STATEMENTS
Tedious to manage and analyze a high volume of resumes
Traditional screening methods introduce human bias and subjectivity
Manual analysis and ranking are time-consuming and inefficient
OBJECTIVES
To build a custom NER model for parsing relevant
information from resumes and job posts
To build a system that compares resumes and job posts, calculates a match score, and ranks them

NLP & NER
Natural Language Processing (NLP) is a field of
artificial intelligence that enables machines to understand, interpret, and generate human language.
Named Entity Recognition (NER) is a subtask of NLP
that focuses on identifying and classifying specific entities (e.g., names, dates, locations) in text.
spaCy is a fast and efficient open-source library for NLP in Python. It provides tools for tasks like tokenization, part-of-speech tagging, dependency parsing, and NER.

METHODOLOGY

TOOLS & TECHNOLOGIES
● NLP Libraries: spaCy
● Programming Language: Python
● Machine Learning: Scikit-Learn, TensorFlow, PyTorch
● Database: MongoDB
● Deployment: AWS, Docker, or Google Cloud
● Development Tools: Jupyter Notebook, GitHub, VS Code
● Web Framework: Django, Flask, or Streamlit
● Other Libraries & Modules: Pandas, NumPy, Matplotlib, Seaborn, Pytesseract, Pillow, PyMuPDF, python-docx…
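As a quick illustration of spaCy's NER API, the sketch below runs a pretrained English pipeline over a sample sentence; it assumes the en_core_web_sm package is installed (the custom models trained later are loaded the same way, by passing their directory path to spacy.load):

    import spacy

    # Load a pretrained English pipeline (assumed to be installed via
    # `python -m spacy download en_core_web_sm`).
    nlp = spacy.load("en_core_web_sm")

    # Run the pipeline over a sample sentence and inspect the detected entities.
    doc = nlp("Jane Doe worked as a data scientist at Acme Corp in Kathmandu from 2019 to 2022.")
    for ent in doc.ents:
        print(ent.text, ent.label_)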
DATA COLLECTION

A resume dataset uploaded by Mr. Roman Shilpakar to Google Drive, containing 1,014 resumes
659 job descriptions collected from various online sources and compiled into a single text file
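Before any parsing can happen, each resume has to be reduced to plain text. A minimal sketch of that step, assuming the resumes arrive as PDF or DOCX files and using PyMuPDF and python-docx from the tools list (the file name in the usage comment is hypothetical):

    from pathlib import Path

    import fitz  # PyMuPDF
    from docx import Document  # python-docx

    def extract_text(path: str) -> str:
        """Return the raw text of a resume file (PDF, DOCX, or plain text)."""
        suffix = Path(path).suffix.lower()
        if suffix == ".pdf":
            # Concatenate the text of every page in the PDF.
            with fitz.open(path) as pdf:
                return "\n".join(page.get_text() for page in pdf)
        if suffix == ".docx":
            # Join the text of every paragraph in the Word document.
            return "\n".join(p.text for p in Document(path).paragraphs)
        # Fall back to reading the file as plain text.
        return Path(path).read_text(encoding="utf-8", errors="ignore")

    # Example (hypothetical file name):
    # print(extract_text("resumes/sample_resume.pdf")[:500])

Scanned resumes would additionally need OCR, which is what Pytesseract and Pillow on the tools list are for.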
DATA PREPROCESSING

The text file containing the job descriptions is manually annotated using an online NER annotator called “arunmozhi”
After annotation, a dataset is obtained in JSON format containing the custom annotations of the various entities.
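Before training, the exported annotations have to be converted into spaCy's binary .spacy format. A minimal sketch of that conversion, assuming the JSON export follows the common [text, {"entities": [[start, end, label], ...]}] layout used by this style of annotator; the file names train_data.json and train.spacy are hypothetical:

    import json

    import spacy
    from spacy.tokens import DocBin

    nlp = spacy.blank("en")   # blank English pipeline, used only for tokenization
    doc_bin = DocBin()

    # Load the exported annotations (hypothetical file name).
    with open("train_data.json", encoding="utf-8") as f:
        data = json.load(f)

    # Assumed layout: {"annotations": [[text, {"entities": [[start, end, label], ...]}], ...]}
    for text, annotation in data["annotations"]:
        doc = nlp.make_doc(text)
        spans = []
        for start, end, label in annotation["entities"]:
            span = doc.char_span(start, end, label=label, alignment_mode="contract")
            if span is not None:          # skip spans that do not align to token boundaries
                spans.append(span)
        doc.ents = spans
        doc_bin.add(doc)

    # Serialize to the format expected by spaCy's CLI training.
    doc_bin.to_disk("train.spacy")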
MODEL GENERATION

Using this dataset, a custom NER model is trained with spaCy’s command-line interface
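With spaCy v3, the training itself is a two-step CLI workflow: generate a config, then train on the converted .spacy files. The paths below are hypothetical:

    # Generate a base training config for an English NER-only pipeline
    python -m spacy init config config.cfg --lang en --pipeline ner

    # Train the model, writing checkpoints to ./output
    python -m spacy train config.cfg --output ./output --paths.train ./train.spacy --paths.dev ./dev.spacy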
MODEL EVALUATION

Training the custom NER model on the resume dataset of 1,014 resumes took over 3 hours and achieved a score of 85%
Training the custom NER model on the job post dataset of 659 job posts took over 2 hours and achieved a score of 70%
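The deck does not name the metric, but the reported score is presumably the overall NER F-score that spaCy computes on held-out data. A trained pipeline can be re-checked from the CLI, which prints precision, recall, and F-score per entity type; the paths are hypothetical:

    # Evaluate the best checkpoint on the held-out .spacy file
    python -m spacy evaluate ./output/model-best ./dev.spacy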
MODEL DEPLOYMENT

The custom NER models were saved to a local directory for deployment in the Python app
This stage involves applying the NER models to parse the resumes and job descriptions; the extracted entities will later be compared to calculate a match score, as sketched below
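A minimal sketch of that comparison step, assuming the two custom models were saved to hypothetical directories output_resume/model-best and output_jobpost/model-best and that both annotation schemes include a SKILLS label (the real labels depend on how the data was annotated):

    import spacy

    # Load the two custom NER models from their local directories (hypothetical paths).
    resume_nlp = spacy.load("output_resume/model-best")
    job_nlp = spacy.load("output_jobpost/model-best")

    def extract_skills(nlp, text, label="SKILLS"):
        """Return the set of lower-cased entity texts carrying the given label."""
        return {ent.text.lower() for ent in nlp(text).ents if ent.label_ == label}

    def match_score(resume_text, job_text):
        """Fraction of the job post's required skills that appear in the resume (0 to 1)."""
        resume_skills = extract_skills(resume_nlp, resume_text)
        job_skills = extract_skills(job_nlp, job_text)
        if not job_skills:
            return 0.0
        return len(resume_skills & job_skills) / len(job_skills)

    # Example: rank a batch of resumes for one job post by descending match score.
    # ranking = sorted(resume_texts, key=lambda r: match_score(r, job_post_text), reverse=True)

Skill overlap is only one simple choice of score; cosine similarity over embeddings of the extracted entities, or weighting by entity type, would fit the same interface.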
WORK PROGRESS

TASKS COMPLETED
We collected a text file containing 659 job descriptions
We manually annotated the job descriptions to create a dataset for training a model
We trained custom NER models on the publicly available resume dataset and the self-annotated job post dataset to parse key entities from resumes and job posts

SAMPLE OUTPUT (RESUME)

SAMPLE OUTPUT (JOB POST)

REMAINING TASKS