0% found this document useful (0 votes)

32 views2 pages

Data Science Assessment Task

The document outlines an assessment task for data science involving the development of a model to predict semantic similarity between pairs of text paragraphs, with a scoring range from 0 to 1. Candidates are required to build and deploy this model as a Server API Endpoint, providing specific request and response formats. The final submission must include the live API endpoint, complete code, a short report, and an updated resume, with a completion deadline of three days from receipt of the task.

Uploaded by

Anmol

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views2 pages

Data Science Assessment Task

Uploaded by

Anmol

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

ASSESSMENT FOR DATA SCIENCE

PROBLEM STATEMENT

Dataset (attached with the task): The data contains a pair of paragraphs. These text paragraphs are
randomly sampled from a raw dataset. Each pair of sentences may or may not be semantically similar.
The candidate is to predict a value between 0-1 indicating the similarity between the pair of text paras.
A sample of a similar dataset will be used as test data, therefore it’s crucial to the model solution using
provided dataset.

Part A

Build an algorithm/model that can quantify the degree of similarity between the two text-based on
Semantic similarity. Semantic Textual Similarity (STS) assesses the degree to which two sentences are
semantically equivalent to each other.
1 means highly similar

0 means highly dissimilar

Part B

Deploy the Algorithm/Model built-in Part A in any cloud service provider. Your final algorithm should be
exposed as a Server API Endpoint. In order to test this API make sure you hit a request to the server to
get the result as a response to the API. The request-response body should be in the following format:

Request body: {“text1”: ”nuclear body seeks new tech .......”, ”text2”: ”terror suspects face arrest ......”}
Response body: {“similarity score”: 0.2 }
Note: “text1”, “text2”, and “similarity score” keys should be kept as it is, without any change.

THE FINAL SUBMISSION MUST INCLUDE THE FOLLOWING -

• - Live API endpoint(IP Address of hosted app) of the Algorithm Deployed on the Server
• - Complete Code for Part A and Part B (.py files)
• - 1-2 page short Report explaining only the core approach taken in Part A and Part B.
• - Your updated resume with contact number

INSTRUCTIONS

• - Use only Python programming language

• - The correctness of similarity scores on test data will be evaluated from the results obtained

from the Server Response.

• - Task evaluation is equally based on both Part A and Part B. Finally delivery of task A is

through task B itself. Therefore it’s mandatory to attempt both parts.

• - Please ensure the structure of the API endpoint is as per requirement.

• - Code must be well commented
dataneuron.ai | mail@dataneuron.ai |
• - Use any approach to solve algorithms using Statistical models Machine Learning or Deep

Learning

• - Use any cloud service providers to deploy solutions eg. Azure, GCP, AWS, Heroku, etc.
• - Candidates will be judged on three criteria namely the Model/Algo approach, Successfully

deployed API, and API response results on test data.

• - Time duration: 3 days from the day of receiving the task.

NOTE:

1. The given dataset does not contain any label. Therefore, can be treated as an unsupervised
learning problem. However, this does not imply that supervised techniques/algorithms are not
applicable. The candidate is free to use any technique.
2. Please attach your updated resume and contact information with the submission mail.
3. Your time should start from when this task was sent to you.
4. If you intend to take more than 3 days, you may do so without permission. However, it would be
appreciated if you state the reasons for the delay in your report.
5. Every step in the task is self-explanatory to the best of our knowledge. If any part is unclear, use
your best judgment and mention it in your report.
6. Your project will not be used for the benefit of the company in any manner. The intention of this
task is ONLY to evaluate your skills.
7. Your submission will showcase your skills and knowledge of the said field and help us evaluate
your candidature in a better manner, so kindly try to keep the work as original as possible.
8. The deployed server can be closed after the final results are announced. We recommend that
candidates should use freely available resources only to deploy their APIs.
9. Final submission must be sent at aviral.saw@dataneuron.ai and cc mail@dataneuron.ai
Submissions via any other platform will not be considered.
10. We wish you all the best!

VenkataRamana - Data Scientist - 5Y
No ratings yet
VenkataRamana - Data Scientist - 5Y
3 pages
Tue+Sep+20+23 56 35+GMT+05 00+2022
No ratings yet
Tue+Sep+20+23 56 35+GMT+05 00+2022
1 page
Question Answering Systems For Customer Relationship Management
No ratings yet
Question Answering Systems For Customer Relationship Management
6 pages
67a050ecd4b14 Unstop AIML Intership Assessment
No ratings yet
67a050ecd4b14 Unstop AIML Intership Assessment
1 page
Get A Job
No ratings yet
Get A Job
1 page
AI Project Challenges for Developers
No ratings yet
AI Project Challenges for Developers
6 pages
AIML Developer - Assignment (Level 1) - 250607 - 120042
No ratings yet
AIML Developer - Assignment (Level 1) - 250607 - 120042
4 pages
AI Recruit
No ratings yet
AI Recruit
7 pages
JD For - Data Science Intern - 1
No ratings yet
JD For - Data Science Intern - 1
2 pages
Online Assignment Plagiarism Check
No ratings yet
Online Assignment Plagiarism Check
5 pages
Artificial Intelligence Internship - JD
No ratings yet
Artificial Intelligence Internship - JD
1 page
AI-Powered Coding Interview Prep
No ratings yet
AI-Powered Coding Interview Prep
3 pages
GrowthLink - DS
No ratings yet
GrowthLink - DS
8 pages
Software Problem Authoring For AI Research
No ratings yet
Software Problem Authoring For AI Research
2 pages
Machine Learning Project Guide
No ratings yet
Machine Learning Project Guide
11 pages
Assignment Data Science
No ratings yet
Assignment Data Science
6 pages
Python Engineer Problem Statements
No ratings yet
Python Engineer Problem Statements
5 pages
AI Engineer
No ratings yet
AI Engineer
3 pages
JD Data Scientist 25
No ratings yet
JD Data Scientist 25
2 pages
Lang Chain
No ratings yet
Lang Chain
11 pages
JD - Sr. Data Scientist
No ratings yet
JD - Sr. Data Scientist
2 pages
Final Exam
No ratings yet
Final Exam
3 pages
RAI AI Engineer Intern Assignments
No ratings yet
RAI AI Engineer Intern Assignments
3 pages
NLP Text Summarization App
No ratings yet
NLP Text Summarization App
3 pages
ChatGPT Data Science Prompts
80% (15)
ChatGPT Data Science Prompts
67 pages
Myresume
No ratings yet
Myresume
2 pages
SSP-Data Science-TaskList
No ratings yet
SSP-Data Science-TaskList
2 pages
Problem Statements For Intel Unnati Industrial Training 2025
No ratings yet
Problem Statements For Intel Unnati Industrial Training 2025
13 pages
Neeraj - Singh CV
No ratings yet
Neeraj - Singh CV
4 pages
Vijayi WFH Tech - Assignment - AI Internship - Jan 2025
No ratings yet
Vijayi WFH Tech - Assignment - AI Internship - Jan 2025
3 pages
Dnyaneshwar Data Scientist CV
No ratings yet
Dnyaneshwar Data Scientist CV
1 page
Assignment For AI Writer Role-1
No ratings yet
Assignment For AI Writer Role-1
3 pages
Task 1 ML
No ratings yet
Task 1 ML
7 pages
Aspireit - Artificial Intelligence Engineer
No ratings yet
Aspireit - Artificial Intelligence Engineer
4 pages
Job Description - Gen AI Developer
No ratings yet
Job Description - Gen AI Developer
2 pages
New Text Document
No ratings yet
New Text Document
3 pages
Naukri YogendraVerma (6y 6m)
No ratings yet
Naukri YogendraVerma (6y 6m)
3 pages
Project - Restaurant Rating Prediction: Problem Statement
No ratings yet
Project - Restaurant Rating Prediction: Problem Statement
3 pages
MLOps Task
No ratings yet
MLOps Task
2 pages
AI Assignment - M25
No ratings yet
AI Assignment - M25
3 pages
JD Review - Python Developer - DM 4065
No ratings yet
JD Review - Python Developer - DM 4065
1 page
Genpact - Research Data Scientist
No ratings yet
Genpact - Research Data Scientist
3 pages
Data Mining Competition Guide
No ratings yet
Data Mining Competition Guide
5 pages
Computer Science
No ratings yet
Computer Science
2 pages
Sample Template - Advance Data Science Students
No ratings yet
Sample Template - Advance Data Science Students
3 pages
Report 12
No ratings yet
Report 12
40 pages
Lemur Astrologer Coding
No ratings yet
Lemur Astrologer Coding
28 pages
Updated JD - Data Science Intern at Ai Palette
No ratings yet
Updated JD - Data Science Intern at Ai Palette
2 pages
Hack - To - Hire - Case Study - Data Science
No ratings yet
Hack - To - Hire - Case Study - Data Science
2 pages
Updated JD For Python Fresher
No ratings yet
Updated JD For Python Fresher
2 pages
Sushanth Resume
No ratings yet
Sushanth Resume
1 page
Automated ML
No ratings yet
Automated ML
4 pages
Investment Predictions
No ratings yet
Investment Predictions
5 pages
Resume Karun Sharma
No ratings yet
Resume Karun Sharma
1 page
MLE JobDescription
No ratings yet
MLE JobDescription
2 pages
Nouman CV
No ratings yet
Nouman CV
6 pages
AI Role Assignment 4
No ratings yet
AI Role Assignment 4
2 pages
Senior Data Scientist Role Overview
No ratings yet
Senior Data Scientist Role Overview
3 pages
Class 12 Pandas Practical Guide
No ratings yet
Class 12 Pandas Practical Guide
15 pages
Chemistry (Answer Key)
No ratings yet
Chemistry (Answer Key)
104 pages
KEOFITT BASIX CIP Sampling Valve
No ratings yet
KEOFITT BASIX CIP Sampling Valve
1 page
Segmental Tunnel Lining Design
67% (3)
Segmental Tunnel Lining Design
90 pages
A Common-Sense Pragmatic Theory of Truth
No ratings yet
A Common-Sense Pragmatic Theory of Truth
19 pages
Bridging Particle Size Distribution in Drilling Fluid and Formation Damage
No ratings yet
Bridging Particle Size Distribution in Drilling Fluid and Formation Damage
11 pages
MATH2404: Introduction To Probability Theory: Edition Prescribed
No ratings yet
MATH2404: Introduction To Probability Theory: Edition Prescribed
2 pages
(Answer Key) Math Final Exam Review
No ratings yet
(Answer Key) Math Final Exam Review
15 pages
5b. Ion Exchange Controlled Dds
No ratings yet
5b. Ion Exchange Controlled Dds
37 pages
Lesson Plan Academic Year: 2018-19 2018/Univ/Eee/Lp
No ratings yet
Lesson Plan Academic Year: 2018-19 2018/Univ/Eee/Lp
22 pages
Bus Ticket Reservation
100% (1)
Bus Ticket Reservation
20 pages
12-Threads - Multicore Programming - Multithreading Models - Thread Libraries - Implicit Threading-07!08!2024
100% (1)
12-Threads - Multicore Programming - Multithreading Models - Thread Libraries - Implicit Threading-07!08!2024
14 pages
Seismic Innovations for Geophysicists
No ratings yet
Seismic Innovations for Geophysicists
14 pages
Gujarat Boiler Examination Board Boiler Operation Engineer Exam-2017
No ratings yet
Gujarat Boiler Examination Board Boiler Operation Engineer Exam-2017
10 pages
U1A
No ratings yet
U1A
7 pages
Crystal Field Theory Explained
No ratings yet
Crystal Field Theory Explained
20 pages
YLAA Installation
No ratings yet
YLAA Installation
62 pages
W268.01 DLV FDR WWTP - & - Disposal Palembang
100% (1)
W268.01 DLV FDR WWTP - & - Disposal Palembang
88 pages
Impact of Runway Capacity On Flight Efficiency and Delay.
No ratings yet
Impact of Runway Capacity On Flight Efficiency and Delay.
73 pages
Fallacious Appeal to Authority Guide
No ratings yet
Fallacious Appeal to Authority Guide
4 pages
LC1D09M7: Product Data Sheet
No ratings yet
LC1D09M7: Product Data Sheet
4 pages
Ma1200 HW2
No ratings yet
Ma1200 HW2
5 pages
SPE 62922 History Matching Geostatistical Reservoir Models With Gradual Deformation Method
No ratings yet
SPE 62922 History Matching Geostatistical Reservoir Models With Gradual Deformation Method
13 pages
User's Guide: NWA/WAC Series
No ratings yet
User's Guide: NWA/WAC Series
242 pages
Grid Trading of Coin
No ratings yet
Grid Trading of Coin
14 pages
OpenGL Shadow Simulation Project
63% (8)
OpenGL Shadow Simulation Project
43 pages
Introduction To Scanning Tunneling Microscopy 3rd Edition C Julian Chen Available Any Format
No ratings yet
Introduction To Scanning Tunneling Microscopy 3rd Edition C Julian Chen Available Any Format
141 pages
Ai in Two Dimensional Random Variable
No ratings yet
Ai in Two Dimensional Random Variable
12 pages
Computer 7 1Q Learning Module
No ratings yet
Computer 7 1Q Learning Module
18 pages
XPD FRC
No ratings yet
XPD FRC
11 pages

Data Science Assessment Task

Uploaded by

Data Science Assessment Task

Uploaded by

ASSESSMENT FOR DATA SCIENCE

0 means highly dissimilar

THE FINAL SUBMISSION MUST INCLUDE THE FOLLOWING -

• - Use only Python programming language

from the Server Response.

through task B itself. Therefore it’s mandatory to attempt both parts.

• - Please ensure the structure of the API endpoint is as per requirement.

deployed API, and API response results on test data.

• - Time duration: 3 days from the day of receiving the task.

You might also like