Aniket PDF
Aniket PDF
Aniket PDF
E-mail: aniket.diat@gmail.com
Experience
Data Scientist CredR, Mumbai
Education
M.Tech, Modelling and Simulation,
Defence Institute of Advanced Technology, Pune, India
Percentage 69.71
May 2013
Percentage 56.5
June 2009
Projects
At CredR
Recommendation System : Designed recommendation system for suggesting
bikes for CredR. This involves analysing features of bikes in which purchaser
can be interested and recommending bikes on that basis. This project involves
design, development and coding in python and postgresql. This system is Item
based recommendation system for bike recommendation.
Analysing Business data : This task involves statistical analysis bike sales data
over period of time. The data is processed using machine learning techniques
like clustering, PCA. Then this data is visualised and meaningful insights are
found which helps in taking the business decisions.
Accelerometer data analysis and bike state prediction: It involves capturing
accelerometer data gathered by mobile app. Data consist of accelerometer and
gyroscope reading in x,y,z direction Then this data is processed to estimate the
state of bike like normal ride, bumpy ride,continous brakes etc. By applying ML
technique the approximate time at which brake, bump occurred are found out .
The results from it are used to give recommendation to rider.
At Tiger Analytics
Sales Analytics: Inspecting opportunity in loss reasons using NLP, Finding out
most frequent loss reasons and analyse it on basis of quantities and revenue.
Weighting lose reasons. Analysing the association between product sales.
Reporting valuable Insights got through Data Analysis.
Pega System Training: Completed training Pega Decision enablement
process which include Next-Best-Action decision strategies real-time
interactions and simulation activities.
Exploratory Analysis: It is internal project with aim to reduce data
exploration and cleaning time. Designed Data Exploration tools in R
which helps to graphically inspect features/attributes in dataset. It helps
to summarize data at feature level enable studying its statistical
properties like mean, mode, median, most frequent observations. This
package also uses different techniques for outlier detection and missing
value imputation.
Business Intelligence: This project involve analysing sales and purchase data
across different merchants, clients using Teradata SQL. The main tasks in this
project include Categorising merchants on basis of different products sales,
volume of sale, geography of sale, quantity of product sale. Searching for the
measure changes in pattern of sale. Inspecting cross border sales/ purchase,
Product categories. Understanding products and associated buyers. Inspecting
behaviour of buyer across all segments.
Cost Optimisation: The aim of this POC was to Identify cost savings
opportunities in ordering parts/components through data analytics. Dataset
consist of orders of different manufacturing parts required for the networking
product. These parts were ordered from different manufactures across different
countries from several manufacturers. The main tasks done in these project
involve building a forecasting model to predict future cost of parts based on
historical data. It also involves Correlation analysis, Cost saving through
distributing order to different manufacturer.
At CDAC, Pune
Knowledge based elucidation of tertiary structure of protein on their
function algorithm development and web hosting of prediction server.
This project involve designing, developing and testing data mining and machine
learning algorithms for various classification and regression datasets. The
AWARDS/ Scholarship
Top 10% in Kaggle Africa Soil Property Prediction Challenge
DST ( Department of Science and Technology) sponsored Fellowship. CDAC
Pune, India. 2013 2014
DRDO ( Defence Research & Development Organization, Ministry of Defence)
Sponsored Scholarship. 2011-2013
Skills- R, Core Java , C, C++, TeradataSQL
Statistical Environments: Octave/ Matlab
Popular Libraries/ tools: Weka, LibSVM , Random forest
Linear Regression, Logistic regression, Machine
Learning, Probabilistic modelling
Typesetting: LATEX, MS word, Ms Excel
Aniket Gurav