What is Data
Science?
DATAS CIE N CE F O R B U S IN E S S LE
A DE R S
Mari Nazary
VP of Content, DataCamp
Let's ask
Google!
DATA SCIENCE FOR BUSINESS
Making data work for
you
DATA SCIENCE FOR BUSINESS
What can data
do?
Describe the current state of an organization or process
Detect anomalous events
Diagnose the causes of events and behaviors
Predict future events
DATA SCIENCE FOR BUSINESS
Why
now?
DATA SCIENCE FOR BUSINESS
Why
now?
DATA SCIENCE FOR BUSINESS
Why
now?
DATA SCIENCE FOR BUSINESS
The data science
workflow
Data collection Exploration and visualization
Experimentation and
prediction
DATA SCIENCE FOR BUSINESS
Let's
practice!
DATAS CIE N CE F O R B U S IN E S S
A DE R S
LE
Applications of
Data
DATAS CIE N CE F O R B U S IN E S S LE
A DE R S
Science
Mari Nazary
VP of Content, DataCamp
More case
studies
Traditional machine learning
Internet of Things (IoT) Deep
Learning
DATA SCIENCE FOR BUSINESS
Case study: fraud
detection
DATA SCIENCE FOR BUSINESS
Case study: fraud
detection Amount Date Type ...
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
DATA SCIENCE FOR BUSINESS
What do we need for machine
learning?
A well-dehned question
"What is the probability that this transaction is
fraudulent?"
A set of example data
Old transactions labeled as "fraudulent" or "valid"
A new set of data to use our algorithm on
New credit card transactions
DATA SCIENCE FOR BUSINESS
Case study: smart
watch
DATA SCIENCE FOR BUSINESS
Internet of
Things
Smart watches
Internet-connected home security systems
Electronic toll collection systems
Building energy management systems
Much, much more!
DATA SCIENCE FOR BUSINESS
Case study: image
recognition
DATA SCIENCE FOR BUSINESS
Case study: image
recognition
DATA SCIENCE FOR BUSINESS
Deep
learning
Many neurons work together
Requires much more training data
Used in complex problems
Image classihcation
Language learning/understanding
DATA SCIENCE FOR BUSINESS
Let's
practice!
DATAS CIE N CE F O R B U S IN E S S
A DE R S
LE
Building a data
science team
DATAS CIE N CE F O R B U S IN E S S LE
A DE R S
Mari Nazary
VP of Content, DataCamp
Members of your
team
DATA SCIENCE FOR BUSINESS
Data
engineering Information architects
Build storage solutions
Maintain data access
DATA SCIENCE FOR BUSINESS
Data engineering
tools SQL
Storing large quantities of data
Java, Scala, or Python
Programming languages for processing data
and automating tasks
DATA SCIENCE FOR BUSINESS
Data
analysis Creating dashboards
Hypothesis testing
Data visualization
DATA SCIENCE FOR BUSINESS
Data analysis
tools Spreadsheets (Excel or Google Sheets)
Simple storage and analysis
SQL
Large-scale analysis
BI Tools (Tableau, Power BI, Looker)
Dashboarding and sharing information
DATA SCIENCE FOR BUSINESS
Machine
learning Predictions and extrapolations
Classihcation
Stock price prediction
Image processing
Automated text analysis
DATA SCIENCE FOR BUSINESS
Machine learning
tools Python and R
Programming languages for creating
predictive models
DATA SCIENCE FOR BUSINESS
Review: members of your
team
Data Engineer Data Analyst Machine Learning Scientist
Store and maintain data Visualize and describe data Model and predict with data
SQL + Java/Scala/Python SQL + BI Tools + Spreadsheets Python/R
DATA SCIENCE FOR BUSINESS
Data science team
structure
Isolated
Embedded
Hybrid
DATA SCIENCE FOR BUSINESS
Team structure:
isolated
DATA SCIENCE FOR BUSINESS
Team structure:
embedded
DATA SCIENCE FOR BUSINESS
Team structure:
hybrid
DATA SCIENCE FOR BUSINESS
Let's
practice!
DATAS CIE N CE F O R B U S IN E S S
A DE R S
LE