Data ethics and
privacy
I N T R O D U C T I O N T O D ATA
Maarten Van den Broeck
Senior Content Developer at DataCamp
Basics of data ethics
Data can be impactful, but
It comes with responsibilities!
INTRODUCTION TO DATA
Basics of data ethics
Data can be impactful, but
It comes with responsibilities!
Organizations should ensure:
Data collection
INTRODUCTION TO DATA
Basics of data ethics
Data can be impactful, but
It comes with responsibilities!
Organizations should ensure:
Data collection
Data storage
INTRODUCTION TO DATA
Basics of data ethics
Data can be impactful, but
It comes with responsibilities!
Organizations should ensure:
Data collection
Data storage
Data usage
Protect customers' safety
Prevent legal issues
INTRODUCTION TO DATA
Principles of data ethics
Data ethics is constantly evolving
INTRODUCTION TO DATA
Principles of data ethics
Data ethics is constantly evolving
1
Five principles
1. Permission for data collection
2. Transparency about the plan
3. Privacy of data
4. Good intentions
5. Consider the outcome
1 Harvard Business School, 2021
INTRODUCTION TO DATA
Permission and transparency
Permission
Ask before collecting
Users are in control of their data
Transparency
Be transparent of how you plan to use,
store, and collect data
Lack of transparency may lead to
reputation and legal damage
INTRODUCTION TO DATA
Data privacy
Privacy refers to secluding (information
about) yourself
Personal identifiable information (PII) must
remain confidential
Full name
Date of birth
Bank account
ID card/Passport number
Phone number
...
INTRODUCTION TO DATA
Privacy protection
Strong passwords
Up-to-date operating systems
Cautionary Internet browsing
Individual responsibility
Preventing data breaches:
Limit sharing sensitive data
(Pseudo-)data anonymization
INTRODUCTION TO DATA
Intentions and outcome
Intentions
Data is collected for the right reasons
Question yourself about the reasons you
collect data
Outcome
Are there consequences of my actions?
Protecting vulnerable populations
INTRODUCTION TO DATA
Let's practice!
I N T R O D U C T I O N T O D ATA
Data life cycle
I N T R O D U C T I O N T O D ATA
Maarten Van den Broeck
Senior Content Developer at DataCamp
What is the data life cycle?
Planning and collecting
Storing and managing
Cleaning and processing
Analyzing and visualizing
Sharing
Archiving/destroying
INTRODUCTION TO DATA
Why is the data life cycle important?
Ensure data is regulated responsibly
Identify potential areas for improvement
Improve efficiency and effectiveness of
operations
INTRODUCTION TO DATA
Plan and collect
Planning for other stages of the data life
cycle
Sharing of roles and responsibilities
To achieve optimal results in terms of time
and cost
Collect or create data?
INTRODUCTION TO DATA
Store and manage
Data stored in databases or data
warehouses
Easily accessible to the right person
Removal of PII
INTRODUCTION TO DATA
Clean and process
Formatting data
Missing data
Transforming data
INTRODUCTION TO DATA
Analyze and visualize
Analyzing raw data for new insights
Data is easier to interpret when visualized
Analyzing and visualizing data is done in
various ways
INTRODUCTION TO DATA
Share
Communicate your results
Dashboards, reports, papers
INTRODUCTION TO DATA
Archive or destroy
Decide if data should be kept or deleted
Data archiving:
Data backups
Documentation
Digitizing
Data destruction:
In rare cases
Protecting private information
Resources can be freed up
INTRODUCTION TO DATA
Let's practice!
I N T R O D U C T I O N T O D ATA
Common data
mistakes
I N T R O D U C T I O N T O D ATA
Maarten Van den Broeck
Senior Content Developer at DataCamp
Common mistakes about data
INTRODUCTION TO DATA
Common mistakes about data
Not having a clear goal or question
INTRODUCTION TO DATA
Common mistakes about data
Not having a clear goal or question
Insufficient or wrong data
INTRODUCTION TO DATA
Common mistakes about data
Not having a clear goal or question
Insufficient or wrong data
Lack of appropriate analysis
INTRODUCTION TO DATA
Common mistakes about data
Not having a clear goal or question
Insufficient or wrong data
Lack of appropriate analysis
No clear communication of results
Carefully plan the data analysis process
INTRODUCTION TO DATA
Not clearly defining the problem
"Did you buy anything in the last month?"
"Where did you make your last purchase?"
"Which payment method did you use?"
May lead to inappropriate data collection,
analysis, and conclusions
INTRODUCTION TO DATA
Insufficient or wrong data
Data bias: the data sample doesn't represent
all the data
Collecting the wrong data doesn't allow
you to answer the research question
Data still needs cleaning before analysis
INTRODUCTION TO DATA
Lack of appropriate analysis
Jumping to conclusions too quickly
Lack of context: a missing reason
explaining the results
Other examples include
Incorrect aggregations and calculations
Confusing correlation with causation
INTRODUCTION TO DATA
No clear communication of results
Most valuable part of data life cycle
Could lead to misunderstandings or
incorrect conclusions
Examples:
Too technical
Cherry-picking data points
Unclear visualizations
INTRODUCTION TO DATA
Let's practice!
I N T R O D U C T I O N T O D ATA
Wrap-up
I N T R O D U C T I O N T O D ATA
Maarten Van den Broeck
Senior Content Developer at DataCamp
Chapter 1
(Un)structured data
Qualitative/quantitative data
Data context
Data is valuable in various fields
Data-driven decision making
Changes in data storage
INTRODUCTION TO DATA
Chapter 2
DIKW framework
Decision making process
Data aggregations
INTRODUCTION TO DATA
Chapter 3
Data ethics
Data life cycle
Common data mistakes
Understanding GDPR
Understanding Data Visualization
Introduction to Data Literacy
Communication Data Insights
INTRODUCTION TO DATA
Congratulations!
I N T R O D U C T I O N T O D ATA