
365 Data Science Axs

Uploaded by youness bait

Course 1:

Introduction to Data and Data Science:

1. The Different Data Science Fields

*Course Introduction
*Why are there so many business and data science buzzwords?
*Analysis vs Analytics
*Intro to Business Analytics, Data Analytics, and Data Science
*Adding Business Intelligence (BI), Machine Learning (ML), and Artificial
Intelligence (AI) to the picture
*Traditional AI vs. Generative AI
*More Examples of Generative AI
*An Overview of our Data Science Infographic

2. The Relationship between Different Data Science Fields

*When are Traditional data, Big Data, BI, Traditional Data Science and ML
applied?

3. What is the Purpose of each Data Science field

*Why do we Need each of these Disciplines?

4. Common Data Science Techniques

*Traditional Data: Techniques


*Traditional Data: Real-life Examples
*Big Data: Techniques
*Big Data: Real-life Examples
*Business Intelligence (BI): Techniques
*Business Intelligence (BI): Real-life Examples
*Traditional Methods: Techniques
*Traditional Methods: Real-life Examples
*Machine Learning (ML): Techniques
*Machine Learning (ML): Types of Machine Learning
*Machine Learning (ML): Evolution and Latest Trends
*Machine Learning (ML): Real-life Examples

5. Common Data Science Tools

*Programming Languages & Software Employed in Data Science - All the
Tools You Need

6. Data Science Job Positions: What do they Involve and What to Look out for?

*Data Science Job Positions: What do they Involve and What to Look out
for?

7. Dispelling common Misconceptions

*Dispelling Common Misconceptions

Course 2:

Statistics:

1. Introduction
*What does the course cover
*Population vs sample

2. Descriptive Statistics Fundamentals

*Types of data and levels of measurement


*Levels of measurement
*Categorical Variables. Visualization techniques
*Numerical Variables. Frequency distribution table
*The histogram
*Cross table and scatter plot
*Mean, median, mode
*Skewness
*Variance
*Standard deviation and coefficient of variation
*Covariance
*Correlation
*Practical Example - Descriptive Statistics

3. Inferential Statistics Fundamentals

*Introduction
*What is a distribution
*The Normal Distribution
*The Standard Normal Distribution
*Central limit theorem
*Standard error
*Estimators and estimates

4. Confidence Intervals

*Definition of confidence intervals


*Population variance known, z-score
*Confidence Interval Clarifications
*Student's T Distribution
*Population variance unknown, t-score
*Margin of error
*Confidence intervals. Two means. Dependent samples
*Confidence intervals. Two means. Independent samples (Part 1)
*Confidence intervals. Two means. Independent samples (Part 2)
*Confidence intervals. Two means. Independent Samples (Part 3)
*Practical Example - Confidence Intervals

5. Hypothesis testing

*Null vs Alternative
*Further Reading on Null and Alternative Hypothesis
*Rejection region and significance level
*Type I error vs type II error
*Test for the mean. Population variance known
*p-value
*Test for the mean. Population variance unknown
*Test for the mean. Dependent samples
*Test for the mean. Independent Samples (Part 1)
*Test for the mean. Independent Samples (Part 2)
*Practical Example - Hypothesis Testing

Course 3:

Probability:

1. The Basics of Probability

*Course Introduction
*What is the Probability Formula
*Expected Values
*Probability Frequency Distribution
*Complements

2. Combinatorics

*Fundamentals of Combinatorics
*Computing Permutations
*Solving Factorials
*Variations with Repetition
*Variations without Repetition
*Combinations without Repetition
*Combinations with Repetition
*Symmetry of Combinations
*Combinations with Separate Sample Spaces
*Winning The Lottery
*Summary of Combinatorics
*Practical Example - Combinatorics

3. Bayesian Inference

*Sets and Events


*The Different Ways Events Can Interact
*The Intersection of Two Sets
*The Union of Two Sets
*Mutually Exclusive Sets
*Dependent and Independent Events
*Conditional Probability
*Law of Total Probability
*Additive Law
*Multiplication Rule
*Bayes Rule
*Practical Example - Bayesian Inference

4. Discrete Distributions

*An Overview of Distributions


*Types of Distributions
*Discrete Distributions
*Uniform Distribution
*Bernoulli Distribution
*Binomial Distribution
*Poisson Distribution

5. Continuous Distributions
*Continuous Distributions
*Normal Distribution
*Standardizing a Normal Distribution
*Student's T Distribution
*Chi-Squared Distribution
*Exponential Distribution
*Logistic Distribution
*Practical Example - Distributions

6. Probability in Other Fields

*Probability in Finance
*Probability in Statistics
*Probability in Data Science

Course 3:

Python Programmer Bootcamp:

1. Course Introduction

*What does the course cover


*Why code? Why Python?

2. Setting up the environment

*Installing Python
*Introducing Spyder

3. Python basics + Hands-on Coding

*PRINT function

4. Variables and strings - Exercises

*The solution
*Variables
*Strings
*Let's write a program
*Question 1
*Question 2
*Question 3
*Question 4
*Question 5

5. Conditionals

*Intro to conditionals
*Logical operators
*Not TRUE is FALSE
*IF statement
*More on strings
*More strings!
*Practice challenge

6. For Loops

*Intro to For loops


*A little more on variables
*Lists
*The power of lists and loops and working together
*Some list methods
*While loops
*Modulus
*Practical challenge

7. Dictionaries

*Modules
*Dictionaries
*Zip function
*More string methods
*Tuples
*More than 1 dimension
*Import counters (Part I)
*Import counters (Part II)

8. Files and Functions

*Files and functions


*Appending to a file
*More on functions
*Fibonacci in a function
*Recursion

9. Classes

*Objects
*Class
*__init__
*Class variables
*Methods
*Inheritance
*Practical exercise 1
*Practical exercise 2

10. General Tips

*General tips

11. Big O

*_Big_O
*O(n)
*O(n²)
*O(n!)
*Summary

12. Advanced Python: Caesar two sum


*Intro
*Caesar cipher
*Jupyter notebook intro
*Caesar cipher solution
*Two sum
*Two sum solution
*Card class
*Playing card class
*Playing card class solution

13. Advanced Python: Matplotlib

*Matplotlib
*Random walk explanation
*Simulating a random walk
*Sierpinski triangle
*Creating the triangle in Matplotlib
*Python imaging library

14. Advanced Python: Stacks

*Intro to stacks
*Coding a stack in Python
*Practical challenge - stacks
*Hint on solving the challenge
*Solving the stacks challenge
*Towers of Hanoi
*Some hints
*Code for Tower of Hanoi
*Enumerate sets & none

15. Search and sort

*Intro to search and sort


*Linear search
*Binary search
*Insertion sort

16. Advanced Python: Cards and Coins

*Intro to cards and coins


*Luhn's algorithm
*Coins
*Coins - Python code

17. Advanced Python: Debugging

*Debugging
*Beyond the PRINT function
*Debugging in Spyder

18. Advanced Python: Strings and Expressions

*A little more on strings


*Regular expressions
*More regular expressions
*Type hints

19. Capstone project - Computer Vision

*Introduction to the project


*Coin photographs
*Introducing OpenCV
*Starting to work with Visual Studio
*Hough circle transform
*Get circle radii
*Get brightness values
*Performing coin classification
*Closing remarks

20. Further Exercises

*Further exercises

21. Installing Python and Using Virtual Environments on Windows and Unix

*Introduction
*Downloading Python
*Setting up the virtual environment
*Sharing a project - requirements file
*Downloading and Installing Virtual Box & Setting up a Virtual Machine
*Installing Ubuntu on the Virtual Machine
*Setting up a virtual environment on Ubuntu
*Introduction to the command line
*Handling files on the command line
*Copying, moving and deleting files on the command line
*Echo, environment variables and path
*Section summary

22. PyCharm

*Introduction and installing PyCharm


*Setting up PyCharm
*Using PyCharm to write code
*PyCharm and Git

23. Exception handling in Python

*Exception handling - Introduction


*Setting the scene
*How Python handles errors
*The block stops running as soon as the exception is encountered
*When to use a try block
*Exception handling - Conclusion

24. Farewell

*Farewell

Course 4:

Mathematics:

1. Introduction to Linear Algebra

*Welcome to the course


*Scalars and Vectors
*Linear Algebra and Geometry
*Setting up the Environment
*Scalars, Vectors, and Matrices as Python Arrays
*What is a Tensor?
*Addition and Subtraction
*Errors when Adding Matrices
*Transpose of a Matrix
*Dot Product
*Dot Product of Matrices
*Why is Linear Algebra Useful

Course 5:

SQL:

1. Introduction to Databases, SQL, and MySQL

*Welcome to SQL
*Why SQL?
*Why MySQL?
*Introduction to databases

2. SQL Theory

*SQL as a declarative language


*Data definition language (DDL)
*SQL keywords
*Data manipulation language (DML)
*Data control language (DCL)
*Transaction control language (TCL)

3. Basic Database Terminology

*Relational database essentials


*Databases vs spreadsheets
*Database terminology
*Relational schemas - Primary key
*Relational schemas - Foreign key
*Relational schemas - Unique key and null values
*Relationships

4. Installing MySQL and Getting Acquainted with the Interface

*Installing MySQL
*Additional note – Installing – Visual C
*Installing MySQL on macOS and Unix systems
*The Client-Server Model
*Setting up a connection
*Important - Please read
*New Authentication Plugin - Creating a New User
*Introduction to the MySQL interface

5. First Steps in SQL

*Creating a Database - Part I


*Creating a Database - Part I - Exercise
*SQL files
*Creating a Database - Part II
*Creating a Database - Part II - Exercise
*Introduction to data types
*String data types
*Integers
*Fixed and floating-point data types
*Other useful data types
*Creating a table
*Creating a table - Exercise
*Using databases and tables
*Using databases and tables - Exercise
*Additional notes on using tables
*Additional notes on using tables - Exercise

6. MySQL Constraints

*PRIMARY KEY constraint


*PRIMARY KEY constraint - Exercise
*FOREIGN KEY constraint - Part I
*FOREIGN KEY constraint - Part II
*FOREIGN KEY constraint - Part II - Exercise
*UNIQUE Constraint
*UNIQUE Constraint - Code
*DEFAULT Constraint
*DEFAULT Constraint - Code
*NOT NULL Constraint - Part I
*NOT NULL Constraint - Part I - Code
*NOT NULL Constraint - Part II

7. SQL Best Practices

*Coding techniques and best practices - Part I


*Coding techniques and best practices - Part II

8. Loading the Data

*Loading the 'employees' database


*Loading the 'employees' database - Link

9. SQL SELECT STATEMENT

*SELECT… FROM…
*SELECT - FROM - Exercise
*WHERE
*WHERE - Exercise
*AND
*AND - Exercise
*OR
*OR - Exercise
*Operator Precedence
*Operator Precedence - Exercise
*IN - NOT IN
*IN - NOT IN - Exercise
*LIKE - NOT LIKE
*LIKE - NOT LIKE - Exercise
*Wildcard Characters
*Wildcard Characters - Exercise
*BETWEEN… AND…
*BETWEEN - AND - Exercise
*IS NOT NULL - IS NULL
*IS NOT NULL - IS NULL - Exercise
*Other Comparison Operators
*Other Comparison Operators - Exercise
*SELECT DISTINCT
*SELECT DISTINCT - Exercise
*Introduction to Aggregate Functions
*Introduction to Aggregate Functions - Exercise
*ORDER BY
*ORDER BY - Exercise
*GROUP BY
*Using Aliases (AS)
*Using Aliases (AS) - Exercise
*HAVING
*HAVING - Exercise
*WHERE vs HAVING - Part I
*WHERE vs HAVING - Part II
*WHERE vs HAVING - Exercise 2
*LIMIT
*LIMIT - Exercise

10. SQL INSERT Statement

*The INSERT statement - Part I


*The INSERT statement - Part II
*Inserting Data INTO a New Table
*The INSERT statement - Part II - Exercise
*Inserting Data INTO a New Table - Exercise

11. SQL UPDATE Statement

*TCL's COMMIT and ROLLBACK


*The UPDATE Statement - Part I
*The UPDATE Statement - Part II
*The UPDATE Statement - Part II - Exercise

12. SQL DELETE Statement

*The DELETE Statement - Part I


*The DELETE Statement - Part II
*The DELETE Statement - Part II - Exercise
*DROP vs TRUNCATE vs DELETE

13. MySQL Aggregate Functions


*COUNT()
*COUNT() - Exercise
*SUM()
*SUM() - Exercise
*MIN() and MAX()
*MIN() and MAX() - Exercise
*AVG()
*AVG() - Exercise
*ROUND()
*ROUND() - Exercise
*COALESCE() - Preamble
*IFNULL() and COALESCE()
*Another Example of Using COALESCE()
*Another Example of Using COALESCE() - Exercise

14. SQL JOINs

*Introduction to JOINs
*Introduction to JOINs - Exercise
*INNER JOIN - Part I
*INNER JOIN - Part II
*INNER JOIN - Part II - Exercise
*A Note on Using Joins
*Duplicate Records
*LEFT JOIN - Part I
*LEFT JOIN - Part II
*LEFT JOIN - Part II - Exercise
*RIGHT JOIN
*The new and the old join syntax
*The new and the old join syntax - Exercise
*JOIN and WHERE Used Together
*JOIN and WHERE Used Together - Exercise
*CROSS JOIN
*CROSS JOIN - Exercise
*Using Aggregate Functions with Joins
*JOIN more than two tables in SQL
*JOIN more than two tables in SQL - Exercise
*Tips and tricks for joins
*Tips and tricks for joins - Exercise
*UNION vs UNION ALL
*UNION vs UNION ALL - Exercise

15. SQL Subqueries

*SQL Subqueries with IN nested inside WHERE


*SQL Subqueries with IN nested inside WHERE - Exercise
*SQL Subqueries with EXISTS-NOT EXISTS nested inside WHERE
*SQL Subqueries with EXISTS-NOT EXISTS nested inside WHERE - Exercise
*SQL Subqueries nested in SELECT and FROM
*SQL Subqueries nested in SELECT and FROM - Exercise Text

16. SQL Self Join

*SQL Self Join

17. SQL Views


*Using SQL views
*Using SQL views - Exercise

18. Stored Routines

*Introduction to Stored Routines


*The MySQL syntax for stored procedures
*Stored Procedures - Example - Part I
*Stored Procedures - Example - Part II
*Another Way to Create a Procedure in MySQL
*Stored Procedures - Example - Part II - Exercise
*Stored Procedures with an Input Parameter
*Stored Procedures with an Output Parameter
*Variables
*Stored Procedures with an Output Parameter - Exercise
*User-defined functions in MySQL
*Variables - Exercise
*Stored routines - conclusion
*User-defined functions in MySQL - Exercise

19. Advanced SQL Topics

*Types of MySQL Variables - Local Variables


*Session Variables
*Global Variables
*User-Defined vs System Variables
*MySQL Triggers - Preamble
*MySQL Triggers
*MySQL Indexes
*MySQL Triggers - Exercise
*The CASE Statement
*MySQL Indexes - Exercise Text
*The CASE Statement - Exercise Text

Course 6:

Machine Learning in Python:

1. Linear Regression

*Course Introduction
*The linear regression model
*Correlation vs regression
*Geometrical representation of the Linear Regression Model
*Setting up the Environment
*Python packages installation
*First regression in Python
*Using Seaborn for graphs
*How to interpret the regression table
*Decomposition of variability
*What is the OLS?
*R-squared
*Multiple linear regression Theory
*Adjusted R-squared Theory
*F-test
*OLS assumptions
*A1: Linearity
*A2: No endogeneity
*A3: Normality and homoscedasticity
*A4: No autocorrelation
*A5: No multicollinearity
*Dealing with categorical data - Dummy variables
*Making predictions with the linear regression

2. Linear Regression with sklearn

*What is sklearn
*Game plan for sklearn
*Simple linear regression
*Simple linear regression - summary table
*A Note on Normalization
*Multiple linear regression
*Adjusted R-squared
*Feature Selection through p-values
*Creating a summary table
*A Note on Calculation of P-Values with sklearn
*Feature Scaling
*Feature Selection through standardization
*Making predictions with standardized coefficients
*Underfitting and overfitting
*Training and testing

3. Linear Regression Practical Example

*Linear Regression Practical Example (Part 1)


*Linear Regression Practical Example (Part 2)
*A note on multicollinearity
*Linear Regression Practical Example (Part 3)
*Linear Regression Practical Example (Part 4)
*Linear Regression Practical Example (Part 5)

4. Logistic Regression

*Introduction to logistic regression


*A simple example in Python
*Logistic vs logit function
*Building a logistic regression
*An invaluable coding tip
*Understanding the tables
*What do the odds actually mean
*Binary predictors in a logistic regression
*Calculating the accuracy of the model
*The concept of overfitting
*Testing the model

5. Cluster Analysis (Basics and Prerequisites)

*Introduction to Cluster Analysis


*Some examples of clusters
*Difference between classification and clustering
*Math prerequisites

6. K-Means Clustering

*K-means clustering
*A simple example of clustering
*Clustering categorical data
*How to choose the number of clusters
*Pros and Cons of K-means clustering
*To standardize or to not standardize
*Relationship between clustering and regression
*Market Segmentation with Cluster Analysis (Part 1)
*Market Segmentation with Cluster Analysis (Part 2)
*How is clustering useful

7. Other Types of Clustering

*Types of clustering
*Dendrogram
*Heatmaps using Seaborn

Course 7:

Deep Learning with TensorFlow 2:

1. Introduction

*Why machine learning

2. Neural networks Intro

*Introduction to neural networks


*Training the model theory
*Types of machine learning
*The linear model
*The linear model. Multiple inputs.
*The linear model. Multiple inputs and multiple outputs
*Graphical representation
*The objective function
*L2-norm loss
*Cross-entropy loss
*One-parameter gradient descent
*N-parameter gradient descent

3. Setting up the environment

*Setting up the environment - Do not skip, please!


*Why Python and why Jupyter
*Installing Anaconda
*Jupyter Dashboard - Part 1
*Jupyter Dashboard - Part 2
*Installing the TensorFlow package
*Installing the packages

4. Minimal example

*Outline
*Generating the data (optional)
*Initializing the variables
*Training the model
*Minimal Example - Exercises.txt

5. Introduction to TensorFlow 2

*TensorFlow Outline
*TensorFlow 2 Intro
*A note on coding in TensorFlow
*Types of file formats in TensorFlow and data handling
*Model layout - inputs, outputs, targets, weights, bias, optimizer, and loss
*Interpreting the result and extracting the weights and bias
*Customizing your model
*Introduction to TensorFlow - Exercises

6. Deep nets overview

*The layer
*What is a deep net
*Really understand deep nets
*Why do we need non-linearities
*Activation functions
*Softmax activation
*Backpropagation
*Backpropagation - intuition

7. Backpropagation (optional)

*Backpropagation mathematics

8. Overfitting

*Underfitting and overfitting. A regression example


*Underfitting and overfitting. A classification example
*Train vs validation
*Train vs validation vs test
*N-fold cross validation
*Early stopping - motivation and types

9. Initialization

*Initialization
*Types of simple initializations
*Xavier's initialization

10. Optimizers

*SGD & Batching
*Local minima pitfalls
*Momentum
*Learning rate schedules
*Learning rate schedules. A picture
*Adaptive learning schedules
*Adaptive moment estimation

11. Preprocessing

*Preprocessing
*Basic preprocessing
*Standardization
*Dealing with categorical data
*One hot vs binary

12. Deeper example

*MNIST dataset
*How to tackle the MNIST dataset
*MNIST - Importing libraries and data
*Preprocess the data - create a validation dataset and scale the data
*Preprocess the data - scale the test data
*Preprocess the data - shuffle and batch
*Preprocess the data - shuffle and batch Exercise
*Outline the model
*Select the loss and the optimizer
*Learning
*MNIST - Exercises
*Testing the model

13. Business case

*Exploring the dataset and identifying predictors


*Outlining the business case solution
*Balancing a dataset
*Preprocessing the data
*Preprocessing Exercise
*Load the preprocessed data
*Load the preprocessed data Exercise
*Learning and interpreting the result
*Setting an early stopping mechanism
*Setting an early stopping mechanism Exercise
*Testing the business model
*Homework

14. Conclusion

*Summary
*What's more out there
*An overview of CNNs
*How DeepMind uses deep learning
*An overview of RNNs
*Non-NN approaches

Course 8:

Introduction to Tableau:

1. Introduction to Tableau

*Welcome to Tableau
*Why use Tableau: Make your data make an impact
*Let's download Tableau Public
*Connecting data in Tableau
*Exploring Tableau's interface
*Exploring Tableau's interface
*Let's create our first chart in Tableau!

2. Tableau Functionalities

*Creating a table
*Creating a table - link
*Creating custom fields
*Creating a custom field and adding calculations to a table
*Creating a custom field and adding calculations to a table - link
*Adding totals and subtotals
*Adding totals and subtotals - link
*Adding a custom calculation
*Adding a custom calculation - link
*Inserting a filter
*Inserting a filter - link
*Working with joins in Tableau

3. The Tableau Exercise

*Introduction to the exercise


*Let's create a dashboard - Visualizing the three charts we want to create
*Using Joins in Tableau
*Performing a Numbers Check - Attempt #1
*Performing a Numbers Check - Attempt #1 - link
*Blending Data in Tableau
*Blending Data in Tableau - link
*Performing a Numbers Check - Attempt #2
*Performing a Numbers Check - Attempt #2 - link
*First chart
*First chart - link
*Second chart
*Second chart - link
*Third Chart
*Third Chart - link
*Creating and Formatting a Dashboard
*Creating and Formatting a Dashboard - link
*Adding Interactive Filters for Improved Analysis
*Adding Interactive Filters for Improved Analysis - link
*Interactive filters - fix

Course 9:

The Complete Data Visualization Course with Python, R, Tableau, and Excel:

1. Introduction

*What does the Course Cover


*Why Learn Data Visualization
*How to Choose the Right Visualization - Popular Approaches and Frameworks
*Color Theory and Colors

2. Setting Up the Environments

*Setting Up The Environments - Do Not Skip, Please!


*Tableau - Downloading Tableau
*Python - Why Python and Why Jupyter
*Python - Installing Anaconda
*Python - Jupyter Dashboard - Part 1
*Python - Jupyter Dashboard - Part 2
*Python - Installing the Seaborn Package
*R - Installing R and RStudio
*R - Quick Guide to RStudio
*R - Changing the Appearance in RStudio
*R - Installing Packages and Using Libraries

3. Bar Chart

*Bar Chart - Introduction - General Theory and Dataset


*Bar Chart - Excel - How to Create a Bar Chart
*Bar Chart - Tableau - How to Create a Bar Chart
*Bar Chart - Python - How to Create a Bar Chart
*Bar Chart - R - How to Create a Bar Chart
*Bar Chart - Interpretation & What Makes a Good Bar Chart
*Bar Chart Homework

4. Pie Chart

*Pie Chart - Introduction - General Theory and Dataset


*Pie Chart - Excel - How to Create a Pie Chart
*Pie Chart - Tableau - How to Create a Pie Chart
*Pie Chart - Python - How to Create a Pie Chart
*Pie Chart - R - How to Create a Pie Chart
*Pie Chart - Interpretation
*Pie Chart - Why You Should Never Use a Pie Chart

5. Stacked Area Chart

*Stacked Area Chart - Introduction - General Theory and Dataset


*Stacked Area Chart - Excel - How to Create a Stacked Area Chart
*Stacked Area Chart - Tableau - How to Create a Stacked Area Chart
*Stacked Area Chart - Python - How to Create a Stacked Area Chart
*Stacked Area Chart - R - How to Create a Stacked Area Chart
*Stacked Area Chart - Interpretation
*Stacked Area Chart - What Makes a Good Stacked Area Chart
*Stacked Area Chart Homework

6. Line Chart

*Line Chart - Introduction - General Theory and Dataset


*Line Chart - Excel - How to Create a Line Chart
*Line Chart - Tableau - How to Create a Line Chart
*Line Chart - Python - How to Create a Line Chart
*Line Chart - R - How to Create a Line Chart
*Line Chart - Interpretation
*Line Chart - What Makes a Good Line Chart
*Line Chart Homework

7. Histogram

*Histogram - Introduction - General Theory and Dataset


*Histogram - Excel - How to Create a Histogram Chart
*Histogram - Tableau - How to Create a Histogram
*Histogram - Python - How to Create a Histogram
*Histogram - R - How to Create a Histogram
*Histogram - Interpretation
*Histogram - How to Choose the Right Number of Bins
*Histogram - What Makes a Good Histogram
*Histogram Homework

8. Scatter Plot

*Scatter Plot - Introduction - General Theory and Dataset


*Scatter Plot - Excel - How to Create a Scatter Plot
*Scatter Plot - Tableau - How to Create a Scatter Plot
*Scatter Plot - Python - How to Create a Scatter Plot
*Scatter Plot - R - How to Create a Scatter Plot
*Scatter Plot - Interpretation
*Scatter Plot - What Makes a Good Scatter Plot
*Scatter Plot Homework

9. Combo Plots Part 1 - Regression Plot

*Regression Plot - Introduction - General Theory and Dataset


*Regression Plot - Excel - How to Create a Regression Plot
*Regression Plot - Tableau - How to Create a Regression Plot
*Regression Plot - Python - How to Create a Regression Plot
*Regression Plot - R - How to Create a Regression Plot
*Regression Plot - Interpretation
*Regression Plot - What Makes a Good Regression Plot
*Regression Plot Homework

10. Combo Plots Part 2 - Bar and Line Chart

*Bar and Line Chart - Introduction - General Theory and Dataset


*Bar and Line Chart - Excel - How to Create a Bar and Line Chart
*Bar and Line Chart - Tableau - How to Create a Bar and Line Chart
*Bar and Line Chart - Python - How to Create a Bar and Line Chart
*Bar and Line Chart - R - How to Create a Bar and Line Chart
*Bar and Line Chart - Interpretation
*Bar and Line Chart - What Makes a Good Combination Chart
*Bar and Line Chart Homework

11. Advanced Topics - Dashboard in Excel

*Dashboard in Excel - Introduction


*Dashboard in Excel - Getting to Know the Data Set
*Dashboard in Excel - Creating the Design for our Dashboard
*Dashboard in Excel – Creating a Drop-down List and Radio Buttons in Excel
*Dashboard in Excel – Using the Developer Tab and Creating Radio Buttons
in Excel
*Dashboard in Excel – Including Additional Features to the Data Set
*Dashboard in Excel - Pivot Tables
*Dashboard in Excel - Tables – Creating LTM and YTD References in Excel
*Dashboard in Excel - Tables – Creating Month, Year and Selected Dates
Fields in Excel
*Dashboard in Excel – Tables - Calculating Net Sales
*Dashboard in Excel – Tables - Calculating GP% and Distribution Costs
*Dashboard in Excel - Tables – Volume by Size
*Dashboard in Excel – Charts - Bar and Line Combination Chart
*Dashboard in Excel Homework
*Dashboard in Excel - Charts – Clustered Bar Chart I
*Dashboard in Excel - Charts – Clustered Bar Chart II
*Dashboard in Excel – Adding KPIs and a Slicer to the Dashboard
*Dashboard in Excel - Customizing the Appearance of the Dashboard
*Dashboard in Excel - Customizing the Appearance of the Slicer
*Dashboard in Excel - Interpretation

12. Advanced Topics - Dashboard in Tableau

*Dashboard in Tableau - Introduction


*Dashboard in Tableau - Loading the Data and Preparing the Sheets for the
Tableau Dashboard
*Dashboard in Tableau - Bar and Line Combination Chart
*Dashboard in Tableau - Horizontal Bars Chart I
*Dashboard in Tableau - Horizontal Bars Chart II
*Dashboard in Tableau - Adding KPIs to the Tableau Dashboard
*Dashboard in Tableau - Creating the Tableau Dashboard and Adding a Brand
Filter
*Dashboard in Tableau - Incorporating a Date Filter into the Dashboard
*Dashboard in Tableau - Styling the Tableau Dashboard
*Dashboard in Tableau - Interpretation

Course 10:

Introduction to R Programming:

1. Introduction & Getting Started

*What does this course cover


*What does section 1 cover
*Downloading and installing R and RStudio
*Quick guide to the RStudio user interface
*Changing the appearance of RStudio
*Installing packages and using the library

2. The building blocks of R

*Creating an object in R
*Data types in R (Integers and doubles)
*Data types in R (Characters and logicals)
*Coercion rules in R
*Functions in R
*Functions and arguments
*Building a function in R
*Using the script vs. using the console

3. Vectors and vector operations


*What does section 3 cover
*Introduction to vectors
*Vector recycling
*Naming a vector
*Getting help with R
*Slicing and indexing a vector
*Changing the dimensions of an object in R

4. Matrices

*Creating a matrix
*Faster code - creating a matrix in a single line of code
*Do matrices recycle
*Indexing an element from a matrix
*Slicing a matrix
*Matrix arithmetic
*Matrix operations
*Categorical data
*Creating a factor in R
*Lists in R

5. Fundamentals Of Programming With R

*Relational operators in R
*Logical operators in R
*Logical operators and vectors
*If else else-if statements
*If else else-if keep-in-minds
*For loops in R
*While loops in R
*Repeat loops in R
*Building a function in R 2.0
*Building a function in R 2.0 Scoping

6. Data frames in R

*What does section 6 cover


*Creating a data frame
*The Tidyverse package
*Data import into R
*Importing a CSV into R
*Data export in R
*Getting a sense of your data frame
*Indexing and slicing a data frame in R
*Extending a data frame in R
*Dealing with missing data

7. Manipulating data with R

*What does section 7 cover


*Data transformation with R - the Dplyr package - Part I
*Data transformation with R - the Dplyr package - Part II
*Sampling data with the Dplyr package
*Using the pipe operator
*Tidying your data - gather() and separate()
*Tidying your data - unite() and spread()

8. Visualizing data with R

*What does section 8 cover


*Intro to data visualisation
*Intro to ggplot2
*Variables revisited
*Building a histogram with ggplot2
*Building a bar chart with ggplot2
*Building a box and whiskers plot with ggplot2
*Building a scatterplot with ggplot2

9. Exploratory data analysis with R

*Population vs Sample
*Mean, median, mode
*Skewness
*Variance, standard deviation, and coefficient of variability
*Covariance and correlation

10. Hypothesis Testing

*Distributions
*Standard Error and Confidence Intervals
*Hypothesis Testing
*Type I and Type II errors
*Test for the mean. Population variance known
*The P-value
*Test for the mean. Population variance unknown
*Dependent samples
*Comparing two means. Independent samples

11. Linear Regression Analysis in R

*The Linear Regression Model


*Correlation vs Regression
*Geometrical Representation
*Doing the Regression in R
*How to interpret the regression table
*Decomposition of variability
*R-Squared

Course 11:

Data Preprocessing with NumPy:

1. Introduction to NumPy

*Course Introduction
*The NumPy Package and Its Applications
*Installing and Upgrading NumPy
*What is an array?
*Using The NumPy Documentation
*Frequently Asked Questions

2. Why do we use NumPy?

*History of NumPy
*Ndarrays
*Arrays vs Lists

3. NumPy Fundamentals

*Indexing
*Assigning Values
*Elementwise Properties
*Types of Data Supported by NumPy
*Characteristics of NumPy Functions - Part 1
*Characteristics of NumPy Functions - Part 2

4. Working with Arrays

*Basic Slicing
*Stepwise Slicing
*Conditional Slicing
*Dimensions and the Squeeze Function

5. Generating Data with NumPy

*Arrays of 0s and 1s
*"_like" functions in NumPy
*A Non-Random Sequence of Numbers
*Random Generators and Seeds
*Basic Random Functions in NumPy
*Probability Distributions in NumPy
*Applications of Random Data in NumPy

6. Importing and Saving Data with NumPy

*np.loadtxt() vs np.genfromtxt()
*Simple Cleaning when Importing
*String vs Object vs Numbers
*np.save()
*np.savez()
*np.savetxt()

7. Statistics with NumPy

*Using Statistical Functions in NumPy


*Minimal and Maximal Values in NumPy
*Statistical Order Functions in NumPy
*Averages and Variance in NumPy
*Covariance and Correlation in NumPy
*Histograms in NumPy (Part 1)
*Histograms in NumPy (Part 2)
*NAN Equivalent Functions in NumPy

8. Data Manipulation with NumPy

*Checking for Missing Values in Ndarrays


*Substituting Missing Values in Ndarrays
*Reshaping Ndarrays
*Removing Values from Ndarrays
*Sorting Ndarrays
*Argument Sort in NumPy
*Argument Where in NumPy
*Shuffling Ndarrays
*Casting Ndarrays
*Stripping Values from Ndarrays
*Stacking Ndarrays
*Concatenating Ndarrays
*Finding Unique Values in Ndarrays

9. A Loan Data Practical Example with NumPy

*Setting Up: Introduction to the Practical Example


*Setting Up: Importing the Data Set
*Setting Up: Checking for Incomplete Data
*Setting Up: Splitting the Dataset
*Setting Up: Creating Checkpoints
*Manipulating Text Data: Issue Date
*Manipulating Text Data: Loan Status and Term
*Manipulating Text Data: Grade and Sub Grade
*Manipulating Text Data: Verification Status & URL
*Manipulating Text Data: State Address
*Manipulating Text Data: Converting Strings and Creating a Checkpoint
*Manipulating Numeric Data: Substitute Filler Values
*Manipulating Numeric Data: Currency Change – The Exchange Rate
*Manipulating Numeric Data: Currency Change - From USD to EUR
*Completing the Dataset

cours 12:

Data Cleaning and Preprocessing with pandas

1. pandas - Basics

*Introduction to the pandas Library


*Installing and Running pandas
*Introduction to pandas Series
*Working with Attributes in Python
*Using an Index in pandas
*Label-based vs Position-based Indexing
*More on Working with Indices in Python
*Using Methods in Python - Part I
*Using Methods in Python - Part II
*Parameters vs Arguments
*The pandas Documentation
*Introduction to pandas DataFrames
*Creating DataFrames from Scratch - Part I
*Creating DataFrames from Scratch - Part II
*Additional Notes on Using DataFrames

2. Data Cleaning and Data Preprocessing

*Data Cleaning and Data Preprocessing

3. pandas Series

*.unique(), .nunique()
*Converting Series into Arrays
*.sort_values()
*Attribute and Method Chaining
*.sort_index()

4. pandas DataFrames

*A Revision to pandas DataFrames


*Common Attributes for Working with DataFrames
*Data Selection in pandas DataFrames
*Data Selection - Indexing Data with .iloc[]
*Data Selection - Indexing Data with .loc[]
*A Few Comments on Using .loc[] and .iloc[]

cours 13:

Introduction to Business Analytics

1. Setting the scene

*What does the course cover?


*Managing expectations of different stakeholder groups
*Preparing an annual business plan and a long-range plan
*Analytics vs analysis

2. Understanding your business

*Section introduction
*Stakeholder mapping
*Defining business intelligence
*End-to-end process mapping (part 1)
*End-to-end process mapping (part 2)

3. An in-depth view of end-to-end processes in a corporation

*Hire-to-retire (H2R)
*Source-to-Pay (S2P)
*Record-to-report (R2R)
*Order-to-Cash (O2C)
*Global process owners - Conclusion

4. Target setting

*Identifying a firm's key value drivers


*Metrics vs KPIs
*Define your metrics
*The benchmarking exercise
*The importance of data
*The master data governance function

5. Maturity stages in analytics

*Section introduction
*Descriptive analytics
*Diagnostic analytics
*Predictive analytics
*Prescriptive analytics

6. Analytics techniques in practice

*Section introduction
*Trend analysis
*Trend analysis in Excel
*Comparative analysis
*Comparative analysis in Excel
*Value-based analysis
*Value-based analysis in Excel
*Correlation analysis
*Correlation analysis in Excel
*Time series analysis
*Time series analysis in Excel
*Regression analysis
*Regression analysis in Excel
*Machine learning analysis
*Types of machine learning
*Conclusion

7. Analytics life cycle

*Intro to Section 7
*Hypothesis development
*Situational analysis
*Building a RACI matrix
*What makes for a successful project?
*Current state analysis
*Blueprint and design
*Build and test
*Build and test: step-by-step
*Deploy and operationalise

8. Data visualization for business analytics

*Data visualization

9. Practical case study


*Practical case study (part 1)
*Practical case study (part 2)

cours 13:

Data Analysis with Excel Pivot Tables

1. Introduction to pivot tables

*Introduction to the course


*Intro to Pivot tables
*Editing a Pivot table
*Formatting a Pivot table
*Adjusting a Pivot table
*GETPIVOTDATA
*Slicers

2. Building a world-class report with GETPIVOTDATA (practical exercise)

*Introduction to the case study


*Working with an SAP data extraction
*Preliminary mapping of the data extraction
*Creating an output structure of the FMCG model
*Improving the layout and appearance of the FMCG report
*Inserting formulas and automating calculations
*Creating a Master Pivot Table: The main data source for the FMCG report
*GetPivotData is great! Extracting data from the Master Pivot Table
*GETPIVOTDATA - fix
*Combining Slicers and GetPivotData: The key to our success
*Getting fancy with Excel slicers
*This is how the report can be used in practice by high-level executives

cours 14:

Credit Risk Modeling in Python:

1. Introduction

*What does the course cover


*What is credit risk and why is it important?
*Expected loss (EL) and its components: PD, LGD and EAD
*Capital adequacy, regulations, and the Basel II accord
*Basel II approaches: SA, F-IRB, and A-IRB
*Different facility types (asset classes) and credit risk modeling approaches

2. Setting up the environment

*Setting up the environment


*Installing the relevant packages

3. Dataset description

*Our example: consumer loans. A first look at the dataset
*Dependent variables and independent variables

4. General preprocessing

*Importing the data into Python


*Preprocessing few continuous variables
*Preprocessing few continuous variables Homework
*Preprocessing few discrete variables
*Check for missing values and clean
*Check for missing values and clean Homework

5. PD model: data preparation

*How is the PD model going to look like?


*Dependent variable: Good/ Bad (default) definition
*Constructing independent variables
*Information value
*Data preparation. Splitting data
*Data preparation. Preprocessing one discrete variable
*Data preparation. Preprocessing discrete variables: automating calculations
*Data preparation. Preprocessing discrete variables: visualizing results
*Data Preparation. Preprocessing Discrete Variables: Creating Dummies (part 1)
*Data Preparation. Preprocessing Discrete Variables: Creating Dummies (part 2)
*Data preparation. Preprocessing continuous variables: automating calculations
*Data preparation. Preprocessing continuous variables: creating dummies (part 1)
*Data preparation. Preprocessing continuous variables: creating dummies (part 2)
*Creating dummies Homework
*Data preparation. Preprocessing continuous variables: creating dummies (part 3)
*Data preparation. Preprocessing the test dataset

6. PD model estimation

*The PD model. Logistic regression with dummy variables


*Loading the data and selecting the features
*PD model estimation
*Build a logistic regression model with p-values.
*Interpreting the coefficients in the PD model

7. PD model validation (test)

*Out-of-sample validation (test).


*Evaluation of model performance: accuracy and area under the curve (AUC)
*Evaluation of model performance: Gini and Kolmogorov-Smirnov.

8. Applying the PD model for decision making


*Calculating probability of default for a single customer
*Creating a scorecard
*Calculating credit score
*From credit score to PD
*Setting cut-offs
*Setting cut-offs Homework

9. PD model monitoring

*PD model monitoring via assessing population stability


*Population stability index: preprocessing
*Population stability index: calculation and interpretation
*Population stability index: calculation and interpretation Homework

10. LGD and EAD models

*LGD and EAD models: independent variables


*LGD and EAD models: dependent variables
*LGD and EAD models: distribution of recovery rates and credit conversion factors

11. LGD model

*LGD model: preparing the inputs


*LGD model: testing the model
*LGD model: estimating the accuracy of the model
*LGD model: saving the model
*LGD model: stage 2 – linear regression
*LGD model: stage 2 – linear regression evaluation
*LGD model: combining stage 1 and stage 2
*LGD model: combining stage 1 and stage 2 Homework

12. EAD model

*EAD model estimation and interpretation


*EAD model validation

13. Calculating expected loss

*Calculating expected loss Homework


*Calculating expected loss

cours 15:

SQL + Tableau + Python:

1. Software Integration

*Course Introduction
*Properties and Definitions: Data, Servers, Clients, Requests, and Responses
*Properties and Definitions: Data Connectivity, APIs, and Endpoint
*Further Details on APIs
*Text Files as Means of Communication
*Definitions and Applications

2. What's Next in the Course?

*Up Ahead
*Real-Life Example: Absenteeism at Work
*Real-Life Example: The Dataset

3. Preprocessing the 'Absenteeism_data'

*What to Expect from the Next Couple of Sections


*Importing the Dataset in Python
*Eyeballing the Data
*Introduction to Terms with Multiple Meanings
*A Refresher on Regression Analysis
*An Analytical Approach to Solving the Task
*Dropping the "ID" Column
*Analysis of the "Reason for Absence" Column
*Converting a Feature into Multiple Dummy Variables
*Dropping a Dummy Variable
*Working with Dummy Variables from a Statistical Perspective
*Grouping the Various Reasons for Absence
*Concatenating Column Values
*Reordering Columns
*Creating Checkpoints in Jupyter
*Working on the "Date" Column
*Extracting the Month Value
*Creating the "Day of the Week" Column
*Dropping the "Date" Column
*Modifying "Education" and Discussing "Children" and "Pets"
*Analyzing the Next 5 Columns in our DataFrame
*Final Remarks on the Data Preprocessing Part of the Exercise
*A Note on Exporting Your Data as a *.csv File

4. Machine Learning

*Exploring the Problem from a Machine Learning Point of View


*Creating the Targets for the Logistic Regression
*Selecting the Inputs for the Regression
*Standardizing the Dataset for Better Results
*Train-test Split of the Data
*Training and Evaluating the Model
*Extracting the Intercept and Coefficients
*Interpreting the Coefficients
*Creating a Custom Scaler to Standardize Only Numerical Features
*Interpreting the Important Predictors
*Simplifying the Model (Backward Elimination)
*Testing the Machine Learning Model
*Saving the Logistic Regression Model
*More about 'pickling'
*Creating a Module for Later Use of the Model

5. Connecting Python and SQL

*Downloading the Section Resources


*Loading the "absenteeism_module"
*Working with the "absenteeism_module"
*Creating a Database Structure in MySQL
*Installing and Importing 'pymysql'
*Setting up a Connection and Creating a Cursor
*Creating the 'predicted_outputs' table in MySQL
*Executing an SQL Query from Python
*Moving Data from Python to SQL - Part I
*Moving Data from Python to SQL - Part II
*Moving Data from Python to SQL - Part III

6. Analyzing the Obtained Data in Tableau

*Tableau Analysis: Age vs Probability - Homework and Dataset


*Tableau Analysis: Age vs Probability
*Tableau Analysis: Reasons vs Probability
*Tableau Analysis: Transportation Expense vs Probability

cours 16:

Introduction to Jupyter:

1. Introduction to Jupyter

*Introduction
*A 5-minute explanation of Programming
*Introduction to Jupyter
*Installing Anaconda
*Intro to Using Jupyter
*Working with Notebook Files
*Using Shortcuts
*Handling Error Messages
*Restarting the Kernel

cours 17:

Introduction to Excel

1. Welcome! Course Introduction

*Welcome to the Course


*Excel Made Easy: A Beginner's Guide to Excel Spreadsheets
*Data Entry Techniques in Excel
*How to Make Your Spreadsheets Look Professional
*Inserting, Deleting, and Modifying Rows & Columns
*Excel Formulas for Beginners
*Excel Functions
*Work Efficiently by Using Cut, Copy, and Paste
*Formatting
*Pasting Values, Formulas and Formats with Paste Special

2. Useful Tools & Tips

*Inserting a Line Break with Alt + Enter


*Do More with Your Sales Data with Excel's Text to Columns Feature
*Create Easily Printable Excel Documents
*How to Wrap Text in Excel and Adjust a Cell's Size
*Insert Hyperlinks into Excel Spreadsheets
*Using Excel's Freeze Panes to Handle Large Datasets
*Find Excel Functionalities in a Quicker Way - Tell Me What You Want to Do
*A Quick Introduction to Excel's Pivot Tables
*Initial Formatting Is Key for Creating Professional-looking Spreadsheets
*Macros Are a Great Timesaver! Here's Why
*How to Use the Same Macro On Multiple Workbooks
*The Secret to Faster Scrolling in Excel
*Be even quicker: F5 + Enter
*Using Absolute and Relative Cell References
*Find and Select Cells That Meet Specific Conditions
*How to Create Dynamic Names in Excel Spreadsheets
*Using Named Ranges to Make Formulas More Readable
*How to Add a Drop-down List in Excel
*Using Custom-sort to Sort Multiple Columns Within a Table
*Saving Time in Excel and Doing Everything Faster by Using Excel Shortcuts
*Multiply by 1
*Find and Replace - References
*Find and Replace - Formatting
*Green References
*Beauty Saving - The Professional Way of Saving Files
*The Power of F2
*Conditional Formatting
*Introduction to Custom Cell Formatting
*Custom Formatting - An example

3. Beginner, Intermediate & Advanced Functions

*Key Excel Functions: IF


*Enlarge the formula bar
*Key Excel Functions: SUM, SUMIF, SUMIFS
*Key Excel Functions: COUNT, COUNTIF, COUNTIFS
*Key Excel Functions: AVERAGE, AVERAGEIF
*Key Excel Functions: LEFT, RIGHT, MID, UPPER, LOWER, PROPER
*Working with text in Excel
*Find the Highest and Lowest Values in a Range: MAX & MIN
*When you start typing a formula
*Use ROUND in Your Financial Models
*Excel's Lookup Functions: VLOOKUP & HLOOKUP made easy
*INDEX, MATCH, and Their Combination - The Perfect Substitute for VLOOKUP
*Using Excel's IFERROR Function to Trap Spreadsheet Errors
*A Useful Tool for Financial Analysis - The RANK Function
*Create Flexible Financial Models with CHOOSE
*Goal Seek Will Help You Find the Result You Are Looking For
*Perform Sensitivity Analysis with Excel's Data Tables Functionality

4. Practical Exercise "Build a P&L From Scratch"

*Introduction to the Case Study


*What You Will See Next
*Understand Your Data Source Before You Start Working on It
*Ordering the Source Worksheet
*Create a Code: The Best Way to Organize Your Data and Work Efficiently with It
*Learn How to Create a Database
*Using Lookup Functions (Vlookup) to Fill the Database Sheet
*Use Sumif to Complete the Database Sheet
*Using Index & Match as a Substitute for Vlookup
*The Mapping Exercise
*Mapping the Rows in the Database Sheet
*Building the Structure of the P&L Sheet
*A Practical Example of Professional Formatting in Excel
*Populate the P&L Sheet with Sumif
*Learn How to Find Mistakes with Countif
*Calculating Growth Rates in Excel

5. Introduction to Excel Charts

*Introduction to Excel Charts


*Beginner's Guide to Inserting Charts in Excel
*Modifying Excel Charts - The Easy Way
*Making Your Excel Charts Sexier - Proven Tips
*Creating a Bridge Chart in Excel 2016 - As Easy as It Gets
*New Ways to Visualize Your Data - Treemap Charts
*How to Represent Trends with Sparklines

6. Building Professional Charts in Excel

*Stacked Column Chart with a Secondary Axis


*Doughnut Chart
*Area Chart
*Bridge Chart

cours 18:

Starting a Career in Data Science: Project Portfolio, Resume, and Interview Process

1. Course Intro: What Is Data Science

*Course Overview
*The Data Science Knowledge You Need
*Types of Data Science Roles
*The Interview Process Structure
*What Interviewers Look For
*How to Get the Most

2. The Project Portfolio

*Portfolio Overview
*What Is a Data Science Project
*The Projects You Should Do
*How to Differentiate Your Projects
*Where to Showcase Your Projects
*Best Github Practices
*Kaggle Profile

3. The Resume

*Resume Overview
*How to Structure Your Resume
*How to Write about Work and Projects
*Customize your Resume
*Your Virtual Resume
*Resume Checklist
*The Cover Letter

4. Get an Interview

*Interviewing Overview
*How Candidates are Selected
*Networking for Data Scientists
*Leveraging Your Resources
*Informational Interviews
*Reaching out to Recruiters

5. The Phone Interview

*The Phone Interview Overview


*What to Expect
*How to Prepare
*How to Succeed

6. The Take-Home Test

*The Types of Take-Home Tests


*Dealing with Data Sets
*Coding Quizzes
*Written Test

7. The In-Person Interview

*Intro to the In-Person Interview: What to Expect


*Ace the Behavioral Interview
*Technical Interviewing
*Following Up
*The Briefcase Method

8. Bonus Content: Interview with Successful Data Scientists

*Anna Interview
*Elevator Pitch Outline and Examples
*Jaemin Interview
*Jay Interview
*Jefferson Interview
*Sheng Interview
*Glassdoor Findings
*LinkedIn Bonus Content
*Bonus Content: Portfolio Website
*Bonus Content: Star Storytelling Technique

9. Bonus Downloadable Materials

*Resume and Cover Letter Templates and Checklist


*Reach Out Templates
*Interview Questions

cours 19:

SQL + Tableau

1. Combining SQL with Tableau: Introduction

*Introduction to software integration


*Combining SQL and Tableau
*Installing Tableau Public
*Loading the Database

2. Problem 1

*Problem 1: Task
*Task 1 - Text
*Important clarification
*Problem 1: Solution in SQL
*Exporting your output from SQL and loading it in Tableau
*Chart 1: Visualizing the Solution in Tableau - Part I
*Chart 1: Visualizing the Solution in Tableau - Part II

3. Problem 2

*Problem 2: Task
*Task 2 - Text
*Problem 2: Solution in SQL
*Chart 2: Visualizing the Solution in Tableau

4. Problem 3

*Problem 3: Task
*Task 3 - Text
*Problem 3: Solution in SQL
*Chart 3: Visualizing the Solution in Tableau

5. Problem 4

*Problem 4: Task
*Task 4 - Text
*Problem 4: Solution in SQL
*Chart 4: Visualizing the Solution in Tableau

6. Problem 5

*Problem 5: Organizing Charts 1-4 into a Beautiful Dashboard

cours 20:

Time Series Analysis with Python:

1. Introduction

*What does the Course Cover

2. Setting Up the Environment

*Setting up the Environment


*Installing the Necessary Packages

3. Introduction to Time Series in Python

*Introduction to Time Series Data


*Notation for Time Series Data
*Peculiarities of Time Series Data
*Loading the Data
*Examining the Data
*Plotting the Data
*The QQ Plot

4. Creating a Time Series Object in Python

*Transforming String Inputs into Date Time Values


*Using Dates as Indices
*Setting the Frequency
*Filling Missing Values
*Adding and Removing Columns in a Data Frame
*Splitting the Data
*Updating the Dataset

5. Working with Time Series in Python

*White Noise
*Random Walk
*Stationarity
*Determining Weak Form Stationarity
*Seasonality
*Correlation Between Past and Present Values
*The ACF
*The PACF

6. Picking the Correct Model

*Picking the Correct Model

7. The AR Model

*The AR Model
*Examining the ACF and PACF of Prices
*Fitting an AR(1) Model for Index Prices
*Fitting Higher-Lag AR Models for Prices
*Using Returns
*Examining the ACF and PACF of Returns
*Fitting an AR(1) Model for Returns
*Fitting Higher-Lag AR Models for Returns
*Normalizing Values
*Model Selection for Normalized Returns (AR Model)
*Examining the AR Model Residuals
*Unexpected Shocks from Past Periods

8. The MA Model

*The MA Model
*Fitting an MA(1) Model for Returns
*Fitting Higher-Lag MA Models for Returns
*Examining the MA Model Residuals for Returns
*Model Selection for Normalized Returns (MA Model)
*Fitting an MA(1) Model for Prices
*Past Values and Past Errors

9. The ARMA Model

*The ARMA Model


*Fitting a Simple ARMA Model for Returns
*Fitting Higher-Lag ARMA Models for Returns - Part 1
*Fitting Higher-Lag ARMA Models for Returns - Part 2
*Fitting Higher-Lag ARMA Models for Returns - Part 3
*Examining the ARMA Model Residuals of Returns
*ARMA for Prices
*ARMA Models and Non-Stationary Data

10. The ARIMA Model

*The ARIMA Model


*Fitting a Simple ARIMA Model for Prices
*Fitting Higher-Lag ARIMA Models for Prices - Part 1
*Fitting Higher-Lag ARIMA Models for Prices - Part 2
*Higher Levels of Integration
*Using ARIMA Models for Returns
*Outside Factors and the ARIMAX Model
*Seasonal Models - SARIMAX
*Predicting Stability

11. The ARCH Model


*The ARCH Model
*Volatility
*A More Detailed Look of the ARCH Model
*The arch_model Method
*The Simple ARCH Model
*Higher-Lag ARCH Models
*An ARMA Equivalent of the ARCH Model

12. The GARCH Model

*The GARCH Model


*The ARMA and the GARCH
*The Simple GARCH Model
*Higher-Lag GARCH Models
*An Alternative to the Model Selection Process

13. Auto ARIMA

*Auto ARIMA
*Preparing Python for Model Selection
*The Default Best Fit
*Basic Auto ARIMA Arguments
*Advanced Auto ARIMA Arguments
*The Goal Behind Modelling

14. Time Series Forecasting

*Introduction to Time Series Forecasting


*Simple Forecasting Returns with AR and MA
*Intermediate (MAX Model) Forecasting
*Advanced (Seasonal) Forecasting
*Auto ARIMA Forecasting
*Pitfalls of Forecasting
*Forecasting Volatility
*Appendix: Multivariate Forecasting (VARMAX)

15. Business Case

*Business Case - A Look Into the Automobile Industry

cours 21:

Power BI

1. Power BI Overview

*Introduction to Power BI
*Power BI Desktop vs Power BI Pro
*Power BI data sources

2. Power BI Setup

*Power BI for Mac


*Power BI for Mac Part 2
*How to Install Power BI

3. Connecting to data sources

*Introduction to Data Sources


*The Query Editor
*Import vs DirectQuery
*Importing Data from Excel Files
*Importing Data from Excel Files - Continued
*First Data Model
*Active vs Inactive Relationships
*Entering Data Manually
*Roles
*Refreshing Data
*Hierarchies
*Ask a Question

4. Task three

*Task Three, Part 1


*Task Three
*Task Three, Part 2

5. Data Modelling

*Introduction to Data Modeling


*DAX
*Calculated Columns
*Measures
*Star Schema
*Snowflake Schema
*Investigating Filter Flow
*Related vs Relatedtable
*Bi-directional Cross-filtering
*Calculate
*Time Intelligence, Part 1
*Time Intelligence, Part 2
*ALL
*FILTER
*Iterators
*Logical Operators
*Switch
*Text Functions

6. Task Four

*Task Four
*Task Four, Part 1
*Task Four, Part 2
*Task Four, Part 3

7. Creating Our First Data Visualization Report


*Introduction
*Report View
*Slicers
*Bar Chart
*Bookmarks
*Conditional Formatting
*Field Formatting
*Gauge Chart
*Card Visuals, Part 1
*Card Visuals, Part 2
*Drillthrough
*Filters
*Map Visuals
*Custom Colors

8. Task Five

*Task Five
*Task Five, Part 1
*Task Five, Part 2
*Task Five, Part 3
*Task Five, Part 4

9. Task Six Final Project

*Final Project, Part 1


*Task Six
*Final Project, Part 2
*Final Project, Part 3
*Final Project, Part 4
*Final Project, Part 5
*Final Project, Part 6
*Final Project, Part 7
*Final Project, Part 8
*Final Project, Part 9
*Final Project, Part 10
*Final Project, Part 11
*Final Project, Part 12
*Final Project, Part 13
*Final Project, Part 14

10. Bonus - Power BI Service

*Announcement
*Power BI Service
*Workspaces

cours 22:

Product Management for AI & Data Science

1. Intro to Product Management for AI & Data Science

*Introduction
*Course Overview
*Growing Importance of an AI & Data PM
*The Role of a Product Manager
*Differentiation of a PM in AI & Data
*Product Management vs. Project Management

2. Key Technological Concepts for AI & Data Science

*A Product Manager as an Analytics Translator


*Data Analysis vs. Data Science
*An Algorithm vs. AI
*Explaining Machine Learning
*Explaining Deep Learning
*When to use Machine Learning vs. Deep Learning
*Supervised, Unsupervised, & Reinforcement Learning

3. Business Strategy for AI & Data Science

*AI Business Model Innovations


*When to Use AI
*SWOT Analysis
*Building a Hypothesis
*Testing a Hypothesis
*AI Business Canvas
*Create a Business Proposal

4. User Experience for AI & Data Science

*User Experience for Data & AI


*Getting to the Core Problem
*User Research Methods
*Developing User Personas
*Prototyping with AI

5. Data Management for AI & Data Science

*Data Growth Strategy


*Open Data
*Company Data
*Crowdsourcing Labeled Data
*New Feature Data
*Acquisition/Purchase Data Collection
*Databases, Data Warehouses, & Data Lakes

6. Product Development for AI & Data Science

*AI Flywheel Effect


*Top & Bottom Problem Solving
*Product Ideation Techniques
*Complexity vs. Benefit Prioritization
*MVPs & MVDs (Minimum Viable Data)
*Agile & Data Kanban

7. Building The Model

*Who Should Build Your Model


*Enterprise AI
*Machine Learning as a Service (MLaaS)
*In-House AI & The Machine Learning Lifecycle
*Timelines & Diminishing Returns
*Setting a Model Performance Metric

8. Evaluating Performance

*Dividing Test Data


*The Confusion Matrix
*Precision, Recall & F1 Score
*Optimizing for Experience
*Compare & Select the Best Model
*Error Recovery

9. Deployment & Continuous Improvement

*Model Deployment Methods


*Monitoring Models
*Selecting a Feedback Metric
*User Feedback Loops
*Shadow Deployments

10. Managing Data Science & AI Teams

*AI Hierarchy of Needs


*AI Within an Organization
*Roles in AI & Data Teams
*Managing Team Workflow
*Dual & Triple-Track Agile

11. Communication

*Internal Stakeholder Management


*Setting Data Expectations
*Active Listening & Communication
*Compelling Presentations with Storytelling
*Running Effective Meetings

12. Ethics, Privacy, & Bias

*AI User Concerns


*Bad Actors & Security
*AI Amplifying Human Bias
*Data Laws & Regulations

cours 23:

Git and GitHub

1. Course Introduction

*Introduction - Git and GitHub

2. Introduction to Git

*Installing Git
*Configure Git
*Basic commands
*Exploring Git log
*Git diff
*HEAD
*Making corrections Git reset
*Branching

3. GitHub

*GitHub
*Git Clone

4. Conclusion

*Conclusion

cours 23:

Customer Analytics in Python

1. A Brief Marketing Introduction

*Course Introduction
*Segmentation, Targeting, Positioning
*Marketing Mix
*Physical and Online Retailers: Similarities and Differences.
*Price Elasticity

2. Setting up the environment

*Setting up the environment


*Installing the relevant packages

3. Segmentation Data

*Getting to know the Segmentation Dataset


*Importing and Exploring Segmentation Data
*Standardizing Segmentation Data

4. Hierarchical Clustering

*Hierarchical Clustering: Background


*Hierarchical Clustering: Implementation and Results

5. K-means Clustering

*K-Means Clustering: Background


*K-Means Clustering: Application
*K-Means Clustering: Results

6. K-Means Clustering based on Principal Component Analysis

*Principal Component Analysis: Background


*Principal Component Analysis: Application
*Principal Component Analysis: Results
*K-Means Clustering with Principal Components: Application
*K-Means Clustering with Principal Components: Results
*Saving the Models

7. Purchase Data

*Purchase Analytics - Introduction


*Getting to know the Purchase Dataset
*Importing and Exploring Purchase Data
*Applying the Segmentation Model

8. Descriptive Analyses by Segments

*Purchase Analytics Descriptive Statistics: Segment Proportions


*Purchase Analytics Descriptive Statistics: Purchase occasion and Purchase Incidence
*Brand Choice
*Dissecting the revenue by segment

9. Modeling Purchase Incidence

*Purchase Incidence Models. The Model: Binomial Logistic Regression


*Prepare the Dataset for Logistic Regression
*Model Estimation
*Calculating Price Elasticity of Purchase Probability
*Price Elasticity of Purchase Probability: Results
*Purchase Probability by Segments
*Purchase Probability Model with Promotion
*Calculating Price Elasticities with Promotion
*Comparing Price Elasticities with and without Promotion

10. Modeling Brand Choice

*Brand Choice Models. The Model: Multinomial Logistic Regression


*Prepare Data and Fit the Model
*Interpreting the Coefficients
*Own Price Brand Choice Elasticity
*Cross Price Brand Choice Elasticity
*Own and Cross-Price Elasticity by Segment
*Own and Cross-Price Elasticity by Segment - Comparison

11. Modeling Purchase Quantity

*Purchase Quantity Models. The Model: Linear Regression


*Preparing the Data and Fitting the Model
*Calculating Price Elasticity of Purchase Quantity
*Own Price Brand Choice Elasticity

12. Deep Learning for Conversion Prediction


*Introduction to Deep Learning for Customer Analytics
*Exploring the Dataset
*How Are We Going to Tackle the Business Case
*Why do We Need to Balance a Dataset
*Preprocessing the Data for Deep Learning
*Outlining the Deep Learning Model
*Training the Deep Learning Model
*Testing the Model
*Obtaining the Probability of a Customer to Convert
*Saving the Model and Preparing for Deployment
*Predicting on New Data

cours 24:

Web Scraping and API Fundamentals in Python

1. Introduction to the course

*What does the course cover


*What is Web Scraping
*Ethics of Scraping

2. Setting Up the Environment

*Setting up the Environment


*Installing the Necessary Packages

3. Working with APIs

*API overview
*HTTP requests GET and POST requests
*JSON preferred data exchange format for APIs
*Exchange rates API GETting a JSON response
*Incorporating parameters in a GET request
*Additional API functionalities
*Creating a simple currency converter
*iTunes API
*Homework
*Homework - 2
*iTunes API Structuring and exporting the data
*GitHub API Pagination
*EDAMAM API Initial setup and registration
*EDAMAM API Sending a POST request
*Downloading files with Requests

4. HTML overview

*What is HTML?
*Structure of HTML
*Syntax of HTML. Tags
*Tag attributes
*Popular tags
*CSS and JavaScript
*Character encoding
*XHTML and code style

5. Web Scraping with Beautiful Soup

*Introduction to the Beautiful Soup package


*Workflow of Web Scraping
*Setting up your first scraper
*Searching and navigating the HTML tree
*Searching the HTML tree by attributes
*Extracting data from the HTML tree
*Extracting text from an HTML tag
*Practical example dealing with links
*Homework BeautifulSoup Section 1
*Extracting data from nested HTML tags
*Scraping multiple pages automatically

6. Practical project: Scraping Rotten Tomatoes

*Setting up your scraper


*Extracting the title and year of each movie
*Homework BeautifulSoup Section 2 - Score
*Extracting the rest of the information
*Dealing with the cast of the movies
*Homework BeautifulSoup Section 2 - Rest
*Storing and exporting the data in a structured form

7. Scraping HTML tables

*Scraping Steam
*Scraping HTML tables with the help of Pandas

8. Common roadblocks when scraping

*Common roadblocks when Web Scraping

9. The requests-html package

*Introduction to the requests-html package


*Exploring the capabilities of requests-html for Web Scraping
*Searching for text
*CSS selectors
*Scraping JavaScript
*Scraping SoundCloud

cours 25:

Introduction to Python

1. Why Python?

*Course Introduction
*Setting Up the Environment

2. Python Variables and Data Types

*Python Variables
*Understanding Numbers and Boolean Values
*Strings

3. Basic Python Syntax

*The Arithmetic Operators of Python


*What is the Double Equality Sign?
*How to Reassign Values
*How to Add Comments
*Understanding Line Continuation
*How to Index Elements
*How to Structure Your Code with Indentation

4. More on Operators

*Python's Comparison Operators


*Python's Logical and Identity Operators

5. Conditional Statements

*Getting to know the IF Statement


*Adding an ELSE Statement
*Else if, for Brief – ELIF
*An Additional Explanation of Boolean Values

6. Functions

*How to Define a Function in Python


*How to Create a Function with a Parameter
*Another Way to Define a Function
*How to use a Function within a Function
*Use Conditional Statements and Functions Together
*How to Create Functions Which Contain a Few Arguments
*Built-In Functions in Python Worth Knowing

7. Sequences

*Introduction to Lists
*Using Methods in Python
*What is List Slicing?
*Working with Tuples
*Python Dictionaries

8. Iteration

*Using For Loops


*Using While Loops and Incrementing
*Use the range() Function to Create Lists
*Combine Conditional Statements and Loops
*All In – Conditional Statements, Functions, and Loops
*How to Iterate over Dictionaries

9. A few important Python concepts and terms

*Object-Oriented Programming (OOP)


*Modules, Packages and the Python Standard Library
*Importing Modules in Python
*What is Software Documentation?
*The Python Documentation

cours 26:

Python for Finance

1. Useful Tools

*Course Introduction
*Must-Have Packages for Finance and Data Science
*Working with Arrays
*Generating Random Numbers
*Important Note on Using Online Financial Data Sources
*Using Financial Data in Python
*Importing and Organizing Data in Python - Part I
*Importing and Organizing Data in Python - Part II
*Importing and Organizing Data in Python - Part III
*Changing the Index of Your Time-Series Data
*Restarting the Jupyter Kernel

2. Calculating and Comparing Rates of Return in Python

*Considering Both Risk and Return


*What Are We Going to See Next
*Calculating a Security's Rate of Return
*Calculating a Security's Rate of Return in Python - Simple Returns - Part I
*Calculating a Security's Rate of Return in Python - Simple Returns - Part II
*Calculating a Security's Rate of Return in Python - Logarithmic Returns
*What Is a Portfolio of Securities and How to Calculate Its Rate of Return
*Using 'Loc' and 'Iloc' - Notes
*Calculating the Rate of Return of a Portfolio of Securities
*Popular Stock Indices
*Calculating the Rate of Return of Indices

3. Measuring Investment Risk


*How Do We Measure a Security's Risk
*Calculating a Security's Risk in Python
*The Benefits of Portfolio Diversification
*Calculating the Covariance Between Securities
*Measuring the Correlation between Stocks
*Calculating Covariance and Correlation
*Considering the Risk of Multiple Securities in a Portfolio
*Calculating Portfolio Risk
*Understanding Systematic vs. Idiosyncratic Risk
*Calculating Diversifiable and Non-diversifiable Risk of a Portfolio

4. Using Regressions for Financial Analysis

*The Fundamentals of Simple Regression Analysis


*Running a Regression in Python
*Are All Regressions Created Equal? Learning How to Distinguish Good Regressions
*Computing Alpha, Beta, and R Squared in Python

5. Markowitz Portfolio Optimization

*Markowitz Portfolio Theory - One of the main Pillars of Modern Finance


*Obtaining the Efficient Frontier in Python - Part I
*Obtaining the Efficient Frontier in Python - Part II
*Obtaining the Efficient Frontier in Python - Part III

6. The Capital Asset Pricing Model

*The Intuition behind the Capital Asset Pricing Model (CAPM)


*Understanding and Calculating a Security's Beta
*Calculating the Beta of a Stock
*The CAPM Formula
*Calculating the Expected Return of a Stock (CAPM)
*Introducing the Sharpe Ratio and the Way It Can Be Applied in Practice
*Obtaining the Sharpe Ratio in Python
*Measuring Alpha and Verifying How Good (or Bad) a Portfolio Manager Is Doing

7. Multivariate Regression Analysis

*Multivariate Regression Analysis - a Valuable Tool for Finance Practitioners

*Running a Multivariate Regression in Python

8. Monte Carlo Simulations as a Decision-Making Tool

*The Essence of Monte Carlo Simulations


*What is a Normal Distribution? - Note
*Monte Carlo Applied in a Corporate Finance Context
*Monte Carlo: Predicting Gross Profit - Part I
*Monte Carlo: Predicting Gross Profit - Part II
*Forecasting Stock Prices with a Monte Carlo Simulation
*Another Way to Calculate Simple and Log Returns - Note
*Monte Carlo: Forecasting Stock Prices - Part I
*Monte Carlo: Forecasting Stock Prices - Part II
*Monte Carlo: Forecasting Stock Prices - Part III
*An Introduction to Derivative Contracts
*The Black-Scholes Formula for Option Pricing
*Monte Carlo: Black-Scholes-Merton
*Monte Carlo: Euler Discretization - Part I
*Monte Carlo: Euler Discretization - Part II

cours 26:

Advanced Microsoft Excel

1. Welcome to Advanced Excel

*Welcome to the Course


*Why Excel and Why Modeling

2. Proficient Excel Formatting

*Let's Start from Scratch and Create a P&L Sheet


*Cell Styles Allow You to Be Faster
*Pasting Values, Formulas, and Formats with Paste Special
*Formatting Cells Part I - Working with Data in Excel
*Formatting Cells Part II - Customize Numbers the Way You Like
*Highlight Key Data with Excel Conditional Formatting
*Filter by Color

3. How to Be 3.0x Faster than Average Users

*Use Multiple Screens Simultaneously


*F1 to F12 - Using Excel's Function Keys
*How to Select Visible Cells Only
*Grouping Excel Rows and Columns - The Correct Way to Do It!
*Working on Multiple Sheets at the Same Time

4. Excel Mechanics

*Find & Replace - Our Favourite Excel Tool


*A Great Way to Apply Find & Replace
*What Are Circular References in Excel
*Circular References - An Example
*Trace Precedents - Display the Relationship Between Formulas and Cells

5. Not So Simple Excel Functions

*What is a Nested Function?


*Advanced Excel Functions Index; Match; Index & Match
*Advanced Excel Functions Index, Match, Match
*Advanced Excel Functions Indirect; Vlookup & Indirect
*Advanced Excel Functions Rows; Columns; Vlookup & Columns
*Advanced Excel Functions Vlookup & Match
*Advanced Excel Functions Choose; Vlookup & Choose
*Advanced Excel Functions Offset; Offset & Match
*Date Functions

6. Excel Tips & Tricks

*Excel Tips & Tricks Part 1


*Excel Tips & Tricks Part 2
*Excel Tips & Tricks Part 3
*Excel Tips & Tricks Part 4

cours 27:

Convolutional Neural Networks with TensorFlow in Python

1. Introduction to the course

*What does the course cover?


*Why CNNs?

2. Kernels

*Introduction to image kernels


*How do image transformations work?
*Kernels as matrices
*Convolution - applying kernels
*Edge handling

3. CNN Introduction

*CNNs motivation
*Feature maps
*Pooling and Stride
*Dimensions

4. Neural network techniques (revision)

*Activation functions
*Overfitting and early stopping
*Optimizers

5. Setting up the environment

*Setting up the environment - Do not skip, please!


*Installing the packages

6. CNN assembling - MNIST

*Road plan
*A simple CNN architecture
*Preprocessing the data
*Building and training the CNN
*MNIST CNN Homework
*Testing the trained CNN

7. Tensorboard: Visualization tool for TensorFlow

*Tensorboard on the MNIST example


*Confusion matrix and visualizing it with Tensorboard
*Confusion Matrix Homework
*Using Tensorboard to tune hyperparameters
*Hyperparameter Tuning Homework

8. Common techniques for better performance of neural networks

*Introduction
*Regularization
*L2 Regularization and weight decay
*Dropout
*Data augmentation

9. A practical project: Labelling fashion items

*Introduction to the problem


*The objective and the images
*Converting images to arrays
*Getting started with the code concepts
*Primary classification task - Part 1
*Primary classification task - Part 2
*Primary classification task - Part 3
*Glasses and Sunglasses Homework
*Trousers and Jeans - discussion of approaches
*Trousers and Jeans - All
*Trousers and Jeans - Gender + Type
*Trousers and Jeans - Type - Homework
*Shoes Homework
*Trousers and Jeans - comparing the methods
*L2 regularization and Dropout
*Data augmentation - Shoes All

10. Understanding CNNs

*Unexpected failures

11. Popular CNN architectures

*Introduction - the ILSVRC challenge


*AlexNet - CNN success
*VGG - more layers
*GoogLeNet - computational efficiency
*ResNet - revolution of depth

cours 28:

Data Strategy

1. Introduction to Data Strategy

*Welcome to the course!

2. Deciding Your Strategic Data Needs


*Delineating the 5 strategic data use case areas

3. Using Data to Improve Your Decisions

*Section introduction
*Curated dashboards vs. self-service data exploration
*Challenges related to self-service data exploration
*Asking key business questions first (KBQs)
*The power of clear Key Business Questions (KBQs)
*How to ask the right Key Business Questions
*Giving people access to data
*Curating the most important data insights

4. Using Data to Understand Your Customers and Markets

*Section intro
*How this butcher uses data to understand customers
*Netflix use case
*Amazon use case
*The increasing need for real-time data to understand customers and markets

5. Using Data to Provide More Intelligent Services

*Using data to provide more intelligent services

6. Using Data to Make More Intelligent Products

*Using data to make more intelligent products

7. Using Data to Improve Your Business Processes

*Using data to improve your business processes

8. Monetising Your Data

*Monetising your data - intro


*The Shotspotter case study

9. Defining Your Data Use Cases

*Defining data use cases walk through (part 1)


*Defining data use cases walk through (part 2)
*Defining data use cases walk through (part 3)

10. Sourcing and Collecting the Data

*Section introduction
*Structured vs Semi-structured Data
*Internal vs External Data
*Different Types of Data
*Meta Data
*The Importance of Realtime Data
*Gathering Internal Data
*Accessing External Data
*Sources of External Data
*When the data you want doesn't exist

11. Data Governance

*Section introduction
*To own or not to own
*Ensuring the correct rights are in place
*Case study on building trust

12. Turning Data into Insights

*Section introduction
*Text analytics
*Sentiment analytics
*Image analytics
*Video analytics
*Voice analytics
*Data mining
*Business experiments
*Visual analytics
*Correlation Analysis
*Regression Analysis
*Scenario Analysis
*Time series analysis
*Monte Carlo simulation
*Linear programming
*Cohort analysis
*Factor analysis
*Neural network analysis
*Deep learning
*Reinforcement learning

13. Creating the Technology and Data Infrastructure

*Section introduction
*How to collect data
*Database, Data warehouse, Data mart and Data lake
*How to store data
*How to process data
*Communicating data
*What is a KPI dashboard
*How to design a KPI Dashboard
*Reporting lessons from journalists
*Using KPI dashboard software
*Big data as a service

14. Building the Data Competencies in Your Organization

*Section introduction
*Skills shortage
*The skills needed for a data scientist
*Building internal skills and competencies
*Outsourcing your data analysis
*Leadership challenges

15. Executing and Revisiting Your Strategy

*Putting the data strategy into action


*Why data strategies fail
*Creating a data culture
*Revisiting the data strategy
*A changing business environment
*Changing technology landscape

16. Looking Ahead

*Using data for good

cours 29:

Fashion Analytics with Tableau

1. Introduction to the Course

*What does the Course Cover


*What Is Fashion Analytics?
*Realizing the Potential of Analytics in Fashion
*The Breadth of Analytical Options
*Requirements for Embedding Analytics in Fashion

2. Consumer-Driven Marketing

*Marketing Fundamentals for Analytics


*Analytics Activation in Consumer-Driven Marketing
*An Overview of Cluster Analysis and Consumer Scoring

3. Consumer Analytics – Product Recommendation

*What Is a Product Recommendation System?


*Collaborative and Content-Based Filtering
*Similarity Measures for Product Recommendation Engines

4. Digital and Web Analytics

*Introduction to Digital and Web Analytics


*Experience Analytics and Attribution Models
*Clickstream Analytics and A/B Testing
*Challenges and Opportunities in Web Analytics

5. Supply Chain Analytics

*Introduction to Supply Chain Analytics


*Advanced Analytics in the Supply Chain
*Applications of Analytics in the Supply Chain

6. Integrated Demand Forecasting

*What Is Integrated Demand Forecasting?


*Integrated Demand Forecasting in the Fashion Industry
*Data Science in Demand Forecasting
*How Do Industry Leaders Use Integrated Demand Forecasting?

7. Pricing Optimization

*What Is Pricing Optimization?


*Advanced Analytics in Pricing Optimization
*How Does the Fashion Industry Leverage Optimization in Pricing?

8. Store Localization, Clustering, and In-store Optimization

*Introduction to Store Localization, Clustering, and In-store Optimization


*Advanced Analytics in Localization, Clustering, and In-store Optimization

9. AI for Predicting Fashion Trends

*Introduction to Artificial Intelligence in Fashion


*Image Recognition and Market Intelligence in Fashion

10. Case Study – Building a Fashion Analytics Story in Tableau

*Introduction to the Case Study


*Getting to Know the Dataset
*Creating a Company KPIs Table in Tableau - Net Sales
*Creating a Company KPIs Table in Tableau - Gross Profit Margins
*Creating a Map Chart of Consumer Countries in Tableau
*Creating a Customer KPIs Table in Tableau
*Creating a Bar Chart of Consumer Metrics in Tableau
*Building the Frequency Curve in Tableau
*Building the Repurchase Curve in Tableau
*Creating the Dashboards for the Fashion Analytics Story in Tableau
*Styling and Formatting the Fashion Analytics Report
*Interpretation of the Data Analytics Fashion Story

cours 30:

Dates and Times in Python

1. Dates and Times in Python

*What Does the Course Cover


*Introduction to Working with Dates and Times
*Time Standards and Regulations: GMT and UTC
*Principles of Working with Dates and Times in Computing – Unix Time
*Principles of Working with Dates and Times in Computing – Dates in Python
*The datetime Module: working with Dates
*The datetime Module: working with Time
*The datetime Module: working with the datetime class
*.strftime(): Converting datetime Objects into Strings
*.strptime(): Converting Strings into datetime Objects
*pd.to_datetime(): Converting Data into datetime Objects
*Focus on Working with Timestamps
*Practical Application of Working with Timestamps in Python
*Converting between Timezones in Python: the pytz Module
*Converting between Timezones with pandas

2. A Practical Example: Working with Dates, Times, and Time Zones

*Practical Example Exercise - Dates and Times


*Introduction
*Part I - Data Preparation part 1
*Part I - Data Preparation part 2
*Part II - Data Manipulation part 1
*Part II - Data Manipulation part 2
*Part III - Data Analysis
*Part IV - Data Visualization

cours 31:

SQL for Data Science Interviews

1. Introduction

*Why Take This Course and What Does It Cover

2. SQL Overview and Environment Setup

*Introduction to Section 2
*What is SQL?
*SQL Server Setup

3. Fundamental SQL Components

*Introduction to Section 3
*Structure of a SQL Query
*SQL Functions to Master

4. Full Mock Interviews

*Introduction to Section 4
*5 Step Framework for SQL Interviews
*What Comes Next
*Mock Interview 1
*Mock Interview 2
*Mock Interview 3
*Mock Interview 4
*Mock Interview 5
*Mock Interview 6
*Mock Interview 7
*Mock Interview 8
*Mock Interview 9
*Mock Interview 10

5. Bonus

*How to Recover from Mistakes


*How to Anticipate Followups
*How to Remain Calm

cours 32:

Data Literacy

1. Introduction to Data Literacy

*What Exactly is Data Literacy


*Why do We Need Data Literacy
*Data-driven Decision Making
*Benefits of Data Literacy
*How to Get Started

2. Understanding Data

*Data Definition
*Qualitative vs. Quantitative Data
*Structured vs. Unstructured Data
*Data at Rest vs. Data in Motion
*Transactional vs. Master Data
*Big Data
*Storing Data
*Database
*Data Warehouse
*Data Marts
*The ETL Process
*Apache Hadoop
*Data Lake
*Cloud Systems
*Edge Computing
*Batch vs. Stream Processing
*Graph Database

3. Using Data

*Analysis vs. Analytics


*Descriptive Statistics
*Inferential Statistics
*Business Intelligence (BI)
*Artificial Intelligence (AI)
*Machine Learning (ML)
*Supervised Learning
*Regression Analysis
*Time Series Forecasting
*Classification
*Unsupervised Learning
*Clustering Analysis
*Association Rules
*Reinforcement Learning
*Deep Learning
*Natural Language Processing (NLP)

4. Reading Data

*Reading Data
*Data Quality Assessment
*Data Description
*Measures of Central Tendency
*Measures of Spread

5. Interpreting Data

*Interpreting Data
*Correlation Analysis
*Correlation Coefficient
*Correlation and Causation
*Simple Linear Regression
*R-squared
*Forecasting
*Forecast Errors
*Statistical Tests
*Hypothesis Testing
*P-Value
*Statistical Significance
*Classification Models
*Accuracy
*Recall and Precision

cours 33:

AI Applications for Business Success

1. Introduction

*What Does the Course Cover


*Course Resources

2. Business Goals

*Introduction to the business case


*SWOT Analysis
*SMART Goals
*Limitations of the BI Approach
*Correlation vs. Causation
*Making Recommendations with Descriptive Statistics

3. Approaches to solving the business objective

*Introduction to the approaches


*The BI Approach
*State Space and Takens' Theorem
*Shadow Manifolds and K-Nearest Neighbors

4. Artificial Intelligence in Business

*Quantifying Attainability
*Gradient Boosted Machines: Part 1
*Gradient Boosted Machines: Part 2
*Gradient Boosted Machines: Part 3
*SHAP Values
*Friedman's H-Statistic
*LIME
*Waterfall Charts 1
*Waterfall Charts 2
*Causation: Traditional Statistical Methods
*Causation: Advanced Statistical Methods
*Time Series Forecasting with Takens' Theorem

5. Artificial Intelligence Recommends Metrics

*Introduction
*The Hybrid Experiment
*Quantile Difference Tests

cours 34:

Linear Algebra and Feature Selection

1. Linear Algebra Essentials

*What Does the Course Cover


*Why Linear Algebra?
*Solving Quadratic Equations
*Vector
*Matrices
*The Transpose of Vectors and Matrices, the Identity Matrix
*Linear Independence and Linear Span of Vectors
*Basis of a Vector Space, Determinant of a Matrix, Inverse of a Matrix
*Solving Equations of the Form A*x=b
*The Gauss Method
*Other Solutions to the Equation A*x=b
*Determining Linear Independence of a Random Set of Vectors
*Eigenvalues and Eigenvectors
*Calculating Eigenvalues
*Calculating Eigenvectors

2. Dimensionality Reduction Motivation

*Feature Selection, Feature Extraction, and Dimensionality Reduction


*The Curse of Dimensionality

3. Principal Component Analysis (PCA)

*Principal Component Analysis – Overview


*A Step-by-Step Explanation of PCA on California Estates – Example
*The Theory Behind PCA
*PCA Covariance Matrix in Jupyter – Analysis and Interpretation

4. Linear Discriminant Analysis (LDA)

*Overall Mean and Class Means


*Linear Discriminant Analysis – Overview
*LDA: Calculating Within- and Between-Class Scatter Matrices
*A Step-by-Step Explanation of LDA on a Wine Quality Dataset – Example
*Calculating the Within- and Between-Class Scatter Matrices
*Calculating Eigenvectors and Eigenvalues for the LDA
*Analysis of LDA
*LDA vs. PCA
*Setting Up the Classifier to Compare LDA and PCA
*Coding the Classifier for LDA and PCA
*Analysis of the Training and Testing Times for the Classifier and Its Accuracy

cours 35:

Machine Learning in Excel

1. Introduction

*Course Introduction
*What Is Machine Learning?
*Types of Machine Learning

2. Simple Linear Regression

*Linear Regression: Introduction


*Linear Regression
*Linear Regression Model (Graphical Representation)
*Formatting Excel Spreadsheet
*First Regression in Excel
*What Is OLS?
*Interpreting Regression Tables (Part 1)
*Decomposition of Variability
*Interpreting Regression Tables (Part 2)
*Interpreting Regression Tables (Part 3)

3. Multiple Linear Regression

*Multiple Regression Analysis


*Multiple Linear Regression (Example)
*Multiple Linear Regression (Results)
*OLS Assumptions
*OLS Assumptions: Linearity
*OLS Assumptions: No Endogeneity
*OLS Assumptions: Normality and Homoscedasticity
*OLS Assumptions: No Autocorrelation
*OLS Assumptions: No Multicollinearity
*Dummy Variables
*Dummy Variables - Exercise
*Making Predictions Using Linear Regression
*Making Predictions Using Linear Regression - Exercise

4. Linear Regression Practical Example

*Practical Example (part 1)


*Practical Example (part 2)
*Practical Example (part 3)
*A note on multicollinearity
*Feature Scaling
*Practical Example (part 4)

5. Logistic Regression

*Introduction to Logistic Regression


*From Linear to Logistic Regression
*Logistic vs. Logit Function
*Applying Logistic Regression in Excel
*Interpreting Regression Coefficients
*Logistic Regression with Xreal
*Understanding the Logistic Regression Summary (part 1)
*Understanding the Logistic Regression Summary (Part 2)
*ROC Curve
*Binary Predictors for Logistic Regressions
*Underfitting and Overfitting
*Testing the Logistic Model

6. Cluster Analysis

*Cluster Analysis (Definition)


*Cluster Analysis (Application)
*Clustering vs Classification
*Cluster Analysis (Math Prerequisites)

7. K-means Clustering

*K-means Clustering
*K-means Clustering in Excel
*K-means Clustering with Xreal
*Choosing the Number of Clusters
*Clustering Categorical Data
*Standardization
*Clustering and Regression
*Clustering (Pros and Cons)
*Types of Clustering
*Market Segmentation (Part 1)
*Market Segmentation (Part 2)

8. Decision Trees

*Decision Trees
*Entropy (Loss function)
*Information Gain
*Decision Trees in Excel (Part 1)
*Decision Trees in Excel (part 2)
*Decision trees (Prediction)

9. Machine Learning in the Cloud

*Machine Learning in the Cloud


*Setting up Azure Machine Learning Studio (AMLS)
*First Experiment in AMLS (Part 1)
*First Experiment in AMLS (Part 2)
*Machine Learning in the Cloud (Assignment)
*Publishing a Web Service
*Azure Assignment
*The Future of Machine Learning

cours 36:

A/B Testing in Python

1. Introduction to A/B testing

*Welcome to the course: meet your instructor


*What is A/B testing and why is it so important?
*The key characteristics of an A/B test
*How to create an A/B test? Who does it?
*How to know if an A/B test was successful? Defining KPIs and metrics
*Calculation of metrics in practice: Kittengram

2. Setting up and executing A/B tests in practice

*Data instrumentation and tracking


*How to calculate metrics from raw datasets
*Designing the experiment
*How to set up the A/B test
*What is statistical significance?
*Calculating the sample size of an A/B test
*Example of significance power calculator
*A/B test - start & analysis
*How to present the results of an A/B test
*A/B test analysis process
*Comparing the activity between the groups

3. Advanced A/B testing considerations and interview prep


*Advanced A/B testing considerations
*How to A/B test ethically

4. Interview preparation

*Introduction
*Question 1
*Question 2
*Question 3
*Question 4
*Question 5
*How to prepare for the interview

5. Conclusion

*Conclusion

cours 37:

Machine Learning with Decision Trees and Random Forests

1. Introduction to Decision Trees and Random Forests

*What does the course cover?

2. Setting up the Environment

*Setting up the environment


*Installing the relevant packages

3. Decision Trees

*What Is a Tree in Computer Science?


*The Concept of Decision Trees
*Decision Trees in Machine Learning
*Decision Trees: Pros and Cons
*Practical Example: The Iris Dataset
*Practical Example: Creating a Decision Tree
*Practical Example: Plotting the Tree
*Decision Tree Metrics Intuition: Gini Impurity
*Decision Tree Metrics: Information Gain
*Tree Pruning: Dealing with Overfitting

4. Random Forests

*Random Forest as Ensemble Learning


*Bootstrapping
*From Bootstrapping to Random Forests
*Random Forest in Code – Glass Dataset
*Census Data and Income – Preprocessing
*Training the Decision Tree
*Training the Random Forest

cours 38:

Data-Driven Business Growth

1. Introduction

*Introduction to the course

2. The stages of data maturity

*The stages of data maturity and what you will see next
*How to go from no data to some data - Reach Level 1

3. Data Maturity Level 1 - Project 1

*Data maturity Level 1


*Intro to Project 1
*Project 1 - data files
*Why do the analysis?
*Formulating an analysis plan
*The data we will use
*Exploring the data: large dataset
*Exploring the data: small dataset
*Customer journey
*Top of funnel opportunities
*Middle of funnel opportunities
*Bottom of funnel opportunities
*Test and learn
*Next steps
*How to get to data maturity Level 2
*How to ask for funding

4. Data Maturity Level 2 - Project 2

*Data Maturity Level 2


*Intro to Project 2
*The Crawl stage
*The Walk stage
*The Run stage
*A/B testing and A/B test mechanics
*Statistical significance and potential outcomes
*The impact of sample size
*Test power vs. lift
*How to get to Data Maturity Level 3
*How to ask for funding

5. Data Maturity Level 3


*Data Maturity Level 3

6. Project 3

*Intro to Project 3
*Exploring the dataset
*Analyzing purchase rate across groups
*Understand the business problem and specify your objectives
*Explore data. Create, train, and test dataset
*Perform the analysis

7. Beyond Data Maturity Level

*What to do to improve even further

cours 39:

Power Query and Data Modeling

1. Intro to Power Query

*What is Power Query


*Importing Data
*Data Types
*Replacing Values
*Merge Columns
*Extract Values
*Remove Duplicates
*Remove Columns
*Filters
*Date Functions
*Conditional Columns
*Unpivot
*Merge Queries
*Parameters
*Query Properties
*Data Profiling
*Web Scraping
*Mathematical Functions

2. Data Modeling

*Data modeling
*Relationships
*Role Level Security
*Global Superstore report

cours 40 :
Introduction to DAX

1. Introduction to DAX

*Course intro
*Calculated Columns
*Calendar Table
*Building a Measure
*Measures Table
*CALCULATE
*FILTER vs KEEPFILTERS
*Iterators
*RELATED vs RELATEDTABLE
*SELECTEDVALUE
*DIVIDE
*Logical Operators
*Variables
*TREATAS
*SWITCH
*Text Functions
*CONCATENATEX
*Time Intelligence
*Expression Based Titles
*Role Level Security
*Calculation Groups
*DAX assignment

cours 41 :

Building Business Reports Using Power BI

1. Building business reports using Power BI

*Course intro
*Power BI Canvas
*Drawing Shapes
*Slicers
*Filled Map
*Matrix Table
*Donut Chart
*Interactive Tooltip
*Gauge Visual
*KPI Visual
*Card Visual
*Area Chart
*Bookmarks
*Format Painter
*Stacked Column Chart
*Two reports on one page

cours 42:

Working with Text Files in Python

1. Introduction to Working with Text Files and Data

*Introduction to Working with Text Files in Python


*The Difference between File and File Object; Read vs. Parse
*Defining Structured, Semi-Structured and Unstructured Data
*What is Data Connectivity: A Text Files Perspective

2. Principles of Importing Data in Python

*Principles of Importing Data in Python


*More on Working with Text Files (*.txt vs *.csv)
*What about Fixed-width Files?
*Python Programming: Common Naming Conventions

3. Importing Text Files in Python

*Python: Importing Text Files ( open() )


*Python: Importing Text Files ( with open() )
*pandas: Importing *.csv Files - Part I
*pandas: Importing *.csv Files - Part II
*pandas: Importing *.csv Files - Part III
*Python: Importing Data with the "index_col" Parameter
*Python: Importing *.json Files
*Introduction to Working with Excel Files in Python
*Dealing with Excel Data (the *.xlsx Format)
*Importing Data in Python - an Important Exercise
*Python: Importing Data with the pandas' .squeeze() Method
*Jupyter: A General Note on Importing Files
*pandas: Saving Your Data

4. Conclusion

*Working with Text Files - Conclusion

cours 43:

Intermediate Python Programming


1. Intermediate Python Programming - Introduction

*Course Introduction
*Python Refresher and Setting Up the Environment

2. Working with Text Data in Python

*Dealing with Text Data and Argument Specifiers


*Working with Python Strings at the Next Level
*Exploring Python String Methods - Part I
*Exploring Python String Methods - Part II
*Learning How to Use String Accessors
*Working with the .format() Method

3. Nested for Loops, List Comprehensions, and Anonymous Functions

*The Concept of Iterating Over Range Objects in Python


*Introduction to Nested For Loops
*Working with Triple Nested For Loops
*Using List Comprehensions
*Working with Anonymous (Lambda) Functions

cours 44:

Advanced SQL

1. SQL Window Functions

*SQL Window Functions - Section Introduction


*SQL Window Functions - Introduction
*The ROW_NUMBER() Ranking Window Function
*The ROW_NUMBER() Ranking Window Function - Exercise
*Using Several Window Functions in a Query
*Using Several Window Functions in a Query - Exercise
*SQL Window Functions Syntax
*SQL Window Functions Syntax - Exercise
*PARTITION BY vs GROUP BY
*PARTITION BY vs GROUP BY - Exercise
*The RANK() and DENSE_RANK() Window Functions
*The RANK() and DENSE_RANK() Window Functions - Exercise
*Using MySQL Ranking Window Functions and Joins Together
*Using MySQL Ranking Window Functions and Joins Together - Exercise
*The LAG() and LEAD() Value Window Functions
*The LAG() and LEAD() Value Window Functions - Exercise
*SQL Aggregate Functions and Window Functions - Part I
*SQL Aggregate Functions and Window Functions - Part I - Exercise
*SQL Aggregate Functions and Window Functions - Part II - Exercise
*SQL Aggregate Functions and Window Functions - Part II

2. SQL Common Table Expressions (CTEs)

*SQL Common Table Expressions (CTEs) - Section Introduction


*SQL Common Table Expressions (CTEs) - Introduction
*The Same CTE Task - an Alternative Solution
*The Same CTE Task - an Alternative Solution - Exercise
*A WITH Clause with Multiple Subclauses - Part I
*A WITH Clause with Multiple Subclauses - Part II - Exercise
*A WITH Clause with Multiple Subclauses - Part II
*Referring to CTEs in a WITH Clause

3. SQL Temporary Tables

*SQL Temporary Tables - Introduction


*SQL Temporary Tables - Application
*SQL Temporary Tables - Application - Exercise
*SQL Temporary Tables - Extra Features
*SQL Temporary Tables - Extra Features - Exercise

cours 45:

The Machine Learning Algorithms A-Z

1. Course Introduction

*Introduction
*ML Algorithms course - GitHub repository
*How to Use this Course
*Types of ML Problems
*Additional Resources

2. Linear Regression

*Linear Regression
*Real World Business Problems
*Example: Linear Regression
*Intuition: Linear Regression
*Training Step-by-Step: Linear Regression
*Prediction: Linear Regression
*Assumptions: Linear Regression
*Assumption #1: Model is linear in coefficients and error terms
*Assumption #2: Homoscedasticity
*Assumption #3: Multicollinearity
*Assumption #4: Independence/Autocorrelation
*Assumption #5: Normally Distributed Error Terms
*Assumption #6: Outliers
*Inference - Interpreting Output
*AB Testing Example
*ML Process: Linear Regression
*Pros & Cons, When to Use

3. Ridge, Lasso, Elastic Net


*Ridge, Lasso, Elastic Net
*Intuition: Ridge, Lasso, Elastic Net
*Plain Definition: Ridge, Lasso, Elastic Net
*Shrinkage Methods vs. Feature Selection
*Step-by-Step Intuition: Ridge, Lasso, Elastic Net
*Lasso Regression (L1)
*Ridge Regression (L2)
*ElasticNet (L1 + L2)
*Determining the Degree of Regularization
*Difference between Lasso & Ridge
*Link to resources
*When to use: Ridge, Lasso, Elastic Net

4. Logistic Regression

*Introduction: Logistic Regression


*Example: Logistic Regression
*Intuition: Logistic Regression
*Real World Business Problems: Logistic Regression
*What is Logit
*Step-by-Step Prediction: Logistic Regression
*Step-by-Step Training: Logistic Regression
*Assumptions: Logistic Regression
*Understanding Logistic Regression Output
*Maximum Likelihood Explained
*Log Loss
*Predicting Multiple Classes using Multinomial Logistic Regression
*ML Process: Logistic Regression
*Pros & Cons, When to Use

5. Gradient Descent

*Gradient Descent
*Intuition: Gradient Descent
*Plain Definition: Gradient Descent
*Step-by-Step: Gradient Descent
*Assumptions: Gradient Descent
*Parameter Tuning (Step size, Alpha)
*Gradient Descent Pros and Cons
*Stochastic Gradient Descent
*Pros and Cons: Gradient Descent

6. Decision Trees

*Decision Trees
*Example: Decision Trees
*Plain Explanation: Decision Trees
*Different Components of Decision Trees Explained
*Real World Business Example: Decision Trees
*Assumptions: Decision Trees
*Training Step-by-Step: Decision Trees
*Prediction Step-by-Step: Decision Trees
*Additional Metrics: Decision Trees
*Tuning the Parameters: Decision Trees
*ML Process: Decision Trees
*Decision Trees Assumptions
*Pros and Cons: Decision Trees
*When to Use Decision Trees

7. Random Forest

*Random Forest
*Intuition: Random Forest
*Example: Random Forest
*Real World Business Problems: Random Forest
*Plain Definition: Bagging
*Where Bagging Fails
*Plain Definition: Random Forest
*Step-by-Step (Training): Random Forest
*Step-by-Step (Prediction): Random Forest
*How Random Forest gives us Feature Importance
*Out of Bag Error
*ML Process: Random Forest
*When to use: Random Forest
*Pros and Cons: Random Forest

8. Gradient Boosted Trees

*Gradient Boosted Trees


*Example: Gradient Boosted Trees
*Real World Business Problems: Gradient Boosted Trees
*Plain Definition: Gradient Boosted Trees
*Terminology: Gradient Boosted Trees
*Assumptions: Gradient Boosted Trees
*Training Step-by-Step (Regression): Gradient Boosted Trees
*Training Step-by-Step (Classification): Gradient Boosted Trees
*Prediction Step-by-Step: Gradient Boosted Trees
*What does the “Gradient” mean
*How Gradient Boosted Trees give us Feature Importance
*ML Process: Gradient Boosted Trees
*When to use Gradient Boosted Trees

9. XGBoost

*Intuition: XGBoost
*Real World Business Problems: XGBoost
*Plain Definition: XGBoost
*XGBoost Algorithm Improvements
*System Improvements
*ML Process: XGBoost
*When to use XGBoost
*Pros and Cons: XGBoost

10. K Nearest Neighbors

*Intuition: KNN
*Example: KNN
*Plain Definition: KNN
*Assumptions: KNN
*Training Step-by-Step: KNN
*Prediction Step-by-Step: KNN
*Tuning Parameters: KNN
*ML Process: KNN
*When to use KNN

11. K-Means Clustering

*Intuition: K-Means Clustering


*Example: K-Means Clustering
*Plain Definition: K-Means Clustering
*Real World Business Problems: K-Means Clustering
*Step-by-Step Training: K-Means Clustering
*Selecting K
*Silhouette Method
*Hard Clustering vs. Soft Clustering
*Derivatives of K-Means
*Assumptions: K-Means Clustering
*ML Process: K-Means Clustering
*When do we use K-Means Clustering

12. Hierarchical Clustering

*Intuition: Hierarchical Clustering


*Real World Business Problems: Hierarchical Clustering
*Definition: Hierarchical Clustering
*Step-by-Step Agglomerative Clustering
*Linkages
*Distance
*ML Process: Hierarchical Clustering
*Pros and Cons: Hierarchical Clustering
*When to Use: Hierarchical Clustering

13. Support Vector Machines

*Intuition: SVM
*Real World Business Problems: SVM
*Step-by-Step Training (Non-Technical): SVM
*Loss Function
*Nonlinear Data
*Prediction (Step-by-Step): SVM
*Terminology: SVM
*Assumptions: SVM
*Soft vs Hard Margins: SVM
*How to use SVMs as a multi-class classifier
*How does SVM Regression Work
*ML Process: SVM
*Pros & Cons (Classifier): SVM
*When to use an SVM Classifier

14. Artificial Neural Nets

*Intuition: Artificial Neural Nets


*Real World Business Problems: Artificial Neural Nets
*Example: Artificial Neural Nets
*Multi-Layered Networks: Artificial Neural Nets
*Classification - Activation Layers
*Vanishing Gradient Problem
*Activation Layers
*Embeddings
*Types of ANNs
*Transfer Learning
*ML Process: Artificial Neural Nets
*Pros and Cons: Artificial Neural Nets

15. Collaborative Filtering - Non-Negative Matrix Factorization

*Intuition: Collaborative Filtering


*Plain Definition: Collaborative Filtering
*Real World Business Problems: Collaborative Filtering
*Assumptions: Collaborative Filtering
*Different Approaches to Collaborative Filtering
*Matrix Factorization Intuition
*Matrix Factorization Definition
*Assumptions: NMF
*Step-by-Step (prediction): Collaborative Filtering
*Step-by-Step (training): Collaborative Filtering
*Determining the ideal number of latent variables
*Addressing the Cold-Start Problem
*ML Process: Collaborative Filtering
*Pros and Cons: Collaborative Filtering
*When to use NMF

16. Naïve Bayes

*Bayes Theorem
*Intuition and Plain Definition
*Step-by-Step Explanation - First Part
*Step-by-Step Explanation - Second Part
*Why is Naïve Bayes called Naïve?
*The types of Naïve Bayes
*ML Process: Naïve Bayes
*Pros and Cons: Naïve Bayes
*Real-Life Business Example
*When to use Naïve Bayes

17. Practical projects

*Regression project
*Classification project

cours 46:

Customer Churn Analysis with SQL and Tableau

1. Introduction to The Course

*Introduction to the course

2. Intro to the case study

*Motivation - Our Story


*Intro to the business case
*Basic Terminology - Net Revenue, Refunds, New vs Recurring Revenue
*Types of customers
*Basic Terminology - Customer Churn and Customer Retention
*Defining key questions for the customer churn analysis
*Creating an outline of the dashboard

3. Data Preparation

*Types of data access


*Query 1 - Net Revenue
*How to export data from Workbench
*Query 2 - User Resurrections and how to export data from Workbench

4. Creating a customer churn story in Tableau part I

*Bar Chart - Net Revenue and Refunds


*Charts Formatting: Styling Tips for Business Dashboards in Tableau
*Customer Churn—Line Chart in Tableau
*Telling a Story with Data: Create a Dashboard to Add to the Tableau Story
*Creating a date filter for page one of the report
*New vs Recurring Revenue—Combination Charts in Tableau
*Revenue by Country—Map Charts in Tableau
*Top 10 Countries by Revenue—Tables in Tableau
*Chart Formatting and Creating the Dashboard in Tableau

5. Creating a customer churn story in Tableau part II

*User Resurrections - Bubble Chart in Tableau


*User Resurrections – Bar Chart in Tableau
*Assignment 2 – User Upgrades Bubble and Bar Chart
*Building the Tableau Dashboard - Page 5
*Annual Subscribers - Order Frequency Combination Chart in Tableau
*Assignment 3 – Monthly subscribers
*Annual Subscribers – Cohort Analysis Table in Tableau
*Building the Tableau Dashboard – Page 6

6. Dashboard Interpretation

*Dashboard Interpretation – Net Revenue and Refunds


*Dashboard Interpretation – Churn Rates
*Dashboard Interpretation – New vs Recurring Revenue
*Dashboard Interpretation – Maps and Revenue by Country – Top 10
*Dashboard Interpretation – Maps and Revenue by Country – Average Order Value (AOV)
*Dashboard Interpretation – User Upgrades
*Dashboard Interpretation – User Resurrections
*Dashboard Interpretation – Order Frequency Table and Cohort Analysis – Annual Subscriptions
*Dashboard Interpretation – Order Frequency Tables and Cohort Analysis – Monthly Customers
*What more is out there?

cours 47:

Machine Learning Deep Dive: Business Applications and Coding Walkthroughs

1. ML Business Use Cases

*Linear Regression
*Logistic Regression
*Random Forest
*K-Means Clustering
*K-Nearest Neighbors
*Hierarchical Clustering
*Support Vector Machines
*Artificial Neural Networks
*Collaborative Filtering

2. Coding Walkthroughs

*Introduction
*Linear Regression - First Part
*Linear Regression - Second Part
*Linear Regression - Third Part
*Logistic Regression
*Decision Trees - First Part
*Decision Trees - Second Part
*Decision Trees - Third Part
*Random Forest - First Part
*Random Forest - Second Part
*Gradient Boost - First Part
*Gradient Boost - Second Part
*KNN - First Part
*KNN - Second Part
*K-Means Clustering - First Part
*K-Means Clustering - Second Part
*Hierarchical Clustering - First Part
*Hierarchical Clustering - Second Part
*SVM
*Neural Network - First Part
*Neural Network - Second Part
*Neural Network - Third Part
*NMF - First Part
*NMF - Second Part
*Naïve Bayes

cours 48:

Customer Engagement Analysis with SQL and Tableau


1. Introduction to the Course

*What Does the Course Cover?


*Motivation - Our Story

2. Introduction to the Customer Engagement Analysis Case Study

*The Case Study


*Defining the Key Questions (Part 1)
*Defining the Key Questions (Part 2)
*Sketching the Dashboard (Part 1)
*Sketching the Dashboard (Part 2)

3. Retrieving Relevant Data from the Database

*Types of Data Access


*Importing and Presenting the Database
*SQL Query – Activity by Dates (Part 1)
*SQL Query – Activity by Dates (Part 2)
*SQL Query – Activity by Dates (Part 3)
*Assignment 1

4. Building the Dashboard – Introduction to Dashboards

*Fixing the Size of the Dashboard


*The Horizontal, Vertical, and Blank Objects
*The Text, Image, and Navigation Objects
*Creating an Item Hierarchy
*Tiled versus Floating Layout

5. Building the Dashboard – The Overview Page

*Creating the KPIs (Part 1)


*Creating the KPIs (Part 2)
*Assignment 2
*Horizontal Bar Chart – Overall Minutes Watched
*Formatting the Horizontal Bar Chart
*Assignment 3
*Swapping Between Charts Using a Parameter
*Donut Chart – Ratings
*Assignment 4
*Creating the First Page of the Dashboard (Part 1)
*Assignment 5
*Creating the First Page of the Dashboard (Part 2)
*Assignment 6
*Creating the First Page of the Dashboard (Part 3)

6. Building the Dashboard – The Engagement and Onboarding Page

*Line Chart – Number of Engaged Students (Choosing the Engagement Period)


*Line Chart – Onboarded to Registered Students (Choosing the Engagement Period)
*Assignment 7
*Line and Bar Charts - Number of Engaged Students (Choosing the Month and Year)
*Assignment 8
*Creating the Second Page of the Dashboard
*Assignment 9
*Configuring the Navigation Buttons
*Assignment 10

7. Building the Dashboard – The Engagement by Cohorts Page

*What is a Cohort Analysis?


*Cohort Tables – Retention of Student Engagement
*Retention Curves – Retention of Student Engagement
*Assignment 11
*Creating the Third Page of the Dashboard
*Assignment 12
*Creating a Hover Tooltip
*Assignment 13

8. Building the Dashboard – The Exams and Certificates Page

*Horizontal Stacked Bar Chart – Number of Exams Attempted


*Assignment 14
*Bar Charts – Certificates Issuance and Career Track Funnel
*Assignment 15
*Creating the Fourth Page of the Dashboard
*Assignment 16

9. Building the Dashboard – The Content Consumption Page

*Combo Charts – Minutes Watched and Minutes Watched per User


*Combo Chart – F2P Conversion Rate and Subscription Duration

10. Discussion and Insights

*Interpreting the Data (Platform Engagement)


*Interpreting the Data (Onboarding and Retention)
*Interpreting the Data (Content Consumption)
*Interpreting the Data (F2P CR and Subscription Duration)
*Interpreting the Data (Courses Engagement)
*Interpreting the Data (Exams)
*Interpreting the Data (Certificates)
*Further Steps and Improvement

cours 49 :

Advanced SQL for Data Engineering

1. Setting up the environment and overview of relational databases

*Introduction to the course


*Course GitHub repository
*Setting up the environment
*An overview of relational databases
*DDL, DML, DQL, DCL
*SQL syntax

2. Manipulating databases: Mastering essential SQL statements

*Intro SQL statements


*CREATE
*ALTER
*INSERT
*UPDATE
*DELETE
*MERGE
*DROP

3. Timing is everything: managing and manipulating DateTime in SQL

*DateTime intro
*Different DateTime types
*Timezones
*Intervals

4. Complex data types: ENUMs, ARRAYs, Ranges, and Nested data in SQL

*ENUM
*ARRAYs
*RANGE
*Nested data

5. Advanced query techniques: exploring OVER, JOINS, CASE and more in SQL

*OVER
*CROSS JOIN
*LATERAL JOIN
*CROSS JOIN LATERAL
*COALESCE
*CASE
*CONCAT
*Recursive CTE
*Recursive CTE - second part

6. Optimizing data structures: The art and science of data normalization in SQL

*Data normalization (1/3)


*Data normalization (2/3)
*Data normalization (3/3)
*STAR schema Snowflake

7. Performance and control: Mastering stored procedures, temporary tables, and control structures in SQL

*Stored procedures and UDFs (1/3)


*Stored procedures and UDFs (2/3)
*Stored procedures and UDFs (3/3)
*Temp table
*Materialized view
*Transactions
*SQL structures

8. Practical tasks

*Question 1
*Question 2
*Question 3
*Question 4

cours 50:

Communication and Presentation Skills for Analysts and Managers

1. Welcome

*What to expect in this course


*"I'm not a natural storyteller"

2. Why: Data presentation mistakes you might be making

*The presentation I messed up


*Video: Spot the presentation mistakes
*Create your own presentation baseline

3. Why: The science of storytelling

*Learn about the neuroscience of decision-making


*My cousin who smoked cigarettes
*Why storytelling matters

4. How: The first step of your data presentation

*GAME framework
*Goal: Use these 3 storytelling chemicals
*Goal: Determine the focus of your data presentation
*Audience: Ask these questions
*Audience: 4 preferred ways of communication
*Audience: Get buy-in with the IKEA effect
*Audience: Think about what language they speak
*Audience: The curse of knowledge
*Message & Expression
*Exercise: Use the GAME framework

5. How: Creating a good story

*What is a good story? Introducing the ABT framework


*Applying the ABT framework
*Exercise: Create your own ABT story
*How to visualize your ABT story
*ABT framework - more resources
*The ABT framework - more resources
*Shark tank pitch (spot the storytelling elements)

6. How: Building your data story

*How to create an effective data storyline


*Case 1 - introduction
*Case 1 - solution
*Characters in your data story
*Case 2 - introduction
*Case 2 - solution
*Data trailer
*Pyramid principle

7. How: Using PowerPoint effectively

*Funeral for your darlings: choose the right insights


*How to avoid overwhelming your audience
*Death by PowerPoint (TED Talk)
*How to deal with people asking to send the presentation

8. How: The days before the presentation

*How to practice your presentation


*Which part to practice most
*How to deal with presentation stress

9. How: Presenting with impact

*Do a visual and auditory assessment


*How to keep your audience engaged
*How to get rid of filler words
*Tip for remote presentations
*How NOT to talk in your presentation
*Body language - Two tips you haven't heard before
*Own the stage
*Tip for remote presentations

10. Bonus: How to make insights more persuasive

*My book
*Talk benefits, not features
*Loss aversion
*7 principles to make your ideas stick

11. More bonus resources

*Data visualization best practices


*Chartjunk
*Six data visualization lessons from 1 image
*Choose your chart
*Dashboard design

12. Additional bonus

*How to find more stories in your life


*Bonus - group coaching call
*Take the next step in data storytelling

cours 51:

Intro to ChatGPT and Generative AI


1. Intro to ChatGPT and Generative AI

*How to sign up for ChatGPT. Create your first ChatGPT prompt


*Why is everyone so hyped about ChatGPT?
*What ChatGPT is and is not
*Understanding the technology behind ChatGPT: AI, NLP, LLMs
*The main types of AI
*Other notable AI projects

2. Effective prompt engineering in ChatGPT

*Prompt engineering or how to ask better questions


*How to prompt effectively
*Complete practical example - How to prompt effectively
*How to follow up AI responses
*Enhancing critical thinking with multi-perspective responses from ChatGPT
*Important ChatGPT commands you need to remember

3. Working with text files

*ChatGPT's reading comprehension


*How to split and merge PDFs using ChatGPT
*Generate training data with AI
*How to use ChatGPT as the perfect Excel tutor and work assistant
*AI Image comprehension
*AI Image generation
*AI image editing
*How to summarize an article with ChatGPT

4. Customized GPTs: Build Your Custom ChatGPT

*What are GPTs?


*Why Create Your Own GPTs?
*Best Practices When Creating GPTs
*Introducing the GPT Store
*How to Create Your Own GPT
*GPT Conversation Starters
*Adding Custom Actions to GPTs
*How to Add GPTs to Existing Conversations
*GPT Share Settings

5. Practical Application, Ethical Considerations, and The Future of AI


*How to connect to the OpenAI API
*Ethics in AI: Balancing progress and principle
*AI hallucination
*Why you and your company must embrace AI

cours 52:

Data Analysis in Power BI with ChatGPT

1. Data Analysis in Power BI with ChatGPT


*Introduction to the Course
*Download Power BI
*Describing Data
*Querying ChatGPT
*Building Your First Chart
*Slicing Your Data
*Formatting Your Chart
*Converting IDs into Dimensions
*Fixing Slicers
*Working with Canvas Size
*Churn Calculations
*Formatting Your Second Chart
*Formatting a Dashboard
*Smart Narrative

cours 53:

Mastering Key Performance Indicators (KPIs)

1. Understanding Key Performance Indicators: Introduction

*Course introduction
*KPIs as navigation tools
*KPIs and fighter jets
*Understanding KPIs through the lens of weight loss goals
*Aligning Key Performance Indicators with strategic goals

2. Map your strategic goals

*Agree on what matters


*The Smart Strategy Board template
*How to use the Smart Strategy Board template
*Visualizing strategic goals
*Strategic goals - The dynamics of constant evaluation and adaptation
*Cascading goals
*Support functions
*Action plan alignment

3. Introduction to Key Performance Questions

*Introduction to Key Performance Questions


*The importance of diversity in metrics
*How to develop KPQs
*Applying Key Performance Questions - Google case study
*Applying Key Performance Questions - butcher case study
*Section recap

4. Introduction to Key Performance Indicators

*Introduction to Key Performance Indicators


*Definition of KPIs
*The difference between Goal, KPI, and Target
*Leading vs Lagging indicators
*Tangible vs Intangible KPIs
*Qualitative vs Quantitative KPIs
*How many KPIs should you have
*Finding valuable KPIs
*Measuring behaviour, not opinion
*Measuring behaviour - Gucci example
*Data explosion and KPIs
*KPI evolution - supermarket example

5. Working with a KPI design template

*The KPI template


*KPI template - Net profit example
*KPI Template - NPS example
*KPI design tips

6. KPIs in action: Diverse use cases across industries

*Developing financial KPIs


*Developing customer KPIs - hotel example
*Customer KPIs - building materials supplier
*Customer engagement KPIs
*Operational KPI examples
*Resource and people KPIs
*Key Risk Indicators
*Section recap

7. Managing performance

*Managing performance - introduction


*From KPIs to insights
*KPI reporting
*Using a KPI reporting template
*RAG ratings
*RAG rating example - RAF
*Data visualisation
*KPI reporting - example
*KPI review meetings
*KPI meeting template
*Section recap

8. Conclusion

*Best practices
*Final words

cours 54:

Sign-Up Flow Optimization Analysis with SQL and Tableau

1. Intro to Sign-Up Flow Optimization Analysis

*What Does the Course Cover?


*Motivation

2. Case Study

*Understanding the 365's Sign-Up Process
*Basic Terminology
*Defining the Key Aspects of the Sign-Up Process
*Sketching the Story

3. SQL: Data Retrieving

*Types of Data Access


*Getting to know the Database
*SQL Query 1: Sign-Up Conversion Rate
*SQL Query 2: Sign-Up Types and Errors - Part 1
*SQL Query 2: Sign-Up Types and Errors - Part 2
*SQL Query 3: Login Types and Errors - Homework

4. Tableau: Dashboard Sheets

*SQL and Tableau


*Sign-Up Conversion Rate
*Device and Operating System Distribution
*Preferred Devices Horizontal Bar Chart
*Preferred Operating Systems Horizontal Bar Chart
*Sign-Up Types and Errors Bar Chart
*Login Types and Errors Bar Charts

5. Creating the Tableau Story

*Creating Dashboards: Getting Familiar with the Canvas


*Sign-Up Conversion Rates: Sheet Swapping with Parameters
*Sign-Up Conversion Rates: Containers
*Preferred Sign-up Devices and Operating Systems
*Sign-Up Types and Errors
*Login Types and Errors
*Formatting the Tableau Story - Part 1
*Formatting the Tableau Story - Part 2

6. Analysis Report

*Dashboard Application for an Analysis Report


*Current State of Affairs: Observing Story Point 1 - Conversion Rate
*Current State of Affairs: Observing Story Point 2 - Sign-Up Types and Preferences
*Current State of Affairs: Observing Story Point 2 - Sign-Up Types Success and Fail Rates
*Current State of Affairs: Observing Story Point 3 - Login Types
*Business Objective
*Hypothesis and Opportunity Sizing - Part 1
*Hypothesis and Opportunity Sizing - Part 2
*Actionable Insights Based on the Analysis

7. A/B Testing

*Understanding the Fundamentals of A/B Testing
*Setting Up an A/B Test: Confidence, Significance, and p-values
*Setting Up an A/B Test: Minimum Detectable Effect and Statistical Power
*Running an A/B Test: Execution and Analysing the Results

8. Discussion and Further Analysis of a Website Sign-Up Flow

*Discussion on the Observations


*Conclusions and Recommendations

cours 55:

Python for Social Media Analytics

1. Course Introduction

*Welcome
*Why is this the right time to learn Python for social media analytics?

2. Introduction to Social Media Marketing

*Intro to social media marketing


*The marketing funnel

3. Social Media Marketing Terminology

*What is a marketing campaign?


*Optimizations for ad delivery
*What are ad sets?
*What is an ad?
*Core, custom, and lookalike audiences
*Facebook pixel

4. Social Media Marketing Metrics

*Intro to social media marketing metrics


*Reach & impressions
*CTR
*Link clicks
*Conversion rate
*CPM
*CPC
*CPR
*ROAS
*A message from the instructor

5. Intro to Facebook Ads Manager

*Intro to Facebook Ads Manager

6. Creating a Facebook Campaign

*Creating your first campaign on Ads Manager


*Creating an ad set
*Creating an ad

7. Analyzing a Facebook Campaign with Python - Overall Campaign Performance

*Examining the dataset


*Facebook campaign summary
*Calculating the CPM
*Calculating the CPR

8. Analyzing a Facebook Campaign with Python - Ad Set Overview

*Ad set overview


*Warm, hot, and cold ad sets
*LAL vs Detailed ad sets - Part 1
*LAL vs Detailed ad sets - Part 2

9. Analyzing a Facebook Campaign with Python - Detailed Ad Set Analysis

*Analyzing Hot ad sets - Part 1


*Analyzing Hot ad sets - Part 2
*Analyzing warm ad sets
*Analyzing cold ad sets - Part 1
*Analyzing cold ad sets - Part 2
*Analyzing Cold ad sets - Part 3

cours 56:

Intro to NLP for AI

1. Introduction

*Introduction to the course


*Course Materials and Notebooks
*Introduction to NLP
*NLP in everyday life
*Supervised vs Unsupervised NLP

2. Text Preprocessing

*The importance of data preparation


*Lowercase
*Removing stop words
*Regular expressions
*Tokenization
*Stemming
*Lemmatization
*N-grams
*Practical task

3. Identifying Parts of Speech and Named Entities

*Text Tagging
*Parts of speech (POS) tagging
*Named entity recognition (NER)
*Practical task

4. Sentiment Analysis

*What is sentiment analysis?


*Rule-based sentiment analysis
*Pre-trained transformer model
*Practical task

5. Vectorizing Text

*Numerical representation of text


*Bag of Words Model
*TF-IDF

6. Topic Modelling

*What is topic modelling?


*When to use topic modelling?
*Latent Dirichlet Allocation
*LDA in Python
*Latent Semantic Analysis
*LSA in Python
*How many topics?

7. Building your own text classifier

*Building a custom text classifier


*Logistic regression
*Naive Bayes
*Linear Support Vector Machine

8. Case Study: Categorizing Fake News

*Introducing the project


*Exploring our data through POS tags
*Extracting named entities
*Processing the text
*Does sentiment differ between news types?
*What topics appear in fake news? (Part 1)
*What topics appear in fake news? (Part 2)
*Categorizing fake news with a custom classifier

9. The Future of NLP

*What is deep learning?


*Deep learning for NLP
*Non-English NLP
*What's next for NLP?

cours 57:

Data Analysis with ChatGPT

1. Course introduction

*Course introduction

2. Overview of ChatGPT Advanced Data Analysis

*How ChatGPT Advanced Data Analysis will boost your productivity


*Is ChatGPT Advanced Data Analysis the killer of the data analyst and data scientist professions?
*The limitations of ChatGPT Advanced Data Analysis

3. Exploratory Data Analysis (EDA) with ChatGPT

*Exploratory data analysis - intro


*Extracting descriptive statistics with ChatGPT
*Identifying outliers
*Correlation analysis: Hypothesis formulation and feature ideas
*Crafting a comprehensive EDA prompt

4. Preprocess data with ChatGPT - Deal with inconsistent data

*Identifying and handling erroneous data


*Verifying data for errors
*Eliminating duplicate rows
*Addressing unit discrepancies in data
*Dividing cell content into multiple cells

5. Preprocess data with ChatGPT - Organize messy data

*Organize messy data


*Data mapping
*Effective data classification
*Order data with ChatGPT

6. Preprocess data with ChatGPT - Working with functions

*COUNTIFS, SUMIFS
*Data lookup
*Data ranking
*Fill until non-blank values
*Translating logic into Excel formulas with ChatGPT

7. Preprocess data with ChatGPT - Clean text data

*Correcting inverted names


*Dealing with trailing and leading spaces in text data
*Standardizing letter capitalization
*Handling multi-lingual text data
*Multiple "Find and Replace" at the same time with ChatGPT
*Criteria-based duplicate removal
*Removing duplicates across multiple columns

8. Preprocess data with ChatGPT - Dates and time data

*Obtain day and month from date


*Converting hours to decimals
*Obtaining age in years, months, days
*Order dates chronologically

9. Complete Practical example

*Intro to the exercise and preprocessing ideas from ChatGPT


*Preprocessing the 'Free plan users' sheet (1/2)
*Preprocessing the 'Free plan users' sheet (2/2)
*Working on the 'Transactions' sheet
*Dealing with the country mapping issue
*Note from author

cours 58 :

ChatGPT for Data Science

1. Introduction to the Data Science Process and ChatGPT

*Introduction to the course


*Traditional data science methods and the role of ChatGPT
*How to install ChatGPT
*How ChatGPT can boost your productivity

2. Data Science Use Cases

*Data Preprocessing with ChatGPT


*First attempt at machine learning with ChatGPT
*Analyzing a client database with ChatGPT in Python
*Analyzing a client database with ChatGPT in Python – analyzing top products
*Analyzing a client database with ChatGPT in Python – analyzing top clients, RFM analysis
*Exploratory data analysis (EDA) with ChatGPT - histogram and scatter plot
*Exploratory data analysis (EDA) with ChatGPT - correlation matrix, outlier detection
*Comprehensive Report on Dataset Analysis
*Hypothesis testing with ChatGPT
*Marvels comic book database: Intro to Regular Expressions (RegEx)
*Decoding comic book data: Python Regular Expressions and ChatGPT
*Advanced Analysis of Comic Book Database Using Regular Expressions
*Algorithm recommendation: Movie Database Analysis with ChatGPT
*Algorithm recommendation: recommendation engine for movies with ChatGPT
*Enhancing the Movie Database Recommendation Engine
*Ethical principles in data and AI utilization
*Using ChatGPT for ethical considerations

3. Intro to the Case Study

*Intro to the case study


*Naïve Bayes
*Tokenization and Vectorization
*Imbalanced data sets
*Overcome imbalanced data in machine learning
*Model performance metrics

4. Case Study User Reviews

*Loading the Dataset and Preprocessing


*Optimizing User Reviews: Data Preprocessing & EDA
*RegEx for Analyzing Text Review Data
*Understanding Differences between Multinomial and Bernoulli Naive Bayes
*Machine Learning with Naïve Bayes (First Attempt)
*Machine Learning with Naïve Bayes – converting the problem to a binary one
*Testing the model on new data

cours 59:

Intro to LLMs

1. Introduction to Large Language Models

*Introduction to the course


*Course Materials and Notebooks
*What are LLMs?
*How large is an LLM?
*General purpose models
*Pre-training and fine tuning
*What can LLMs be used for?

2. The Transformer Architecture

*Deep learning recap


*The problem with RNNs
*The solution: attention is all you need
*The transformer architecture
*Input embeddings
*Multi-headed attention
*Feed-forward layer
*Masked multihead attention
*Predicting the final outputs

3. Getting started with GPT models

*What does GPT mean?


*The development of ChatGPT
*OpenAI API
*Generating text
*Customizing GPT Output
*Key word text summarization
*Coding a simple chatbot
*Introduction to Langchain in Python
*Langchain
*Adding custom data to our chatbot

4. Hugging Face Transformers

*Hugging Face package


*The transformer pipeline
*Pre-trained tokenizers
*Special tokens
*Hugging Face and PyTorch, TensorFlow
*Saving and loading models

5. Question and answer models with BERT

*GPT vs BERT
*BERT architecture
*Loading the model and tokenizer
*BERT embeddings
*Calculating the response
*Creating a QA bot
*BERT, RoBERTa, DistilBERT

6. Text classification with XLNet


*GPT vs BERT vs XLNet
*Preprocessing our data
*XLNet Embeddings
*Fine tuning XLNet
*Evaluating our model

cours 60:

Growth Analysis with SQL, Python, and Tableau

1. Introduction to the Course

*What Does the Course Cover?


*Motivation - Our Story

2. Case Study Overview

*Introduction to the Project


*Term Definitions - Visitors, UTMs, Channel of Acquisition
*Term Definitions - User Types and Funnels
*Key Questions and Metrics
*Sketching the Dashboard (Part 1)
*Sketching the Dashboard (Part 2)

3. Retrieving Relevant Data from the Database

*Types of Data Access


*The Database - Part 1
*The Database - Part 2
*Funnels - Theory
*Funnel - SQL Query (Part 1)
*Funnel - SQL Query (Part 2)
*Funnel - SQL Query (Part 3)
*Waterfall Chart - Theory
*Waterfall Chart - SQL Query
*Waterfall Chart - Processing the Data in Python (Part 1)
*Waterfall Chart - Processing the Data in Python (Part 2)
*Waterfall Chart - Processing the Data in Python (Part 3)
*Waterfall Chart - Data Export (Part 4)
*Assignment 1 - Create SQL queries

4. Crafting the Graphs

*The Structure of This Section


*The Free and Conversions Graph (Part 1)
*The Free and Conversions Graph (Part 2)
*The Free and Conversions Graph (Part 3)
*The Free and Conversions Graph (Part 4)
*The Free and Conversions Graph (Part 5)
*Assignment 2 - Labels toggle
*The Free and Conversions Graph (Part 6)
*Paid per Date Graph
*Paid per Date – Styling and Formatting
*Waterfall Chart
*Assignment 3 - Other Waterfall Chart Periods
*Visitors Bar Chart
*Assignment 4 - Create Visitors Bar Chart for Monthly
*Homepage KPIs
*Assignment 5 - Homepage KPIs for Quarterly and Annual
*Assignment 6 - Create Maps and Funnel
*The Rest of the Graphs

5. Assembling the Dashboard

*Dashboard Beginnings – The Title Banner


*The Title Banner (Part 2)
*Incorporating the Graphs into the First Page
*The First Page – Continuation
*Constructing the Second Page – Sheet Swapping
*Assignment 7 - Populate Pages 3, 4 and 5
*Assignment 8 - Create the Homepage
*Constructing the Homepage

6. Discussions and Insights

*The Significance of Data Interpretation


*Free Users Data
*Paid Users Data
*Waterfall Chart – Subscriptions
*Visitors Data
*Maps
*Closing Thoughts

cours 61:

Intro to Data Engineering

1. Introduction

*What will this course teach you

2. Data Engineering Career

*What is a Data Engineer?


*Data Engineering lifecycle
*Similar careers to Data Engineering
*Data Engineering service models
*Data Engineer leveling guide
*Technical skills of a Data Engineer

3. Data Architecture

*What is Data Architecture


*A sample Data Architecture
*Data lakes, Swamps, Warehouses, and Marts
*Modern data stack
*Connecting to data
*Good Data Architecture

4. Data Orchestration

*Data Pipelines and Data Orchestration
*Apache Airflow

5. Relational Databases

*Relational Database Overview


*Organizing Relational Databases
*Relational Database Types
*Interacting with Relational Databases SQL

6. Non-relational Databases

*ACID Properties
*Document Databases
*Key-Value Database
*Object Storage
*Further reading
*Columnar Database
*Graph Database
*No-SQL Database Questions

7. Software Engineering

*Horizontal Scaling vs Vertical Scaling


*Python
*APIs
*Shell Scripting
*Cron
*Version Control - Git - Mercurial
*Testing
*Docker and Containerization
*Infrastructure Management

8. Big Data Engineering

*What is Big Data?


*Hadoop
*Spark
*Kafka

9. Data Modeling

*Logical Physical Data Model


*Entity Relationship Diagrams
*Normalization
*Kimball and Inmon Data Warehousing

10. Security and Privacy

*The Non-optionality of Security and Privacy


*PII
*Principle of Least Privilege

cours 62 :

Statistical Tests in Sales and Marketing

1. Introduction to the Course

*Welcome to the World of Statistical Tests!


*The Importance of Statistical Testing in Sales and Marketing
*Course Approach

2. Case 1: Chi-Square Test of Independence

*Case Description: Packaging Uniqueness
*Research Question
*Operationalization
*Sample
*Statistic and Parameter
*Levels of Measurement
*Contingency Table
*Hypothesis Testing – Introduction
*Null and Alternative Hypothesis
*Test Statistic
*P-value
*Statistical Significance
*Degrees of Freedom
*Chi-Square Test - Introduction
*Chi-Square Test of Independence
*Case Summary and Discussion

3. Case 2: One-Sample t-test

*Case Description: Price per Volume (PPV)
*Mean and Median
*Standard Deviation
*Normal Distribution
*Histogram
*Q-Q plot
*One-Sample t-Test
*One- vs. Two-Tailed Tests
*Case Summary and Discussion

4. Case 3: Two-Sample t-test

*Case Description: In-store Display
*Dependent vs. Independent Variables
*Shapiro-Wilk Test for Normality
*Levene's Test for Equality of Variances
*Two-Sample t-Test
*Case Summary and Discussion

5. Case 4: Paired Samples t-Test

*Case Description: Return Policy
*Independent vs. Dependent Samples
*Paired Samples t-Test
*Case Summary and Discussion

6. Case 5: Mann-Whitney U Test

*Case Description: Sales Incentive Program
*Parametric vs Non-Parametric Tests
*Mann-Whitney U Test
*Case Summary and Discussion

7. Wilcoxon Signed-Rank Test

*Case Description: Package Redesign
*Wilcoxon Signed-Rank Test
*Case Summary and Discussion

8. Chi-Square Goodness-of-Fit Test

*Case Description: Store Footfall
*Chi-Square Goodness-of-Fit Test
*Case Summary and Discussion

9. Conclusion

*Conclusion

cours 63 :

Build Chat Applications with OpenAI and LangChain

1. Introduction to the Course

*What Makes LangChain Powerful?
*What Does the Course Cover?

2. Tokens, Models, and Prices

*Tokens
*Models and Prices

3. Setting Up the Environment

*Setting Up a Custom Anaconda Environment for Jupyter Integration
*Obtaining an OpenAI API Key
*Setting the API Key as an Environment Variable

4. The OpenAI API

*First Steps
*System, User, and Assistant Roles
*Creating a Sarcastic Chatbot
*Temperature, Max Tokens, and Streaming

5. Model Inputs

*The LangChain Framework
*ChatOpenAI
*System and Human Messages
*AI Messages
*Prompt Templates and Prompt Values
*Chat Prompt Templates and Chat Prompt Values
*Few-Shot Chat Message Prompt Templates
*LLMChain

6. Message History and Chatbot Memory

*Chat Message History
*Conversation Buffer Memory: Implementing the Setup
*Conversation Buffer Memory: Configuring the Chain
*Conversation Buffer Window Memory
*Conversation Summary Memory
*Combined Memory

7. Output Parsers

*String Output Parser
*Comma-Separated List Output Parser
*Datetime Output Parser

8. LangChain Expression Language (LCEL)

*Piping a Prompt, Model, and an Output Parser
*Batching
*Streaming
*The Runnable and RunnableSequence Classes
*Piping Chains and the RunnablePassthrough Class
*Graphing Runnables
*RunnableParallel
*Piping a RunnableParallel with Other Runnables
*RunnableLambda
*The @chain Decorator
*Adding Memory to a Chain (Part 1): Implementing the Setup
*RunnablePassthrough with Additional Keys
*Itemgetter
*Adding Memory to a Chain (Part 2): Creating the Chain

9. Retrieval Augmented Generation (RAG)

*How to Integrate Custom Data into an LLM
*Introduction to RAG
*Introduction to Document Loading and Splitting
*Introduction to Document Embedding
*Introduction to Document Storing, Retrieval, and Generation
*Indexing: Document Loading with PyPDFLoader
*Indexing: Document Loading with Docx2txtLoader
*Indexing: Document Splitting with Character Text Splitter (Theory)
*Indexing: Document Splitting with Character Text Splitter (Code Along)
*Indexing: Document Splitting with Markdown Header Text Splitter
*Indexing: Text Embedding with OpenAI
*Indexing: Creating a Chroma Vector Store
*Indexing: Inspecting and Managing Documents in a Vector Store
*Retrieval: Similarity Search
*Retrieval: Maximal Marginal Relevance Search
*Retrieval: Vector Store-Backed Retriever
*Generation: Stuffing Documents
*Generation: Generating a Response

10. Tools and Agents

*Introduction to Reasoning Chatbots
*Tools, Toolkits, Agents, and Agent Executors
*Creating a Wikipedia Tool and Piping It to a Chain
*Fixing the GuessedAtParserWarning
*Creating a Retriever and a Custom Tool
*LangChain Hub
*Creating a Tool Calling Agent and an Agent Executor
*AgentAction and AgentFinish

cours 64:

Introduction to Vector Databases with Pinecone

1. Introduction to Vector Databases

*Introduction to the course
*Database comparison: SQL, NoSQL, and Vector
*Understanding vector databases

2. Basics of Vector Space and High-Dimensional Data

*Introduction to vector space
*Distance metrics in vector space
*Vector embeddings walkthrough

3. Introduction to The Pinecone Vector Database

*Vector databases, comparison
*Pinecone registration, walkthrough and creating an Index
*Connecting to Pinecone using Python
*Creating a new index
*Creating and deleting a Pinecone index using Python
*Upserting data to a Pinecone vector database
*Getting to know the FineWeb dataset and loading it to Jupyter
*Upserting data from a text file and using an embedding algorithm

4. Case Study: Semantic Search with Pinecone and Custom Data

*Introduction to semantic search
*Introduction to the case study – smart search for data science courses
*Getting to know the data for the case study
*Data loading and preprocessing
*Pinecone Python APIs and connecting to the Pinecone server
*Embedding algorithms
*Embedding the data and upserting the files to Pinecone
*Similarity search and querying the data
*How to update and change your vector database
*Data preprocessing and embedding for courses with section data
*Courses and Sections Together Assignment
*Upserting the new updated files to Pinecone
*Similarity search and querying courses and sections data
*Weighted semantic search assignment
*Using the BERT embedding algorithm
*Vector database for recommendation engines
*Vector database for semantic image search
*Vector database for biomedical research

cours 65:

Intro to Revenue Analytics

1. Course Introduction

*Course introduction
*Why this course
*Course structure

2. Understanding revenue

*Basics of revenue
*Key concepts related to revenue
*How revenue is recorded in accounting books

3. Measuring revenue growth

*Year-on-year % growth
*CAGR - Compound Annual Growth Rate
*Run rate
*Net revenue retention

4. Generating revenue

*Sales target
*Sales cycle
*Sales pipeline
*Great work!

5. Landscaping revenue streams

*Introduction and learning objectives
*What is a revenue stream?
*Sales revenue: the traditional pillar
*Subscription revenue: the recurring income model
*Advertising revenue: monetizing attention
*Licensing revenue: monetizing intellectual property and brand
*Affiliate marketing: collaboration for profit
*Rental or leasing revenue: monetizing assets
*Consulting and service fees: selling expertise
*Donations and crowdfunding: community support
*Section summary

6. Intro to pricing analytics

*Section introduction
*Unraveling the basics of pricing
*From production costs to net revenue
*Why pricing is important
*Price elasticity of demand
*The 5 Cs of pricing
*Key pricing strategies
*Section summary

7. Intro to customer success analytics

*Section introduction
*Customer success and its importance
*Key components of customer success
*Net Promoter Score (NPS)
*How to implement a customer success strategy
*The importance of customer success feedback analysis
*Methods for collecting customer feedback
*Tools for customer feedback analysis
*How to conduct customer feedback analysis
*Practical example
*Section summary

cours 66:

Intro to AI new

1. Welcome to the course

*Natural vs Artificial Intelligence
*Brief history of AI
*Demystifying AI, Data science, Machine learning, and Deep learning
*Weak vs Strong AI

2. Data is the key ingredient for building AI

*Structured vs unstructured data
*How we collect data
*Labeled and unlabeled data
*Metadata: Data that describes data

3. Key AI techniques

*Machine learning
*Supervised, Unsupervised, and Reinforcement learning
*Deep learning

4. Important AI branches

*Robotics
*Computer vision
*Traditional ML
*Generative AI

5. Understanding Generative AI

*The rise of Gen AI: Introducing ChatGPT
*Early approaches to Natural Language Processing (NLP)
*Recent NLP advancements
*From Language Models to Large Language Models (LLMs)
*The efficiency of LLM training. Supervised vs Semi-supervised learning
*From N-Grams to RNNs to Transformers: The Evolution of NLP
*Phases in building LLMs
*Prompt engineering vs Fine-tuning vs RAG: Techniques for AI optimization
*The importance of foundation models
*Buy vs Make: foundation models vs private models

6. Practical challenges in Generative AI

*Inconsistency and hallucination
*Budgeting and API costs
*Latency
*Running out of data

7. The AI tech stack

*Python programming
*Working with APIs
*Vector databases
*Open source models
*Hugging Face
*LangChain
*AI evaluation tools

8. AI job positions

*AI strategist
*AI developer
*AI engineer

9. Looking ahead

*AI ethics
*Future of AI

cours 67 :

Speech Recognition with Python

1. Course Introduction

*Welcome to the World of Speech Recognition
*Course Approach
*How It All Started: Formants, Harmonics, and Phonemes
*Development and Evolution

2. Sound and Speech Basics

*How Do Humans Recognize Speech?
*Fundamentals of Sound and Sound Waves
*Properties of Sound Waves

3. Analog to Digital Conversion

*Key Concepts: Sample Rate, Bit Depth, and Bit Rate
*Audio Signal Processing for Machine Learning and AI

4. Audio Feature Extraction for AI Applications

*Time-Domain Audio Features
*Frequency-Domain and Time-Frequency-Domain Audio Features
*Time-Domain Feature Extraction: Framing and Feature Computation
*Frequency-Domain Feature Extraction: Fourier Transform

5. Speech Recognition Mechanics

*Acoustic and Language Modeling
*Hidden Markov Models (HMMs) and Traditional Neural Networks
*Deep Learning Models: CNNs, RNNs, and LSTMs
*Advanced Speech Recognition Systems: Transformers
*Building a Speech Recognition Model Part I
*Building a Speech Recognition Model Part II
*Selecting the Appropriate Speech Recognition Tool
*Expanding Beyond the Tools We've Covered

6. Setting Up the Environment

*Installing Anaconda
*Setting up a New Environment
*Installing Packages for Speech Recognition
*Importing the Relevant Packages in Jupyter Notebook

7. Transcribing Audio with Google Web Speech API

*Audio File Formats for Speech Recognition
*Importing Audio Files in Jupyter Notebook
*The SpeechRecognition Library: Google Web Speech API
*Evaluation Metrics: WER and CER
*Calculating WER and CER in Python

8. Background Noise and Spectrograms

*Understanding Noise in Audio Files
*Creating a Spectrogram with Python
*Dealing with Background Noise

9. Transcribing Audio with OpenAI's Whisper

*Whisper AI: Transformer-Based Speech-to-Text
*Homework Assignment
*Transcribing Multiple Audio Files from a Directory
*Saving Audio Transcriptions to CSV for Easy Analysis
*Reversing the Process: AI-Powered Text-to-Speech

10. Final Discussion and Future Directions

*Modern Practices and Applications
*Challenges and Limitations
*The Future of Speech Recognition with AI
