365 Data Science Axs
365 Data Science Axs
*Course Introduction
*Why are there so many business and data science buzzwords?
*Analysis vs Analytics
*Intro to Business Analytics, Data Analytics, and Data Science
*Adding Business Intelligence (BI), Machine Learning (ML), and Artificial
Intelligence (AI) to the picture
*Traditional AI vs. Generative AI
*More Examples of Generative AI
*An Overview of our Data Science Infographic
*When are Traditional data, Big Data, BI, Traditional Data Science and ML
applied?
6. Data Science Job Positions: What do they Involve and What to Look out for?
*Data Science Job Positions: What do they Involve and What to Look out
for?
cours 2:
Statistics:
1. Introduction
*What does the course cover
*Population vs sample
*Introduction
*What is a distribution
*The Normal Distribution
*The Standard Normal Distribution
*Central limit theorem
*Standard error
*Estimators and estimates
4. Confidence Intervals
5. Hypothesis testing
*Null vs Alternative
*Further Reading on Null and Alternative Hypothesis
*Rejection region and significance level
*Type I error vs type II error
*Test for the mean. Population variance known
*p-value
*Test for the mean. Population variance unknown
*Test for the mean. Dependent samples
*Test for the mean. Independent Samples (Part 1)
*Test for the mean. Independent Samples (Part 2)
*Practical Example - Hypothesis Testing
cours 3:
Probability:
*Course Introduction
*What is the Probability Formula
*Expected Values
*Probability Frequency Distribution
*Complements
2. Combinatorics
*Fundamentals of Combinatorics
*Computing Permutations
*Solving Factorials
*Variations with Repetition
*Variations without Repetition
*Combinations without Repetition
*Combinations with Repetition
*Symmetry of Combinations
*Combinations with Separate Sample Spaces
*Winning The Lottery
*Summary of Combinatorics
*Practical Example - Combinatrics
3. Bayesian Inference
4. Discrete Distributions
.
5. Continuous Distributions
*Continuous Distributions
*Normal Distribution
*Standardizing a Normal Distribution
*Students T Distribution
*Chi-Squared Distribution
*Exponential Distribution
*Logistic Distribution
*Practical Example - Distributions
*Probability in Finance
*Probability in Statistics
*Probability in Data Science
cours 3:
1. Course Introduction
*Installing Python
*Introducing Spyder
*PRINT function
*The solution
*Variables
*Strings
*Let's write a program
*Question 1
*Question 2
*Question 3
*Question 4
*Question 5
5. Conditionals
*Intro to conditionals
*Logical operators
*Not TRUE is FALSE
*IF statement
*More on strings
*More strings!
*Practice challenge
6. For Loops
7. Dictionaries
*Modules
*Dictionaries
*Zip function
*More string methods
*Tuples
*More than 1 dimension
*Import counters (Part I)
*Import counters (Part II)
9. Classes
*Objects
*Class
*__Init__
*Class variables
*Methods
*Inheritance
*Practical exercise 1
*Practical exercise 2
*General tips
11. Big O
*_Big_O
*O(n)
*O(n²)
*O(n!)
*Summary
*Matplotlib
*Random walk explanation
*Simulating a random walk
*Sierpinski triangle
*Creating the triangle in Matplotlib
*Python imaging library
*Intro to stacks
*Coding a stack in Python
*Practical challenge - stacks
*Hint on solving the challenge
*Solving the stacks challenge
*Towers of Hanoi
*Some hints
*Code for Tower of Hanoi
*Enumerate sets & none
*Debugging
*Beyond the PRINT function
*Debugging in Spyder
*Further exrecises
21. Installing Python and Using Virtual Environments on Windows and Unix
*Introduction
*Downloading Python
*Setting up the virtual environment
*Sharing a project - requirements file
*Downloading and Installing Virtual Box & Setting up a Virtual Machine
*Installing Ubuntu on the Virtual Machine
*Setting up a virtual environment on Ubuntu
*Introduction to the command line
*Handling files on in the command line
*Copying, moving and deleting files on the command line
*Echo, environment variables and path
*Section summary
22. PyCharm
24. Farewell
*Farewell
cours 4:
Mathematics:
cours 5:
SQL:
*Welcome to SQL
*Why SQL?
*Why MySQL?
*Introduction to databases
2. SQL Theory
*Installing MySQL
*Additional note – Installing – Viisual C
*Installing MySQL on macOS and Unix systems
*The Client-Server Model
*Setting up a connection
*Important - Please read
*New Authentication Plugin - Creating a New User
*Introduction to the MySQL interface
6. MySQL Constraints
*SELECT… FROM…
*SELECT - FROM - Exercise
*WHERE
*WHERE - Exercise
*AND
*AND - Exercise
*OR
*OR - Exercise
*Operator Precedence
*Operator Precedence - Exercise
*IN - NOT IN
*IN - NOT IN - Exercise
*LIKE - NOT LIKE
*LIKE - NOT LIKE - Exercise
*Wildcard Characters
*Wildcard Characters - Exercise
*BETWEEN… AND…
*BETWEEN - AND - Exercise
*IS NOT NULL - IS NULL
*IS NOT NULL - IS NULL - Exercise
*Other Comparison Operators
*Other Comparison Operators - Exercise
*SELECT DISTINCT
*SELECT DISTINCT - Exercise
*Introduction to Aggregate Functions
*Introduction to Aggregate Functions - Exercise
*ORDER BY
*ORDER BY - Exercise
*GROUP BY
*Using Aliases (AS)
*Using Aliases (AS) - Exercise
*HAVING
*HAVING - Exercise
*WHERE vs HAVING - Part I
*WHERE vs HAVING - Part II
*WHERE vs HAVING - Exercise 2
*LIMIT
*LIMIT - Exercise
*Introduction to JOINs
*Introduction to JOINs - Exercise
*INNER JOIN - Part I
*INNER JOIN - Part II
*INNER JOIN - Part II - Exercise
*A Note on Using Joins
*Duplicate Records
*LEFT JOIN - Part I
*LEFT JOIN - Part II
*LEFT JOIN - Part II - Exercise
*RIGHT JOIN
*The new and the old join syntax
*The new and the old join syntax - Exercise
*JOIN and WHERE Used Together
*JOIN and WHERE Used Together - Exercise
*CROSS JOIN
*CROSS JOIN - Exercise
*Using Aggregate Functions with Joins
*JOIN more than two tables in SQL
*JOIN more than two tables in SQL - Exercise
*Tips and tricks for joins
*Tips and tricks for joins - Exercise
*UNION vs UNION ALL
*UNION vs UNION ALL - Exercise
cours 6:
1. Linear Regression
*Course Introduction
*The linear regression model
*Correlation vs regression
*Geometrical representation of the Linear Regression Model
*Setting up the Environment
*Python packages installation
*First regression in Python
*Using Seaborn for graphs
*How to interpret the regression table
*Decomposition of variability
*What is the OLS?
*R-squared
*Multiple linear regression Theory
*Adjusted R-squared Theory
*F-test
*OLS assumptions
*A1: Linearity
*A2: No endogeneity
*A3: Normality and homoscedasticity
*A4: No autocorrelation
*A5: No multicollinearity
*Dealing with categorical data - Dummy variables
*Making predictions with the linear regression
*What is sklearn
*Game plan for sklearn
*Simple linear regression
*Simple linear regression - summary table
*A Note on Normalization
*Multiple linear regression
*Adjusted R-squared
*Feature Selection through p-values
*Creating a summary table
*A Note on Calculation of P-Values with sklearn
*Feature Scaling
*Feature Selection through standardization
*Making predictions with standardized coefficients
*Underfitting and overfitting
*Training and testing
4. Logistic Regression
*K-means clustering
*A simple example of clustering
*Clustering categorical data
*How to choose the number of clusters
*Pros and Cons of K-means and clustering
*To standardize or to not standardize
*Relationship between clustering and regression
*Market Segmentation with Cluster Analysis (Part 1)
*Market Segmentation with Cluster Analysis (Part 2)
*How is clustering useful
*Types of clustering
*Dendrogram
*Heatmaps using Seaborn
cours 7:
1. Introduction
4. Minimal example
*Outline
*Generating the data (optional)
*Initializing the variables
*Training the model
*Minimal Example - Exercises.txt
5. Introduction to TensorFlow 2
*TensorFlow Outline
*TensorFlow 2 Intro
*A note on coding in TensorFlow
*Types of file formats in Tensorflow and data handling
*Model layout - inputs, outputs, targets, weights, bias, optimizer, and loss
*Interpreting the result and extracting the weights and bias
*Customizing your model
*Introduction to TensorFlow - Exercises
*The layer
*What is a deep net
*Really understand deep nets
*Why do we need non-linearities
*Activation functions
*Softmax activation
*Backpropagation
*Backpropagation - intuition
7. Backpropagation (optional)
*Backpropagation mathematics
8. Overfitting
9. Initialization
*Initialization
*Types of simple initializations
*Xavier's initialization
10. Optimizers
*SGD&Batching
*Local minima pitfalls
*Momentum
*Learning rate schedules
*Learning rate schedules. A picture
*Adaptive learning schedules
*Adaptive moment estimation
11. Preprocessing
*Preprocessing
*Basic preprocessing
*Standardization
*Dealing with categorical data
*One hot vs binary
*MNIST dataset
*How to tackle the MNIST dataset
*MNIST - Importing libraries and data
*Preprocess the data - create a validation dataset and scale the data
*Preprocess the data - scale the test data
*Preprocess the data - shuffle and batch
*Preprocess the data - shuffle and batch Exercise
*Outline the model
*Select the loss and the optimizer
*Learning
*MNIST - Exercises
*Testing the model
14. Conclusion
*Summary
*What's more out there
*An overview of CNNs
*How DeepMind uses deep learning
*An overview of RNNs
*Non-NN approaches
cours 8;
Introduction to Tableau
1. Introduction to Tableau
*Welcome to Tableau
*Why use Tableau: Make your data make an impact
*Let's download Tableau Public
*Connecting data in Tableau
*Exploring Tableau's interface
*Exploring Tableau's interface
*Let's create our first chart in Tableau!
2. Tableau Functionalities
*Creating a table
*Creating a table - link
*Creating custom fields
*Creating a custom field and adding calculations to a table
*Creating a custom field and adding calculations to a table - link
*Adding totals and subtotals
*Adding totals and subtotals - link
*Adding a custom calculation
*Adding a custom calculation - link
*Inserting a filter
*Inserting a filter - link
*Working with joins in Tableau
cours 9:
The Complete Data Visualization Course with Python, R, Tableau, and Excel:
1. Introduction
3. Bar Chart
4. Pie Chart
6. Line Chart
7. Histogram
8. Scatter Plot
cours 10:
Introduction to R Programming:
*Creating an object in R
*Data types in R (Integers and doubles)
*Data types in R (Characters and logicals)
*Coercion rules in R
*Functions in R
*Functions and arguments
*Building a function in R
*Using the script vs. using the console
4. Matrices
*Creating a matrix
*Faster code - creating a matrix in a single line of code
*Do matrices recycle
*Indexing an element from a matrix
*Slicing a matrix
*Matrix arithmetic
*Matrix operations
*Categorical data
*Creating a factor in R
*Lists in R
*Relational operators in R
*Logical operators in R
*Logical operators and vectors
*If else else-if statements
*If else else-if keep-in-minds's
*For loops in R
*While loops in R
*Repeat loops in R
*Building a function in R 2.0
*Building a function in R 2.0 Scoping
6. Data frames in R
*Population vs Sample
*Mean, median, mode
*Skewness
*Variance, standard deviation, and coefficient of variability
*Covariance and correlation
*Distributions
*Standard Error and Confidence Intervals
*Hypothesis Testing
*Type I and Type II errors
*Test for the mean. Population variance known
*The P-value
*Test for the mean. Population variance unknown
*Dependent samples
*Comparing two means. Independent samples
cours 11:
1. Introduction to NumPy
*Course Introduction
*The NumPy Package and Its Applications
*Installing and Upgrading NumPy
*What is an array?
*Using The NumPy Documentation
*Frequently Asked Questions
*History of NumPy
*Ndarrays
*Arrays vs Lists
3. NumPy Fundamentals
*Indexing
*Assigning Values
*Elementwise Properties
*Types of Data Supported by NumPy
*Characteristics of NumPy Functions - Part 1
*Characteristics of NumPy Functions - Part 2
*Basic Slicing
*Stepwise Slicing
*Conditional Slicing
*Dimensions and the Squeeze Function
*Arrays of 0s and 1s
*"_like" functions in NumPy
*A Non-Random Sequence of Numbers
*Random Generators and Seeds
*Basic Random Functions in NumPy
*Probability Distributions in NumPy
*Applications of Random Data in NumPy
*np.loadtxt() vs np.genfromtxt()
*Simple Cleaning when Importing
*String vs Object vs Numbers
*np.save()
*np.savez()
*np.savetxt()
cours 12:
1. pandas - Basics
3. pandas Series
*.unique(), .nunique()
*Converting Series into Arrays
*.sort_values()
*Attribute and Method Chaining
*.sort_index()
4. pandas DataFrames
cours 13:
*Section introduction
*Stakeholder mapping
*Defining business intelligence
*End-to-end process mapping (part 1)
*End-to-end process mapping (part 2)
*Hire-to-retire (H2R)
*Source-to-Pay (S2P)
*Record-to-report (R2R)
*Order-to-Cash (O2C)
*Global process owners - Conclusion
4. Target setting
*Section introduction
*Descriptive analytics
*Diagnostic analytics
*Predictive analytics
*Prescriptive analytics
*Section introduction
*Trend analysis
*Trend analysis in Excel
*Comparative analysis
*Comparative analysis in Excel
*Value-based analysis
*Value-based analysis in Excel
*Correlation analysis
*Correlation analysis in Excel
*Time series analysis
*Time series analysis in Excel
*Regression analysis
*Regression analysis in Excel
*Machine learning analysis
*Types of machine learning
*Conclusion
*Into to Section 7
*Hypothesis development
*Situational analysis
*Building a RACI matrix
*What makes for a successful project?
*Current state analysis
*Blueprint and design
*Build and test
*Build and test: step-by-step
*Deploy and operationalise: operationalise-and-operationalise
*Data visualization
cours 13:
cours 14:
1. Introduction
3. Dataset description
*Our example: consumer loans. A first look at the dataset
*Dependent variables and independent variables
4. General preprocessing
6. PD model estimation
9. PD model monitoring
cours 15:
1. Software Integration
*Course Introduction
*Properties and Definitions: Data, Servers, Clients, Requests, and Responses
*Properties and Definitions: Data Connectivity, APIs, and Endpoint
*Further Details on APIs
*Text Files as Means of Communication
*Definitions and Applications
*Up Ahead
*Real-Life Example: Absenteeism at Work
*Real-Life Example: The Dataset
4. Machine Learning
cours 16:
Introduction to Jupyter:
1. Introduction to Jupyter
*Introduction
*A 5-minute explanation of Programming
*Introduction to Jupyter
*Installing Anaconda
*Intro to Using Jupyter
*Working with Notebook Files
*Using Shortcuts
*Handlin g Error Messages
*Restarting the Kernel
cours 17:
Introduction to Excel
cours 18:
*Course Overview
*The Data Science Knowledge You Need
*Types of Data Science Roles
*The Interview Process Structure
*What Interviewers Look For
*How to Get the Most
2. The Project Portfolio
*Portfolio Overview
*What Is a Data Science Project
*The Projects You Should Do
*How to Differentiate Your Projects
*Where to Showcase Your Projects
*Best Github Practices
*Kaggle Profile
3. The Resume
*Resume Overview
*How to Structure Your Resume
*How to Write about Work and Projects
*Customize your Resume
*Your Virtual Resume
*Resume Checklist
*The Cover Letter
4. Get an Interview
*Interviewing Overview
*How Candidates are Selected
*Networking for Data Scientists
*Leveraging Your Resources
*Informa tional Interviews
*Reaching out to Recruiters
*Anna Interview
*Elevator Pitch Outline and Examples
*Jaemin Interview
*Jay Interview
*Jefferson Interview
*Sheng Interview
*Glassdoor Findings
*LinkedIn Bonus Content
*Bonus Content: Portfolio Website
*Bonus Content: Star Storytelling Technique
cours 19:
SQL + Tableau
2. Problem 1
*Problem 1: Task
*Task 1 - Text
*Important clarification
*Problem 1: Solution in SQL
*Exporting your output from SQL and loading it in Tableau
*Chart 1: Visualizing the Solution in Tableau - Part I
*Chart 1: Visualizing the Solution in Tableau - Part II
3. Problem 2
*Problem 2: Task
*Task 2 - Text
*Problem 2: Solution in SQL
*Chart 2: Visualizing the Solution in Tableau
4. Problem 3
*Problem 3: Task
*Task 3 - Text
*Problem 3: Solution in SQL
*Chart 3: Visualizing the Solution in Tableau
5. Problem 4
*Problem 4: Task
*Task 4 - Text
*Problem 4: Solution in SQL
*Chart 4: Visualizing the Solution in Tableau
6. Problem 5
cours 20:
1. Introduction
*White Noise
*Random Walk
*Stationarity
*Determining Weak Form Stationarity
*Seasonality
*Correlation Between Past and Present Values
*The ACF
*The PACF
6. Picking the Correct Model
7. The AR Model
*The AR Model
*Examining the ACF and PACF of Prices
*Fitting an AR(1) Model for Index Prices
*Fitting Higher-Lag AR Models for Prices
*Using Returns
*Examining the ACF and PACF of Returns
*Fitting an AR(1) Model for Returns
*Fitting Higher-Lag AR Models for Returns
*Normalizing Values
*Model Selection for Normalized Returns (AR Model)
*Examining the AR Model Residuals
*Unexpected Shocks from Past Periods
8. The MA Model
*The MA Model
*Fitting an MA(1) Model for Returns
*Fitting Higher-Lag MA Models for Returns
*Examining the MA Model Residuals for Returns
*Model Selection for Normalized Returns (MA Model)
*Fitting an MA(1) Model for Prices
*Past Values and Past Errors
*Auto AR IMA
*Preparing Python for Model Selection
*The Default Best Fit
*Basic Auto ARIMA Arguments
*Advanced Auto ARIMA Arguments
*The Goal Behind Modelling
cours 21:
Power BI
1. Power BI Overview
*Introduction to Power BI
*Power BI Desktop vs Power BI Pro
*Power BI data sources
2. Power BI Setup
4. Task three
5. Data Modelling
6. Task Four
*Task Four
*Task Four, Part 1
*Task Four, Part 2
*Task Four, Part 3
8. Task Five
*Task Five
*Task Five, Part 1
*Task Five, Part 2
*Task Five, Part 3
*Task Five, Part 4
*Announcement
*Power BI Service
*Workspaces
cours 22:
*Introduction
*Course Overview
*Growing Importance of an AI & Data PM
*The Role of a Product Manager
*Differentiation of a PM in AI & Data
*Product Management vs. Project Management
8. Evaluating Performance
11. Communication
cours 23:
1. Course Introduction
2. Introduction to Git
*Installing Git
*Configure Git
*Basic commands
*Exploring Git log
*Git diff
*HEAD
*Making corrections Git reset
*Branching
3. GitHub
*GitHub
*GitClone
4. Conclusion
*Conclusion
cours 23:
*Course Introduction
*Segmentation, Targeting, Positioning
*Marketing Mix
*Physical and Online Retailers: Similarities and Differences.
*Price Elasticity
3. Segmentation Data
4. Hierarchical Clustering
5. K-means Clustering
7. Purchase Data
cours 24:
*API overview
*HTTP requests GET and POST requests
*JSON preferred data exchange format for APIs
*Exchange rates API GETting a JSON response
*Incorporating parameters in a GET request
*Additional API functionalities
*Creating a simple currency converter
*iTunes API
*Homework
*Homework - 2
*iTunes API Structuring and exporting the data
*GitHub API Pagination
*EDAMAM API Initial setup and registration
*EDAMAM API Sending a POST request
*Downloading files with Requests
4. HTML overview
*What is HTML?
*Structure of HTML
*Syntax of HTML. Tags
*Tag attributes
*Popular tags
*CSS and JavaScript
*Character encoding
*XHTML and code style
*Scraping Steam
*Scraping HTML tables with the help of Pandas
cours 25:
Introduction to Python
1. Why Python?
*Course Introduction
*Setting Up the Environment
*Python Variables
*Understanding Numbers and Boolean Values
*Strings
4. More on Operators
5. Conditional Statements
6. Functions
7. Sequences
*Introduction to Lists
*Using Methods in Python
*What is List Slicing?
*Working with Tuples
*Python Dictionaries
8. Iteration
cours 26;
1. Useful Tools
*Course Introduction
*Must-Have Packages for Finance and Data Science
*Working with Arrays
*Generating Random Numbers
*Important Note on Using Online Financial Data Sources
*Using Financial Data in Python
*Importing and Organizing Data in Python - Part I
*Importing and Organizing Data in Python - Part II
*Importing and Organizing Data in Python - Part III
*Changing the Index of Your Time-Series Data
*Restarting the Jupyter Kernel
cours 26:
4. Excel Mechanics
cours 27:
2. Kernels
3. CNN Introduction
*CNNs motivation
*Feature maps
*Pooling and Stride
*Dimensions
*Activation functions
*Overfitting and early stopping
*Optimizers
*Road plan
*A simple CNN architecture
*Preprocessing the data
*Building and training the CNN
*MNIST CNN Homework
*Testing the trained CNN
*Introduction
*Regularization
*L2 Regularization and weight decay
*Dropout
*Data augmentation
*Unexpected failures
cours 28:
Data Strategy
*Section introduction
*Curated dashboards vs. self-service data exploration
*Challenges related to self-service data exploration
*Asking key business questions first (KBQs)
*The power of clear Key Business Questions (KBQs)
*How to ask the right Key Business Questions
*Giving people access to data
*Curating the most important data insights
*Section intro
*How this butcher uses data to understand customers
*Netflix use case
*Amazon use case
*The increasing need for real-time data to understand customers and
markets
*Section introduction
*Structured vs Semi-structured Data
*Internal vs External Data
*Different Types of Data
*Meta Data
*The Importance of Realtime Data
*Gathering Internal Data
*Accessing External Data
*Sources of External Data
*When the data you want doesn't exist
11. Data Governance
*Section introduction
*To own or not to own
*Ensuring the correct rights are in place
*Case study on building trust
*Section introduction
*Text analytics
*Sentiment analytics
*Image analytics
*Video analytics
*Voice analytics
*Data mining
*Business experiments
*Visual analytics
*Correlation Analysis
*Regression Analysis
*Scenario Analysis
*Time series analysis
*Monte carlo simulation
*Linear programming
*Cohort analysis
*Factor analysis
*Neural network analysis
*Deep learning
*Reinforcement learning
*Section introduction
*How to collect data
*Database, Data warehouse, Data mart and Data lake
*How to store data
*How to process data
*Communicating data
*What is а KPI dashboard
*How to design a KPI Dashboard
*Reporting lessons from journalists
*Using KPI dashboard software
*Big data as a service
*Section introduction
*Skills shortage
*The skills needed for a data scientist
*Building internal skills and competencies
*Outsourcing your data analysis
*Leadership challenges
cours 29:
2. Consumer-Driven Marketing
7. Pricing Optimization
cours 30:
cours 31:
1. Introduction
*Introduction to Section 2
*What is SQL?
*SQL Server Setup
*Introduction to Section 3
*Structure of a SQL Query
*SQL Functions to Master
*Introduction to Section 4
*5 Step Framework for SQL Interviews
*What Comes Next
*Mock Interview 1
*Mock Interview 2
*Mock Interview 3
*Mock Interview 4
*Mock Interview 5
*Mock Interview 6
*Mock Interview 7
*Mock Interview 8
*Mock Interview 9
*Mock Interview 10
5. Bonus
cours 32:
Data Literacy
2. Understanding Data
*Data Definition
*Qualitative vs. Quantitative Data
*Structured vs. Unstructured Data
*Data at Rest vs. Data in Motion
*Transactional vs. Master Data
*Big Data
*Storing Data
*Database
*Data Warehouse
*Data Marts
*The ETL Process
*Apache Hadoop
*Data Lake
*Cloud Systems
*Edge Computing
*Batch vs. Stream Processing
*Graph Database
3. Using Data
4. Reading Data
*Reading Data
*Data Quality Assessment
*Data Description
*Measures of Central Tendency
*Measures of Spread
5. Interpreting Data
*Interpreting Data
*Correlation Аnalysis
*Correlation Coefficient
*Correlation and Causation
*Simple Linear Regression
*R-squared
*Forecasting
*Forecast Errors
*Statistical Tests
*Hypothesis Testing
*P-Value
*Statistical Significance
*Classification Models
*Accuracy
*Recall and Precision
cours 33:
1. Introduction
2. Business Goals
*Quantifying Attainability
*Gradient Boosted Machines: Part 1
*Gradient Boosted Machines: Part 2
*Gradient Boosted Machines: Part 3
*SHAP Values
*Friedman's H-Statistic
*LIME
*Waterfall Charts 1
*Waterfall Charts 2
*Causation: Traditional Statistical Methods
*Causation: Advanced Statistical Methods
*Time Series Forecasting with Takens' Theorem
*Introduction
*The Hybrid Experiment
*Quantile Difference Tests
cours 34:
cours 35:
1. Introduction
*Course Introduction
*What Is Machine Learning?
*Types of Machine Learning
5. Logistic Regression
6. Cluster Analysis
7. K-means Clustering
*K-means Clustering
*K-means Clustering in Excel
*K-means Clustering with Xreal
*Choosing the Number of Clusters
*Clustering Categorical Data
*Standardization
*Clustering and Regression
*Clustering (Pros and Cons)
*Types of Clustering
*Market Segmentation (Part 1)
*Market Segmentation (Part 2)
8. Decision Trees
*Decision Trees
*Entropy (Loss function)
*Information Gain
*Decision Trees in Excel (Part 1)
*Decision Trees in Excel (part 2)
*Decision trees (Prediction)
cours 36:
4. Interview preparation
*Introduction
*Question 1
*Question 2
*Question 3
*Question 4
*Question 5
*How to prepare for the interview
5. Conclusion
*Conclusion
cours 37:
3. Decision Trees
4. Random Forests
1. Introduction
*The stages of data maturity and what you will see next
*How to go from no data to some data - Reach Level 1
6. Project 3
*Intro to Project 3
*Exploring the dataset
*Analyzing purchase rate across groups
*Understand the business problem and specify your objectives
*Explore data. Create, train, and test dataset
*Perform the analysis
cours 39:
2. Data Modeling
*Data modeling
*Relationships
*Role Level Securit
*Global Superstore report
cours 40 :
Introduction to DAX
1. Introduction to DAX
*Course intro
*Calculated Columns
*Calendar Table
*Building a Measure
*Measures Table
*CALCULATE
*FILTER vs KEEPFILTERS
*Iterators
*RELATED vs RELATEDTABL
*SELECTEDVALUE
*DIVIDE
*Logical Operators
*Variables
*TREATAS
* SWITCH
* Text Functions
*CONCATENATEX
*Time Intelligence
*Expression Based Titles
* Role Level Security
*Calculation Groups
*DAX assignment
cours 41 :
*Course intro
*Power BI Canvas
* Drawing Shapes
*Slicers
*Filled Map
*Matrix Table
*Donut Chart
*Interactive Tooltip
*Gauge Visual
*KPI Visual
*Card Visual
*Area Chart
*Bookmarks
*Format Painter
*Stacked Column Chart
*Two reports on one page
cours 42
4. Conclusion
cours 43:
*Course Introduction
*Python Refresher and Setting Up the Environment
cours 44:
Advanced SQL
cours 45:
1. Course Introduction
*Introduction
*ML Algorithms course - GitHub repository
*How to Use this Course
*Types of ML Problems
*Additional Resources
2. Linear Regression
*Linear Regression
*Real World Business Problems
*Example: Linear Regression
*Intuition: Linear Regression
*Training Step-by-Step: Linear Regression
*Prediction: Linear Regression
*Assumptions: Linear Regression
*Assumption #1: Model is linear in coefficients and error terms
*Assumption #2: Homoscedasticity
*Assumption #3: Multicollinearity
*Assumption #4: Independence/Autocorrelation
*Assumption #5: Normally Distributed Error Terms
*Assumption #6: Outliers
*Inference - Interpreting Output
* AB Testing Example
*ML Process: Linear Regression
*Pros & Cons, When to Use
4. Logistic Regression
5. Gradient Descent
*Gradient Descent
*Intuition: Gradient Descent
*Plain Definition: Gradient Descent
*Step-by -Step: Gradient Descent
*Assumptions: Gradient Descent
*Parameter Tuning (Step size, Alpha)
*Gradient Descent Pros and Cons
*Stochastic Gradient Descent
*Pros and Cons: Gradient Descent
6. Decision Trees
*Decision Trees
* Example: Decision Trees
*Plain Explanation: Decision Trees
*Different Components of Decision Trees Explained
*Real World Business Example: Decision Trees
* Assumptions: Decision Trees
*Training Step-by-Step: Decision Trees
*Prediction Step-by-Step: Decision Trees
*Additional Metrics: Decision Trees
*Tuning the Parameters: Decision Trees
*ML Process: Decision Trees
*Decision Trees Assumptions
*Pros and Cons: Decision Trees
*When to Use Decision Trees.
7. Random Forest
*Random Forest
*Intuition: Random Forest
*Example: Random Forest
*Real World Business Problems: Random Forest
* Plain Definition: Bagging
*Where Bagging Fails
*Plain Definition: Random Forest
*Step-by-Step (Training): Random Forest
* Step-by-Step (Prediction): Random Forest
*How Random Forest give us Feature Importance
*Out of Bag Error
*ML Process: Random Forest
*When to use: Random Forest
*Pros and Cons: Random Forest
9. XGBoost
*Intuition: XGBoost
* Real World Business Problems: XGBoost
*Plain Definition: XGBoost
*XGBoost Algorithm Improvements
* System Improvements
*ML Process: XGBoost
*When to use XGBoost
* Pros and Cons: XGBoost
*Intuition: KNN
*Example: KNN
*Plain Definition: KNN
*Assumptions : KNN
*Training Step-by-Step: KNN
*Prediction Step-by-Step: KNN
*Tuning Parameters: KNN
*ML Process: KNN
*When to use KNN
*Intuition: SVM
*Real World Business Problems: SVM
*Step-by-Step Training (Non-Technical): SVM
*Loss Function
*Nonlinear Data
* Prediction (Step-by-Step): SVM
*Terminology: SVM
*Assumptions: SVM
*Soft vs Hard Margins: SVM
* How to use SVMs as a multi-class classifier
*How does SVM Regression Work
* ML Process: SVM
*Pros & Cons (Classifier): SVM
* When to use an SVM Classifier
*Bayes Theorem
*Intuition and Plain Definition
*Step-By-Strp Explanation - First Part
*Step-by-step Explanation - Second Part
* Why is Naive Baïve called Naïve?
*The types of Naïve Bayes
*ML Process: Naïve Bayes
*Pros and Cons: Naïve Bayes
*Real-Life Business Example
*When to use Naïve Bayes
*Regression project
*Classification project
cours 46:
3. Data Preparation
6. Dashboard Interpretation
*Linear Regression
*Logistic Regression
*Random Forest
*K-Means Clustering
*K-Nearest Neighbors
*Hierarchical Clustering
*Support Vector Machines
*Artificial Neural Networks
*Collaborative Filtering
2. Coding Walkthroughs
*Introduction
*Linear Regression - First Part
*Linear Regression - Second Part
*Linear Regression - Third Part
*Logistic Regression
*Decision Trees - First Part
*Decision Trees - Second Part
*Decision Trees - Third Part
*Random Forest - First Part
*Random Forest - Second Part
*Gradient Boost - First Part
*Gradient Boost - Second Part
*KNN - First Part
*KNN - Second Part
*K-Means Clustering - First Part
*K-Means Clustering - Second Part
*Hierarchical Clustering - First Part
*Hierarchical Clustering - Second Part
*SVM
*Neural Network - First Part
*Neural Network - Second Part
*Neural Network - Third Part
*NMF - First Part
*NMF - Second Part
*Naïve Bayes
cours 48:
cours 49 :
*DateTime intro
*Different DateTime types
*Timezones
*Intervals
4. Complex data types: ENUMs, ARRAYs, Ranges, and Nested data in SQL
*ENUM
*ARRAYs
*RANGE
*Nested data
5. Advanced query techniques: exploring OVER, JOINS, CASE and more in SQL
*OVER
*CROSS JOIN
*LATERAL JOIN
*CROSS JOIN LATERAL
*COALESCE
*CASE
*CONCAT
*Recursive CTE
*Recursive CTE - second part
6. Optimizing data structures: The art and science of data normalization in SQL
8. Practical tasks
*Question 1
*Question 2
*Question 3
*Question 4
cours 50:
1. Welcome
*GAME framework
*Goal: Use these 3 storytelling chemicals
*Goal: Determine the focus of your data presentation
*Audience: Ask these questions
*Audience: 4 preferred ways of communication
*Audience: Get buy-in with the IKEA effect
* Audience: Think about what language they speak
*Audience: The curse of knowledge
*Message & Expression
* Exercise: Use the GAME framework
*My book
*Talk benefits, not features
*Loss aversion
*7 principles to make your ideas stick
cours 51:
cours 52:
cours 53:
*Course introduction
* KPIs as navigation tools
*KPIs and fighter jets
*Understanding KPIs through the lens of weight loss goals
*Aligning Key Performance Indicators with strategic goals
7. Managing performance
8. Conclusion
*Best practices
*Final words
cours 54:
2. Case Study
*Understanding the 365's Sign-Up Process
*Basic Terminology
*Defining the Key Aspects of the Sign-Up Process
*Sketching the Story
6. Analysis Report
1. Course Introduction
*Welcome
*Why is this the right time to learn Python for social media analytics?
cours 56:
1. Introduction
2. Text Preprocessing
*Text Tagging
*Parts of speech (POS) tagging
*Named entity recognition (NER)
*Practical task
4. Sentiment Analysis
5. Vectorizing Text
6. Topic Modelling
cours 57:
1. Course introduction
*Course introduction
*COUNTIFS, SUMIFS
*Data lookup
*Data ranking
*Fill until non-blank values
*Translating logic into Excel formulas with ChatGPT
cours 58 :
Intro to LLMs
*GPT vs BERT
*BERT architectur
*Loading the model and tokenizer
*BERT embeddings
*Calculating the response
*Creating a QA bot
*BERT, RoBERTa, DistilBERT
cours 60:
cours 61:
1. Introduction
3. Data Architecture
4. Data Orchestration
*Data Pipelines and Data Orchestration
*Apache Airflow
5. Relational Databases
6. Non-relational Databases
*ACID Properties
*Document Databases
*Key-Value Database
*Object Storage
*Further reading
*Columnar Database
*Graph Database
*No-SQL Database Questions
7. Software Engineering
9. Data Modeling
cours 62 :
Statistical Tests in Sales and Marketing
9. Conclusion
*Conclusion
cours 63 :
*Tokens
*Models and Prices
*First Steps
*System, User, and Assistant Roles
*Creating a Sarcastic Chatbot
*Temperature, Max Tokens, and Streaming
5. Model Inputs
7. Output Parsers
cours 64:
cours 65:
1. Course Introduction
*Course introduction
*Why this course
*Course structure
2. Understanding revenue
*Basics of revenue
*Key concepts related to revenue
*How revenue is recorded in accounting books
*Year-on-year % growth
*CAGR - Compound Annual Growth Rate
*Run rate
* Net revenue retention
4. Generating revenue
*Sales target
*Sales cycle
*Sales pipeline
*Great work!
*Section introduction
*Unraveling the basics of pricing
*From production costs to net revenue
*Why pricing is important
*Price elasticity of demand
*The 5 Cs of pricing
*Key pricing strategies
*Section summary
*Section introduction
*Customer success and its importance
*Key components of customer success
*Net Promoter Score (NPS)
*How to implement a customer success strategy
*The importance of customer success feedback analysis
*Methods for collecting customer feedback
*Tools for customer feedback analysis
*How to conduct customer feedback analysis
*Practical example
*Section summary
cours 66:
Intro to AI new
3. Key AI techniques
*Machine learning
*Supervised, Unsupervised, and Reinforcement learning
*Deep learning
4. Important AI branches
*Robotics
*Computer vision
*Traditional ML
*Generative AI
5. Understanding Generative AI
*The rise of Gen AI: Introducing ChatGPT
*Early approaches to Natural Language Processing (NLP)
*Recent NLP advancements
*From Language Models to Large Language Models (LLMs)
*The efficiency of LLM training. Supervised vs Semi-supervised learnin
*From N-Grams to RNNs to Transformers: The Evolution of NLP
*Phases in building LLMs
*Prompt engineering vs Fine-tuning vs RAG: Techniques for AI optimization
*The importance of foundation models
*Buy vs Make: foundation models vs private models
*Python programming
*Working with APIs
*Vector databases
*Open source models
*Hugging Face
*Langchain
*AI evaluation tools
8. AI job positions
*AI strategist
*AI developer
*AI engineer
9. Looking ahead
*AI ethics
*Future of AI
cours 67 :
1. Course Introduction
*Installing Anaconda
*Setting up a New Environment
*Installing Packages for Speech Recognition
*Importing the Relevant Packages in Jupyter Notebook