Deep Learning With Databricks
Srijith Rajamohan, Ph.D.
John O'Dwyer
Databricks
Open
▪ Unify your data ecosystem with open source, standards and formats
▪ 30+ million monthly downloads
[Slide graphic labels: Notebooks, Datasets, Data Engineers]
Questions for Scalable ML
▪ Track the provenance and reason for model creation
▪ What training data was used, if any?
▪ Proprietary data, sensitive data, storage, data retention period?
▪ Real-time or batch?
▪ How are the models being used, and who is using them?
▪ Are they for exploratory analysis or a production environment?
▪ Is model performance being measured regularly and is the model being updated?
▪ Is the model well documented to ensure reuse?
▪ Is the model deployment process being automated?
▪ Institutional adoption and support
Best Practices for ML
▪ Software engineering practices
▪ Code quality best practices
▪ Validate your data
▪ Ensure proper data types and formats are fed to your model (schema validation; see the sketch after this list)
▪ Check for data drift, which can render a supervised model ineffective
▪ Version and track your experiments like code!
▪ Changes to hyperparameters, inputs, code, etc.
▪ Monitor predictive performance over time
▪ Ensure model performance does not degrade over time
▪ Ensure model fairness across different classes of data (bias)
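A minimal sketch of the schema-validation point above, assuming a Databricks notebook where spark is predefined; the Delta path and feature columns are hypothetical:

from pyspark.sql.types import StructType, StructField, DoubleType

# Expected input schema for the model (hypothetical feature columns)
expected_schema = StructType([
    StructField("alcohol", DoubleType(), nullable=False),
    StructField("chlorides", DoubleType(), nullable=False),
    StructField("citric_acid", DoubleType(), nullable=False),
])

df = spark.read.format("delta").load("/mnt/data/wine_features")

# Fail fast if the incoming data does not match what the model expects
assert df.schema == expected_schema, f"Schema mismatch: {df.schema}"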
What is MLOps?
Build -> Test -> Deploy -> Monitor -> Feedback -> Build
Model management
Databricks Ecosystem for ML/DL
▪ Integrated Environment
▪ Use compute instances from AWS, Azure or GCP
▪ Centered around a notebook environment
▪ Version control notebooks with GitHub
▪ Integrated DBFS filesystem that can mount cloud object stores like S3 (see the mount sketch after this list)
▪ Mix SQL, Python, R and Bash in the same notebook
▪ Schedule jobs to run anytime
▪ Databricks Runtimes (DBRs)
▪ Preinstalled with packages for ML/DL
▪ Additional packages can be installed per cluster or per notebook
▪ MLflow integrated into the Databricks platform
▪ Model tracking for experiment management/reproducibility
▪ MLflow projects for packaging an experiment
▪ Model serving with MLflow
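A minimal sketch of the DBFS mount mentioned in the list above, assuming a Databricks notebook where dbutils is predefined; the bucket and mount point are hypothetical:

# Mount an S3 bucket onto DBFS so it reads like any other DBFS path
dbutils.fs.mount(
    source="s3a://my-example-bucket",        # hypothetical bucket
    mount_point="/mnt/my-example-bucket",
)

# List the mounted bucket's contents
display(dbutils.fs.ls("/mnt/my-example-bucket"))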
Workspace
Notebooks
Job scheduling
Job page
Experiments
Registered models
The Data Preparation
The Delta Lake Architecture
Data Store and Versioning
Delta Lake
▪ Scalable metadata
▪ Time travel
▪ Open format
▪ Unified batch and streaming
▪ Schema enforcement

Feature Store
▪ Data stored needs to be transformed into features to be useful
▪ Feature tables are Delta tables
▪ Feature Stores can save these features (see the sketch after this list)
▪ Discoverable and reusable across an organization
▪ Ensures consistency for Data Engineers, Data Scientists and ML Engineers
▪ Track feature lineage in a model
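A minimal sketch of saving features with the Databricks Feature Store client, assuming a Databricks ML runtime; the table name, key column and path are hypothetical:

from databricks.feature_store import FeatureStoreClient

fs = FeatureStoreClient()

# Features computed during ETL, stored as a Delta table
features_df = spark.read.format("delta").load("/mnt/data/wine_features")

fs.create_table(
    name="ml.wine_features",         # hypothetical feature table name
    primary_keys=["wine_id"],        # hypothetical primary key column
    df=features_df,
    description="Wine quality features",
)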
ETL and EDA
▪ Delta Lake
▪ Save data in scalable file formats like Parquet
▪ The Delta file format lets you version control your data (see the time-travel sketch after this list)
▪ ETL
▪ Read data
▪ PySpark - Ideal for large data
▪ TensorFlow (tf.data) and PyTorch (DataLoader)
▪ Clean and process data
▪ PySpark/Pandas API on Spark can work with large datasets across clusters
▪ Extract features and save them using Feature Stores
▪ EDA
▪ Preliminary data analysis such as inspecting records, summary statistics
▪ Visualize the data and its distribution
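A minimal sketch of Delta Lake time travel and preliminary EDA in PySpark, assuming a Databricks notebook where spark is predefined; the table path is hypothetical:

path = "/mnt/data/wine_features"

# Read the current version of the Delta table
df = spark.read.format("delta").load(path)

# Time travel: read the table as it was at an earlier version
df_v0 = spark.read.format("delta").option("versionAsOf", 0).load(path)

# Preliminary EDA: inspect records and summary statistics
df.show(5)
df.describe().show()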
The Model Build
Model training
▪ DBRs provide your favorite DL frameworks such as TensorFlow, PyTorch, Keras, etc.
▪ Integration with MLflow for model tracking
▪ Hyperparameter tuning with Hyperopt/Optuna (see the Hyperopt sketch after this list)
▪ Seamlessly run single-node but multi-CPU/multi-GPU jobs
▪ Distributed training on multiple nodes with Horovod
▪ NVLink/NCCL-enabled instances available for accelerating DL workloads
▪ Tightly coupled: train directly on Spark DataFrames with the Horovod Estimator
▪ Train on distributed Spark clusters with HorovodRunner
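A minimal sketch of Hyperopt tuning with SparkTrials, which distributes trials across the cluster; the objective below is a toy stand-in for real model training:

from hyperopt import fmin, tpe, hp, SparkTrials

def objective(lr):
    # Train a model with learning rate `lr` and return the validation loss;
    # replaced here by a toy quadratic minimized at lr = 0.01
    return (lr - 0.01) ** 2

best = fmin(
    fn=objective,
    space=hp.loguniform("lr", -7, 0),   # learning rates from ~1e-3 to 1
    algo=tpe.suggest,
    max_evals=20,
    trials=SparkTrials(parallelism=4),
)
print(best)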
Distributed Training with Spark/Horovod
Distributed Training with Spark/Horovod contd...
Invoke training across multiple nodes
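A minimal sketch of that invocation with HorovodRunner, assuming a Databricks ML runtime; train is a hypothetical per-worker training function:

import horovod.torch as hvd
from sparkdl import HorovodRunner

def train():
    hvd.init()
    # Each worker trains on its data shard; rank 0 typically logs/checkpoints
    print(f"worker {hvd.rank()} of {hvd.size()}")
    # ... build the model, wrap the optimizer with hvd.DistributedOptimizer, train ...

# np=2 requests two workers; np=-1 would run locally on the driver for debugging
hr = HorovodRunner(np=2)
hr.run(train)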
▪ Quantization-aware training
▪ Lower-precision training to minimize memory/compute requirements
▪ Federated learning
▪ Decentralized learning with the Federated Averaging algorithm (Google)
▪ Keep data on device
▪ The model is updated with on-device data and the updates are sent back to a central server
▪ Updates from all devices are averaged (see the FedAvg sketch after this list)
▪ Privacy-preserving learning
▪ Learn from data that is encrypted or with minimal exposure to the data
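A minimal sketch of the Federated Averaging aggregation step described above, assuming each client reports its updated weights and local sample count; all names are illustrative:

import numpy as np

def federated_average(client_weights, client_sizes):
    # Weighted average of client model weights by local dataset size
    total = sum(client_sizes)
    return sum(w * (n / total) for w, n in zip(client_weights, client_sizes))

# Three clients with different amounts of local data
weights = [np.array([0.1, 0.2]), np.array([0.3, 0.1]), np.array([0.2, 0.4])]
sizes = [100, 50, 150]
print(federated_average(weights, sizes))   # new global model weights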
Model tracking with MLflow
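A minimal sketch of the tracking workflow this slide shows, assuming it runs in a Databricks notebook; parameter and metric names are illustrative:

import mlflow

with mlflow.start_run(run_name="wine-quality-dl"):
    mlflow.log_param("lr", 0.01)
    mlflow.log_param("epochs", 10)
    # ... train the model ...
    mlflow.log_metric("val_loss", 0.42)
    # For TensorFlow/Keras models, mlflow.tensorflow.autolog() captures
    # parameters, metrics and the model automatically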
Send a request
curl -X POST -H "Content-Type: application/json; format=pandas-split" \
  --data '{"columns": ["alcohol", "chlorides", "citric acid"], "data": [[12.8, 0.029, 0.48]]}' \
  http://127.0.0.1:1234/invocations
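The same request from Python, as a sketch assuming the model is served locally on port 1234 (for example via mlflow models serve):

import requests

# Score one record against the locally served MLflow model
response = requests.post(
    "http://127.0.0.1:1234/invocations",
    headers={"Content-Type": "application/json; format=pandas-split"},
    json={"columns": ["alcohol", "chlorides", "citric acid"],
          "data": [[12.8, 0.029, 0.48]]},
)
print(response.json())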
Thank you!