Cloud & Data Engineering Syllabus (40+ hours)
Data Engineering Roadmap:
Data Engineering Roadmap Video: https://youtu.be/8uVRbry5A2U?feature=shared
Data Engineering Basics (the following topics are covered):
Data warehousing
SQL
Python
Linux
Data Engineering Core:
Apache Hadoop - (HDFS as the landing zone for all incoming files)
Introduction to Big Data & Hadoop Fundamentals
Dimensions of Big Data
Types of data generation
Apache Hadoop ecosystem & its projects
Hadoop distributions
HDFS core concepts
Modes of Hadoop deployment
HDFS flow architecture
Hadoop MRv1 vs MRv2 (YARN) architecture
Types of Data compression techniques
Rack topology/awareness
HDFS utility commands with usage examples
Minimum hardware requirements for a cluster & property file changes
Apache Spark - (Spark job deployment with Python programming)
Introduction to Spark & features
Spark Core & SparkSQL concepts
Actions & transformations logic
Spark scripts to read & write tables in HBase & S3 buckets (see the sketch below)
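For illustration, a minimal PySpark sketch of such a job, assuming the cluster already has S3 access configured (e.g. EMRFS on EMR); the bucket names, paths and the "id" column are hypothetical placeholders, and the HBase read/write path needs an additional connector that is not shown here:
    from pyspark.sql import SparkSession

    # Start (or reuse) a Spark session for the job
    spark = SparkSession.builder.appName("s3-read-write-demo").getOrCreate()

    # Read raw CSV files from the landing-zone bucket
    df = spark.read.option("header", "true").csv("s3://my-landing-bucket/incoming/")

    # Transformation: keep only rows with a non-null id column
    clean_df = df.filter(df["id"].isNotNull())

    # Action: write the transformed data back as Parquet to the curated bucket
    clean_df.write.mode("overwrite").parquet("s3://my-curated-bucket/output/")

    spark.stop()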
Apache HBase - (NoSQL database)
Introduction to HBase concepts & features
Introduction to NoSQL/CAP theorem concepts
HBase design/architecture flow
HBase table commands (see the sketch below)
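A minimal sketch of basic table operations using the happybase Python client (an assumption, since the class may instead use the HBase shell); it requires a running HBase Thrift server, and the table, column family and row key names are hypothetical:
    import happybase

    # Connect to the HBase Thrift server (default Thrift port is 9090)
    connection = happybase.Connection(host="localhost", port=9090)

    # Create a table with a single column family
    connection.create_table("users", {"profile": dict()})
    table = connection.table("users")

    # Put a row keyed by user id, then read it back
    table.put(b"user#1001", {b"profile:name": b"Asha", b"profile:city": b"Chennai"})
    print(table.row(b"user#1001"))

    connection.close()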
Apache Airflow - (workflow orchestration)
Airflow Introduction
Installation
Architecture
Sample project (see the DAG sketch below)
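A minimal DAG sketch for the sample project (Airflow 2.4+ assumed); the DAG id and task logic are placeholders:
    from datetime import datetime
    from airflow import DAG
    from airflow.operators.python import PythonOperator

    def extract():
        print("extracting data ...")

    def load():
        print("loading data ...")

    # One DAG with two tasks chained in sequence
    with DAG(
        dag_id="sample_etl_dag",
        start_date=datetime(2024, 1, 1),
        schedule="@daily",
        catchup=False,
    ) as dag:
        extract_task = PythonOperator(task_id="extract", python_callable=extract)
        load_task = PythonOperator(task_id="load", python_callable=load)
        extract_task >> load_task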
Apache Kafka + Streaming - (streaming data from data sources)
Introduction to Kafka and what streaming data is
Installing & working with Kafka
Projects in Kafka (see the sketch below)
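A minimal producer/consumer sketch using the kafka-python package (an assumption; the broker address and topic name are hypothetical placeholders):
    from kafka import KafkaProducer, KafkaConsumer

    # Produce one JSON message to the "orders" topic
    producer = KafkaProducer(bootstrap_servers="localhost:9092")
    producer.send("orders", b'{"order_id": 1, "amount": 250}')
    producer.flush()

    # Consume from the beginning of the topic, stopping after 5 s of silence
    consumer = KafkaConsumer(
        "orders",
        bootstrap_servers="localhost:9092",
        auto_offset_reset="earliest",
        consumer_timeout_ms=5000,
    )
    for message in consumer:
        print(message.value)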
Cloud Computing Services
Basic overview of the cloud
Different types of cloud models
Different types of cloud services
Different vendors of cloud implementation
Why choose AWS?
Features of AWS and key offerings
AWS S3 (create buckets to hold both the ingested data & the transformed data)
What is AWS S3 & what is it used for?
What are AWS S3 buckets and how to create buckets in the AWS Console?
How to upload and manage files in AWS S3 (see the sketch below)
Features & advantages of S3
How does AWS S3 work?
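A minimal boto3 sketch of creating a bucket and uploading a file (assumes configured AWS credentials and the us-east-1 region; the bucket and file names are hypothetical placeholders, and bucket names must be globally unique):
    import boto3

    s3 = boto3.client("s3")

    # Create a bucket, upload a local file, then list the bucket contents
    s3.create_bucket(Bucket="my-demo-landing-bucket")
    s3.upload_file("sales.csv", "my-demo-landing-bucket", "incoming/sales.csv")

    response = s3.list_objects_v2(Bucket="my-demo-landing-bucket")
    for obj in response.get("Contents", []):
        print(obj["Key"], obj["Size"])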
AWS EC2 - (Instance creation with VPC, Networking, Security Groups etc.)
What is EC2 and its important features?
Types of EC2 computing instances
How to create EC2 instances by selecting an AMI, Security Groups & VPC, and connect using PuTTY (see the boto3 sketch below)
What are the advantages of EC2 instances?
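A minimal boto3 sketch of launching an instance, mirroring the choices made in the Console walkthrough above (the AMI, key pair, security group and subnet IDs are hypothetical placeholders):
    import boto3

    ec2 = boto3.client("ec2")

    # Launch a single t2.micro instance into a chosen subnet and security group
    response = ec2.run_instances(
        ImageId="ami-0123456789abcdef0",          # placeholder AMI
        InstanceType="t2.micro",
        KeyName="my-keypair",                     # key pair later used with PuTTY/SSH
        SecurityGroupIds=["sg-0123456789abcdef0"],
        SubnetId="subnet-0123456789abcdef0",
        MinCount=1,
        MaxCount=1,
    )
    print(response["Instances"][0]["InstanceId"])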
AWS EMR - (spin up a cluster for deploying Spark jobs)
What is EMR used for, and how does it relate to big data concepts?
How to launch and configure the EMR service
Run a sample Spark program and view the job details to analyse big data (see the sketch below)
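A minimal boto3 sketch that submits a Spark step to an already-running EMR cluster (the cluster ID and script location are hypothetical placeholders):
    import boto3

    emr = boto3.client("emr")

    # Add a spark-submit step to an existing cluster
    response = emr.add_job_flow_steps(
        JobFlowId="j-0123456789ABC",              # placeholder cluster ID
        Steps=[
            {
                "Name": "sample-spark-job",
                "ActionOnFailure": "CONTINUE",
                "HadoopJarStep": {
                    "Jar": "command-runner.jar",
                    "Args": ["spark-submit", "s3://my-demo-scripts/etl_job.py"],
                },
            }
        ],
    )
    print(response["StepIds"])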
AWS Athena (Query data in S3 buckets)
What is Amazon Athena and its features?
How to create databases & tables in Athena from S3 buckets and from DDL?
How to use Athena with other AWS services, with a use case (see the sketch below)
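A minimal boto3 sketch of running an Athena query (the database, table and results bucket are hypothetical placeholders):
    import boto3

    athena = boto3.client("athena")

    # Start a query; results land in the configured S3 output location
    response = athena.start_query_execution(
        QueryString="SELECT * FROM sales LIMIT 10",
        QueryExecutionContext={"Database": "demo_db"},
        ResultConfiguration={"OutputLocation": "s3://my-demo-athena-results/"},
    )
    print(response["QueryExecutionId"])
    # Rows can then be fetched with get_query_results once the query succeeds.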
AWS CLI (To query data using CLI commands)
What is the AWS CLI and its features?
How to use CloudShell for accessing AWS services?
How to use the command line interface for triggering & querying datasets in S3 buckets
AWS DynamoDB (NoSQL DB to hold the configuration mappings)
What is Amazon DynamoDB and its features?
How to create, insert into and query a table in DynamoDB (see the sketch below)
How to integrate DynamoDB with other AWS services
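A minimal boto3 sketch of writing and reading a configuration item (assumes a table named job_config with a partition key job_name already exists; all names are hypothetical placeholders):
    import boto3

    dynamodb = boto3.resource("dynamodb")
    table = dynamodb.Table("job_config")

    # Insert a configuration mapping item
    table.put_item(Item={"job_name": "daily_sales_load",
                         "source_bucket": "my-landing-bucket"})

    # Read it back by its partition key
    item = table.get_item(Key={"job_name": "daily_sales_load"})
    print(item.get("Item"))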
AWS Lambda (serverless compute service)
What is Amazon Lambda and its features?
How to write a simple, basic Lambda function (see the sketch below)
How to integrate Lambda + S3 with other AWS services
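A minimal Lambda handler sketch for an S3 object-created trigger (the function body is a placeholder that only logs the new object):
    import json

    def lambda_handler(event, context):
        # Each record describes one object that landed in the bucket
        for record in event.get("Records", []):
            bucket = record["s3"]["bucket"]["name"]
            key = record["s3"]["object"]["key"]
            print(f"New object s3://{bucket}/{key}")
        return {"statusCode": 200, "body": json.dumps("processed")}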
AWS Glue (to perform file format conversions)
Use AWS Glue Crawlers to discover the schema of your data in S3.
Create an AWS Glue Data Catalog to store metadata information.
Develop AWS Glue ETL jobs to transform the data using SparkSQL.
Utilize AWS Glue Dynamic Frames for schema flexibility (see the sketch below).
Schedule and orchestrate ETL jobs using AWS Glue Triggers and Workflows.
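A minimal Glue ETL job sketch that reads a crawled table from the Data Catalog and writes it back to S3 as Parquet (the database, table and output path are hypothetical placeholders):
    import sys
    from awsglue.utils import getResolvedOptions
    from awsglue.context import GlueContext
    from awsglue.job import Job
    from pyspark.context import SparkContext

    args = getResolvedOptions(sys.argv, ["JOB_NAME"])
    glue_context = GlueContext(SparkContext.getOrCreate())
    job = Job(glue_context)
    job.init(args["JOB_NAME"], args)

    # Read the catalog table as a DynamicFrame (schema flexibility), then
    # convert the files to Parquet in the curated bucket
    dyf = glue_context.create_dynamic_frame.from_catalog(database="demo_db", table_name="sales")
    glue_context.write_dynamic_frame.from_options(
        frame=dyf,
        connection_type="s3",
        connection_options={"path": "s3://my-curated-bucket/sales_parquet/"},
        format="parquet",
    )
    job.commit()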
AWS Step Functions (to orchestrate all the workflows step by step)
What is Step Functions and its features?
How to orchestrate workflows with different AWS Services?
How to define Tasks, States and create State Machines in AWS (see the sketch below)
How to integrate Step functions in AWS with other services
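A minimal boto3 sketch that defines a two-state workflow in Amazon States Language and creates the state machine (the Lambda ARNs and IAM role ARN are hypothetical placeholders):
    import json
    import boto3

    sfn = boto3.client("stepfunctions")

    # Two Task states chained in sequence: extract, then load
    definition = {
        "StartAt": "ExtractData",
        "States": {
            "ExtractData": {
                "Type": "Task",
                "Resource": "arn:aws:lambda:us-east-1:123456789012:function:extract",
                "Next": "LoadData",
            },
            "LoadData": {
                "Type": "Task",
                "Resource": "arn:aws:lambda:us-east-1:123456789012:function:load",
                "End": True,
            },
        },
    }

    response = sfn.create_state_machine(
        name="demo-etl-workflow",
        definition=json.dumps(definition),
        roleArn="arn:aws:iam::123456789012:role/demo-stepfunctions-role",
    )
    print(response["stateMachineArn"])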
How to enrol in this course: If you're interested in joining this course, please feel free to contact us:
Call: +91 90424 63272, +91 93422 72961
WhatsApp - +91 96196 63272
Email: admin@tamilboomi.com