Introduction To Big data-21CS753-syllabus

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

Uploaded by

DARSHAN DARSH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

733 views3 pages

Introduction To Big data-21CS753-syllabus

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

Uploaded by

DARSHAN DARSH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

VII Semester

INTRODUCTION TO BIG DATA

Course Code 21CS753 CIE Marks 50
Teaching Hours/Week (L:T:P: S) 3:0:0:0 SEE Marks 50
Total Hours of Pedagogy 40 Total Marks 100
Credits 03 Exam Hours 03
Course Learning Objectives

CLO 1. Understand Hadoop Distributed File system and examine MapReduce Programming
CLO 2. Explore Hadoop tools and manage Hadoop with Sqoop
CLO 3. Appraise the role of data mining and its applications across industries
CLO 4. Identify various Text Mining techniques
Teaching-Learning Process (General Instructions)

These are sample Strategies, which teachers can use to accelerate the attainment of the various course
outcomes.
1. Lecturer method (L) need not to be only a traditional lecture method, but alternative
effective teaching methods could be adopted to attain the outcomes.
2. Use of Video/Animation to explain functioning of various concepts.
3. Encourage collaborative (Group Learning) Learning in the class.
4. Ask at least three HOT (Higher order Thinking) questions in the class, which promotes
critical thinking.
5. Adopt Problem Based Learning (PBL), which fosters students’ Analytical skills, develop
design thinking skills such as the ability to design, evaluate, generalize, and analyze
information rather than simply recall it.
6. Introduce Topics in manifold representations.
7. Show the different ways to solve the same problem with different circuits/logic and
encourage the students to come up with their own creative ways to solve them.
8. Discuss how every concept can be applied to the real world - and when that's possible, it
helps improve the students' understanding.

Module-1
Hadoop Distributed file system:HDFS Design, Features, HDFS Components, HDFS user commands
Hadoop MapReduce Framework: The MapReduce Model, Map-reduce Parallel Data Flow,Map Reduce
Programming

Textbook 1: Chapter 3,5,68hr

Teaching-Learning Process Chalk and board, Active Learning, Problem based learning
Module-2
Essential Hadoop Tools:Using apache Pig, Using Apache Hive, Using Apache Sqoop, Using Apache
Apache Flume, Apache H Base

Textbook 1: Chapter 78hr

Teaching-Learning Process Chalk and board, Active Learning, Demonstration
Module-3
Data Warehousing: Introduction, Design Consideration, DW Development Approaches, DW
Architectures

Data Mining: Introduction, Gathering, and Selection, data cleaning and preparation, outputs ofData
Mining, Data Mining Techniques

Textbook 2: Chapter 4,5

Teaching-Learning Process Chalk and board, Problem based learning, Demonstration
Module-4
Decision Trees: Introduction, Decision Tree Problem, Decision Tree Constructions, Lessons from
Construction Trees. Decision Tree Algorithm

Regressions: Introduction, Correlations and Relationships, Non-Linear Regression, Logistic Regression,

Advantages and disadvantages.

Textbook 2: Chapter 6,7

Teaching-Learning Process Chalk& board, Problem based learning
Module-5
Text Mining: Introduction, Text Mining Applications, Text Mining Process, Term Document Matrix,
Mining the TDM, Comparison, Best Practices

Web Mining: Introduction, Web Content Mining, Web Structured Mining, Web Usage Mining, Web
Mining Algorithms.

Textbook 2: Chapter 11,14

Teaching-Learning Process Chalk and board, MOOC
Suggested Course Outcomes
At the end of the course the students will be able to:
CO 1. Master the concepts of HDFS and MapReduce framework.
CO 2. Investigate Hadoop related tools for Big Data Analytics and perform basic
CO 3. Infer the importance of core data mining techniques for data analytics
CO 4. Use Machine Learning algorithms for real world big data.
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%.
The minimum passing mark for the CIE is 40% of the maximum marks (20 marks). A student shall be
deemed to have satisfied the academic requirements and earned the credits allotted to each subject/
course if the student secures not less than 35% (18 Marks out of 50) in the semester-end examination
(SEE), and a minimum of 40% (40 marks out of 100) in the sum total of the CIE (Continuous Internal
Evaluation) and SEE (Semester End Examination) taken together
Continuous Internal Evaluation:
Three Unit Tests each of 20 Marks (duration 01 hour)
1. First test at the end of 5th week of the semester
2. Second test at the end of the 10th week of the semester
3. Third test at the end of the 15th week of the semester
Two assignments each of 10 Marks
4. First assignment at the end of 4th week of the semester
5. Second assignment at the end of 9th week of the semester
Group discussion/Seminar/quiz any one of three suitably planned to attain the COs and POs for 20
Marks (duration 01 hours)
6. At the end of the 13th week of the semester
The sum of three tests, two assignments, and quiz/seminar/group discussion will be out of 100 marks
and will be scaled down to 50 marks
(to have less stressed CIE, the portion of the syllabus should not be common /repeated for any of the
methods of the CIE. Each method of CIE should have a different syllabus portion of the course).
CIE methods /question paper has to be designed to attain the different levels of Bloom’s
taxonomy as per the outcome defined for the course.
Semester End Examination:
Theory SEE will be conducted by University as per the scheduled timetable, with common question
papers for the subject (duration 03 hours)
1. The question paper will have ten questions. Each question is set for 20 marks. Marks scored
shall be proportionally reduced to 50 marks
2. There will be 2 questions from each module. Each of the two questions under a module (with a
maximum of 3 sub-questions), should have a mix of topics under that module.

The students have to answer 5 full questions, selecting one full question from each module.
Textbooks
1. Douglas Eadline,"Hadoop 2 Quick-Start Guide: Learn the Essentials of Big DataComputing in
the Apache Hadoop 2 Ecosystem", 1stEdition, Pearson Education,2016.
2. Anil Maheshwari, “Data Analytics”, 1stEdition, McGraw Hill Education,2017
Weblinks and Video Lectures (e-Resources):
1. https://nptel.ac.in/courses/106/104/106104189/
2. https://www.youtube.com/watch?v=mNP44rZYiAU
3. https://www.youtube.com/watch?v=qr_awo5vz0g
4. https://www.youtube.com/watch?v=rr17cbPGWGA
5. https://www.youtube.com/watch?v=G4NYQox4n2g
6. https://www.youtube.com/watch?v=owI7zxCqNY0
7. https://www.youtube.com/watch?v=FuJVLsZYkuE
Activity Based Learning (Suggested Activities in Class)/ Practical Based learning
Real world problem solving: Demonstration of Big Data related projects
Exploring the applications which involves big data.

Session 02
No ratings yet
Session 02
16 pages
Big data analytics notes
No ratings yet
Big data analytics notes
33 pages
10.1007/978 1 4842 1910 2 PDF
No ratings yet
10.1007/978 1 4842 1910 2 PDF
304 pages
(Studies in Big Data) Mamta Mittal - Valentina E. Balas - Lalit Mohan Goyal - Raghvendra Kumar - Big Data Processing Using Spark in Cloud (2019, Springer) PDF
No ratings yet
(Studies in Big Data) Mamta Mittal - Valentina E. Balas - Lalit Mohan Goyal - Raghvendra Kumar - Big Data Processing Using Spark in Cloud (2019, Springer) PDF
274 pages
CS1713-Blockchain Technologies Lecture Notes-Unit I
No ratings yet
CS1713-Blockchain Technologies Lecture Notes-Unit I
40 pages
Govt School Rejuvenation
No ratings yet
Govt School Rejuvenation
14 pages
Module-2 21ec33 Notes Updated
No ratings yet
Module-2 21ec33 Notes Updated
51 pages
Balamurugan - Big Data Concepts, Technology and Architecture
No ratings yet
Balamurugan - Big Data Concepts, Technology and Architecture
371 pages
Internship Preview at VTU Roman Technologies
No ratings yet
Internship Preview at VTU Roman Technologies
13 pages
How Map Reduce Work
No ratings yet
How Map Reduce Work
99 pages
BDA Exp Removed Removed
No ratings yet
BDA Exp Removed Removed
33 pages
SYBSc Data Science Sem IV NEP Syllabus 2024-2025
No ratings yet
SYBSc Data Science Sem IV NEP Syllabus 2024-2025
65 pages
Complete Hadoop Map Reduce Hive Setup Step by Step
No ratings yet
Complete Hadoop Map Reduce Hive Setup Step by Step
30 pages
21EC71 Advanced VLSI Module 1 - Module
No ratings yet
21EC71 Advanced VLSI Module 1 - Module
43 pages
Unit 1 BD PDF
No ratings yet
Unit 1 BD PDF
26 pages
VLSI Design Syllabus - 2018 Scheme
No ratings yet
VLSI Design Syllabus - 2018 Scheme
2 pages
Iot Betck105h Notes
No ratings yet
Iot Betck105h Notes
110 pages
Blockchain-Mini-project Report
No ratings yet
Blockchain-Mini-project Report
13 pages
Big Data Unit -1
No ratings yet
Big Data Unit -1
17 pages
Ece 3-1 Lab Manual
100% (1)
Ece 3-1 Lab Manual
269 pages
Internship - Presentation For VTU
No ratings yet
Internship - Presentation For VTU
15 pages
7th Cssyll
No ratings yet
7th Cssyll
49 pages
Hadoop Architec
No ratings yet
Hadoop Architec
14 pages
Hadoop and Their Ecosystem
100% (2)
Hadoop and Their Ecosystem
24 pages
Hadoop Tools - A Brief Overview
No ratings yet
Hadoop Tools - A Brief Overview
18 pages
IoT Module 5 IoT Case Studies and Future Trends
0% (1)
IoT Module 5 IoT Case Studies and Future Trends
48 pages
Rooman VLSI MiniProject List
No ratings yet
Rooman VLSI MiniProject List
3 pages
Python Module5 Notes
No ratings yet
Python Module5 Notes
36 pages
3 2 DS Mid2 BDA
No ratings yet
3 2 DS Mid2 BDA
2 pages
Academor Student Brochure ...
No ratings yet
Academor Student Brochure ...
15 pages
Big Data Hadoop Training 8214944.ppsx
No ratings yet
Big Data Hadoop Training 8214944.ppsx
52 pages
DocScanner 27 Jun 2024 9 47 PM
No ratings yet
DocScanner 27 Jun 2024 9 47 PM
42 pages
RTK Notes m1
No ratings yet
RTK Notes m1
16 pages
Ecschsyll 21 Scheme
No ratings yet
Ecschsyll 21 Scheme
174 pages
21EC63 Module 4B
No ratings yet
21EC63 Module 4B
29 pages
KIT MaFoi 2025 Batch
No ratings yet
KIT MaFoi 2025 Batch
32 pages
DC Module2 Notes
No ratings yet
DC Module2 Notes
32 pages
Introduction To Hadoop
No ratings yet
Introduction To Hadoop
5 pages
Updated 5th and 6th Sem 2021 Scheme and Syllabus
No ratings yet
Updated 5th and 6th Sem 2021 Scheme and Syllabus
71 pages
Unit 5 2 Marks
No ratings yet
Unit 5 2 Marks
10 pages
Big Data Ia Answers
No ratings yet
Big Data Ia Answers
14 pages
Module 2 Exmples
No ratings yet
Module 2 Exmples
8 pages
Dept. of Information Technology Class - SE: Programming Skill Development Lab
No ratings yet
Dept. of Information Technology Class - SE: Programming Skill Development Lab
8 pages
Module 1 NS Notes
100% (1)
Module 1 NS Notes
27 pages
A Thorough Introduction To Distributed Systems
No ratings yet
A Thorough Introduction To Distributed Systems
31 pages
Top Network & Cyber Security Viva Question With Answer
0% (1)
Top Network & Cyber Security Viva Question With Answer
5 pages
IV CSE Handbook SEM 1 CSE 21-22-79-104 ES
No ratings yet
IV CSE Handbook SEM 1 CSE 21-22-79-104 ES
26 pages
Big Data Analytics
No ratings yet
Big Data Analytics
1 page
Data Mining With Hadoop and Hive Introduction To Architecture
No ratings yet
Data Mining With Hadoop and Hive Introduction To Architecture
39 pages
Module 2: Divide and Conquer: Design and Analysis of Algorithms 18CS42
No ratings yet
Module 2: Divide and Conquer: Design and Analysis of Algorithms 18CS42
82 pages
Besck104e-204e
No ratings yet
Besck104e-204e
3 pages
Module 1
No ratings yet
Module 1
53 pages
Data Mining-Constraint Based Cluster Analysis
100% (1)
Data Mining-Constraint Based Cluster Analysis
4 pages
Mitt Robo Challenge - 20241031 - 113639 - 0000
No ratings yet
Mitt Robo Challenge - 20241031 - 113639 - 0000
15 pages
Report On Robotics
No ratings yet
Report On Robotics
40 pages
Python Module
No ratings yet
Python Module
2 pages
Technical Seminar Report and PPT Format
100% (1)
Technical Seminar Report and PPT Format
2 pages
VLSI Assignment 1
No ratings yet
VLSI Assignment 1
16 pages
8th Sem Vtu Syllabus
No ratings yet
8th Sem Vtu Syllabus
15 pages
Seminar
No ratings yet
Seminar
17 pages
Cloudera CCD 410
100% (1)
Cloudera CCD 410
21 pages
21EC61 Model Question Paper
50% (2)
21EC61 Model Question Paper
2 pages
Chapter-1-IoT For Ist Sem Final For Module1 Updated 28-12-22 (Autosaved)
No ratings yet
Chapter-1-IoT For Ist Sem Final For Module1 Updated 28-12-22 (Autosaved)
60 pages
Data Analytics For Ioe: Syllabus
No ratings yet
Data Analytics For Ioe: Syllabus
23 pages
ECE Syllabus 2013-2017 Old PDF
No ratings yet
ECE Syllabus 2013-2017 Old PDF
48 pages
BIG DATA-SPARK LAB SYLLABUS
No ratings yet
BIG DATA-SPARK LAB SYLLABUS
2 pages
IoT-Enabling-Technologies
No ratings yet
IoT-Enabling-Technologies
17 pages
21 Scheme Nss All Cerificates
No ratings yet
21 Scheme Nss All Cerificates
4 pages
21CS34 SIMP Questions - 21SCHEME: Module-1 (Study Any 5 Questions)
No ratings yet
21CS34 SIMP Questions - 21SCHEME: Module-1 (Study Any 5 Questions)
4 pages
Weather Monitoring System
No ratings yet
Weather Monitoring System
22 pages
ABEYAANTRIX Report
No ratings yet
ABEYAANTRIX Report
7 pages
CSS Notes
No ratings yet
CSS Notes
3 pages
College Fest014
No ratings yet
College Fest014
5 pages
Big Data Testing
No ratings yet
Big Data Testing
9 pages
Design and Implementation of Traffic Lights Controller Using Fpga
No ratings yet
Design and Implementation of Traffic Lights Controller Using Fpga
29 pages
21MAT41
No ratings yet
21MAT41
5 pages
The Role of Big Data Analytics For The Internet of Things (Iot)
No ratings yet
The Role of Big Data Analytics For The Internet of Things (Iot)
15 pages
San 18cs822 Module Wise Questions
No ratings yet
San 18cs822 Module Wise Questions
3 pages
1 - 30 - VLSI Major Project Titles List 2021
No ratings yet
1 - 30 - VLSI Major Project Titles List 2021
3 pages
IO Blocks and Programmable Interconnection Points
No ratings yet
IO Blocks and Programmable Interconnection Points
29 pages
HTML Notes
No ratings yet
HTML Notes
2 pages
Domain Specific Iot
No ratings yet
Domain Specific Iot
17 pages
Devops Syllabus
No ratings yet
Devops Syllabus
4 pages
Question 2
No ratings yet
Question 2
3 pages
Manoj G Activity Report
No ratings yet
Manoj G Activity Report
20 pages
Internship Report Anthony and Joshil PDF
No ratings yet
Internship Report Anthony and Joshil PDF
20 pages
21EC62 Model Question Paper
No ratings yet
21EC62 Model Question Paper
5 pages
Case Study: Using MongoDB For An E-Commerce Platform
100% (8)
Case Study: Using MongoDB For An E-Commerce Platform
32 pages
Big Data Testing: Why & How
No ratings yet
Big Data Testing: Why & How
1 page
Apache Spark 24 Hours PDF
100% (6)
Apache Spark 24 Hours PDF
1,129 pages
Module 1 Question Bank Dspa - Kms
No ratings yet
Module 1 Question Bank Dspa - Kms
1 page
CS1403 CASE Tools Lab Manual
100% (2)
CS1403 CASE Tools Lab Manual
67 pages
Sample Report 22-23 1
No ratings yet
Sample Report 22-23 1
30 pages
DBMS Mini Project Report 2-12-2022
No ratings yet
DBMS Mini Project Report 2-12-2022
6 pages
Module#2
No ratings yet
Module#2
1 page
Model Question Paper II - 21cs642 - 6 Sem (2021 Scheme)
No ratings yet
Model Question Paper II - 21cs642 - 6 Sem (2021 Scheme)
2 pages
Fs Lab Manual
No ratings yet
Fs Lab Manual
57 pages
21ec741 Iot & WSN
100% (1)
21ec741 Iot & WSN
1 page
NPTEL Domain
No ratings yet
NPTEL Domain
1 page
ESD & IOT Syllabus
No ratings yet
ESD & IOT Syllabus
4 pages
Updated Resume Kaar
No ratings yet
Updated Resume Kaar
1 page
Ece-Vii-dsp Algorithms & Architecture (10ec751) - Question Paper
No ratings yet
Ece-Vii-dsp Algorithms & Architecture (10ec751) - Question Paper
9 pages
Final Year Projects List Wireless Communication
No ratings yet
Final Year Projects List Wireless Communication
5 pages
Hadoop in Action
No ratings yet
Hadoop in Action
1 page
Trackpad Ver. 2.0 Class 8
From Everand
Trackpad Ver. 2.0 Class 8
Nidhi Arora
No ratings yet
Touchpad Plus Ver. 4.0 Class 7
From Everand
Touchpad Plus Ver. 4.0 Class 7
Nidhi Gupta
No ratings yet
Trackpad Pro Ver. 5.0 Class 6
From Everand
Trackpad Pro Ver. 5.0 Class 6
Nidhi Arora
No ratings yet
C & Data Structures
From Everand
C & Data Structures
Prof. P. Padmanabham
No ratings yet
Textbook of Engineering Chemistry
From Everand
Textbook of Engineering Chemistry
C. Parameswara Murthy
No ratings yet
Introduction to Linux: Installation and Programming
From Everand
Introduction to Linux: Installation and Programming
N. B. Venkateswarlu
No ratings yet
Touchpad Plus Ver. 1.1 Class 7
From Everand
Touchpad Plus Ver. 1.1 Class 7
Nisha Batra
No ratings yet

Introduction To Big data-21CS753-syllabus

Uploaded by

Introduction To Big data-21CS753-syllabus

Uploaded by

VII Semester

INTRODUCTION TO BIG DATA

Textbook 1: Chapter 3,5,68hr

Textbook 1: Chapter 78hr

Textbook 2: Chapter 4,5

Regressions: Introduction, Correlations and Relationships, Non-Linear Regression, Logistic Regression,

Textbook 2: Chapter 6,7

Textbook 2: Chapter 11,14

You might also like