[go: up one dir, main page]

0% found this document useful (0 votes)
733 views3 pages

Introduction To Big data-21CS753-syllabus

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

Uploaded by

DARSHAN DARSH
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
733 views3 pages

Introduction To Big data-21CS753-syllabus

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

Uploaded by

DARSHAN DARSH
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

VII Semester

INTRODUCTION TO BIG DATA


Course Code 21CS753 CIE Marks 50
Teaching Hours/Week (L:T:P: S) 3:0:0:0 SEE Marks 50
Total Hours of Pedagogy 40 Total Marks 100
Credits 03 Exam Hours 03
Course Learning Objectives

CLO 1. Understand Hadoop Distributed File system and examine MapReduce Programming
CLO 2. Explore Hadoop tools and manage Hadoop with Sqoop
CLO 3. Appraise the role of data mining and its applications across industries
CLO 4. Identify various Text Mining techniques
Teaching-Learning Process (General Instructions)

These are sample Strategies, which teachers can use to accelerate the attainment of the various course
outcomes.
1. Lecturer method (L) need not to be only a traditional lecture method, but alternative
effective teaching methods could be adopted to attain the outcomes.
2. Use of Video/Animation to explain functioning of various concepts.
3. Encourage collaborative (Group Learning) Learning in the class.
4. Ask at least three HOT (Higher order Thinking) questions in the class, which promotes
critical thinking.
5. Adopt Problem Based Learning (PBL), which fosters students’ Analytical skills, develop
design thinking skills such as the ability to design, evaluate, generalize, and analyze
information rather than simply recall it.
6. Introduce Topics in manifold representations.
7. Show the different ways to solve the same problem with different circuits/logic and
encourage the students to come up with their own creative ways to solve them.
8. Discuss how every concept can be applied to the real world - and when that's possible, it
helps improve the students' understanding.

Module-1
Hadoop Distributed file system:HDFS Design, Features, HDFS Components, HDFS user commands
Hadoop MapReduce Framework: The MapReduce Model, Map-reduce Parallel Data Flow,Map Reduce
Programming

Textbook 1: Chapter 3,5,68hr


Teaching-Learning Process Chalk and board, Active Learning, Problem based learning
Module-2
Essential Hadoop Tools:Using apache Pig, Using Apache Hive, Using Apache Sqoop, Using Apache
Apache Flume, Apache H Base

Textbook 1: Chapter 78hr


Teaching-Learning Process Chalk and board, Active Learning, Demonstration
Module-3
Data Warehousing: Introduction, Design Consideration, DW Development Approaches, DW
Architectures

Data Mining: Introduction, Gathering, and Selection, data cleaning and preparation, outputs ofData
Mining, Data Mining Techniques

Textbook 2: Chapter 4,5


Teaching-Learning Process Chalk and board, Problem based learning, Demonstration
Module-4
Decision Trees: Introduction, Decision Tree Problem, Decision Tree Constructions, Lessons from
Construction Trees. Decision Tree Algorithm

Regressions: Introduction, Correlations and Relationships, Non-Linear Regression, Logistic Regression,


Advantages and disadvantages.

Textbook 2: Chapter 6,7


Teaching-Learning Process Chalk& board, Problem based learning
Module-5
Text Mining: Introduction, Text Mining Applications, Text Mining Process, Term Document Matrix,
Mining the TDM, Comparison, Best Practices

Web Mining: Introduction, Web Content Mining, Web Structured Mining, Web Usage Mining, Web
Mining Algorithms.

Textbook 2: Chapter 11,14


Teaching-Learning Process Chalk and board, MOOC
Suggested Course Outcomes
At the end of the course the students will be able to:
CO 1. Master the concepts of HDFS and MapReduce framework.
CO 2. Investigate Hadoop related tools for Big Data Analytics and perform basic
CO 3. Infer the importance of core data mining techniques for data analytics
CO 4. Use Machine Learning algorithms for real world big data.
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%.
The minimum passing mark for the CIE is 40% of the maximum marks (20 marks). A student shall be
deemed to have satisfied the academic requirements and earned the credits allotted to each subject/
course if the student secures not less than 35% (18 Marks out of 50) in the semester-end examination
(SEE), and a minimum of 40% (40 marks out of 100) in the sum total of the CIE (Continuous Internal
Evaluation) and SEE (Semester End Examination) taken together
Continuous Internal Evaluation:
Three Unit Tests each of 20 Marks (duration 01 hour)
1. First test at the end of 5th week of the semester
2. Second test at the end of the 10th week of the semester
3. Third test at the end of the 15th week of the semester
Two assignments each of 10 Marks
4. First assignment at the end of 4th week of the semester
5. Second assignment at the end of 9th week of the semester
Group discussion/Seminar/quiz any one of three suitably planned to attain the COs and POs for 20
Marks (duration 01 hours)
6. At the end of the 13th week of the semester
The sum of three tests, two assignments, and quiz/seminar/group discussion will be out of 100 marks
and will be scaled down to 50 marks
(to have less stressed CIE, the portion of the syllabus should not be common /repeated for any of the
methods of the CIE. Each method of CIE should have a different syllabus portion of the course).
CIE methods /question paper has to be designed to attain the different levels of Bloom’s
taxonomy as per the outcome defined for the course.
Semester End Examination:
Theory SEE will be conducted by University as per the scheduled timetable, with common question
papers for the subject (duration 03 hours)
1. The question paper will have ten questions. Each question is set for 20 marks. Marks scored
shall be proportionally reduced to 50 marks
2. There will be 2 questions from each module. Each of the two questions under a module (with a
maximum of 3 sub-questions), should have a mix of topics under that module.

The students have to answer 5 full questions, selecting one full question from each module.
Textbooks
1. Douglas Eadline,"Hadoop 2 Quick-Start Guide: Learn the Essentials of Big DataComputing in
the Apache Hadoop 2 Ecosystem", 1stEdition, Pearson Education,2016.
2. Anil Maheshwari, “Data Analytics”, 1stEdition, McGraw Hill Education,2017
Weblinks and Video Lectures (e-Resources):
1. https://nptel.ac.in/courses/106/104/106104189/
2. https://www.youtube.com/watch?v=mNP44rZYiAU
3. https://www.youtube.com/watch?v=qr_awo5vz0g
4. https://www.youtube.com/watch?v=rr17cbPGWGA
5. https://www.youtube.com/watch?v=G4NYQox4n2g
6. https://www.youtube.com/watch?v=owI7zxCqNY0
7. https://www.youtube.com/watch?v=FuJVLsZYkuE
Activity Based Learning (Suggested Activities in Class)/ Practical Based learning
Real world problem solving: Demonstration of Big Data related projects
Exploring the applications which involves big data.

You might also like