[go: up one dir, main page]

0% found this document useful (0 votes)
336 views2 pages

Big Data Analytics Syllabus - 22UAI603C - 204 - 2025

The document outlines the syllabus for a Big Data Analytics course at Basaveshwar Engineering College, covering topics such as types of digital data, Big Data technologies like NoSQL and Hadoop, and tools like MongoDB, Hive, and Pig. It includes course outcomes that emphasize analyzing digital data characteristics, challenges in Big Data analytics, and applying various tools for data processing. Additional resources and textbooks are provided for further learning.

Uploaded by

alpha.gamer.661
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
336 views2 pages

Big Data Analytics Syllabus - 22UAI603C - 204 - 2025

The document outlines the syllabus for a Big Data Analytics course at Basaveshwar Engineering College, covering topics such as types of digital data, Big Data technologies like NoSQL and Hadoop, and tools like MongoDB, Hive, and Pig. It includes course outcomes that emphasize analyzing digital data characteristics, challenges in Big Data analytics, and applying various tools for data processing. Additional resources and textbooks are provided for further learning.

Uploaded by

alpha.gamer.661
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

BASAVESHWAR ENGINEERING COLLEGE, BAGALKOTE- 587 103

DEPARTMENT OF ARTIFICIAL INTELLIGENCE &


MACHINE LEARNING

B. V. V. S’s

22UAI603C 03-Credits
Big Data Analytics
Hrs/Week: 03 L:T:P:3:0:0 CIE Marks:50
Total Hours:40 SEE Marks:50

UNIT - I 10 Hrs
Types of Digital Data: Classification of Digital Data – Structured Data, Semi-Structured Data, and
Unstructured Data. Introduction to Big Data: Characteristics of Data, Evolution of Big Data,
Definition of Big Data, Challenges with Big Data, definition of Big Data, other Characteristics of
Data Which are not Definitional Traits of Big Data, Need of Big Data, traditional Business
Intelligence (BI) versus Big Data, A Typical Data Warehouse Environment, A Typical Hadoop
Environment, Today’s trend, Evolution in the Realms of Big Data.
Big Data Analytics: Meaning of Big Data Analytics, classification of Analytics, Greatest
Challenges that Prevent Businesses from Capitalizing on Big Data, Top Challenges Facing Big Data,
Importance of Big Data Analytics, Technologies to Meet the Challenges Posed by Big Data, Data
Science, Data Scientist. Terminologies Used in Big Data Environments, Basically Available Soft
State Eventual Consistency (BASE), Few Top Analytics Tools.
UNIT – II 10 Hrs
Big Data Technology Landscape - NoSQL (Not Only SQL) and Hadoop.
NoSQL (Not Only SQL) – use of NoSQL, Types of NoSQL databases, benefits of NoSQL,
Advantages of NoSQL, NoSQL Vendors, SQL Versus NoSQL, NewSQL, Comparison of SQL,
NoSQL, and NewSQL.
Hadoop: Features of Hadoop, Key advantages of Hadoop, Versions of Hadoop - Hadoop 1.0,
Hadoop 2.0, Overview of Hadoop Ecosystems, Hadoop Versus, SQL, Integrated Hadoop systems
offered by leading market vendors, Cloud based Hadoop solutions. Introducing Hadoop, Hadoop vs
RDBMS, RDBMS versus Hadoop, Distributed Computing Challenges, History of Hadoop, Hadoop
Overview, Use Case of Hadoop, Hadoop Distributors, HDFS (Hadoop Distributed File System),
Processing Data with Hadoop, Managing Resources and Applications with Hadoop YARN (Yet
another Resource Negotiator), Interacting with Hadoop Ecosystem.
UNIT - III 10 Hrs
Introduction to MongoDB: definition of MongoDB, benefits of MongoDB, Terms Used in
RDBMS and MongoDB, Data Types in MongoDB, MongoDB Query Language.- Insert, Save,
Update, Remove, find methods, Dealing with NULL values, Count, Limit, Sort and Skip Methods
Introduction to Cassandra: An Introduction, Features of Cassandra, CQL Data types, CQLSH,
Keyspaces, CRUD (Create, Read, Update and Delete) Operations, Collections.
UNIT - IV 10 Hrs
Hive: Hive Architecture, Hive Data Types, Hive File Formats, Hive Query Language (HQL),
RCFile Implementation, SerDe, and User-defined Function (UDF).
Introduction to Pig: The Anatomy of Pig, Pig on Hadoop, Pig Philosophy. Use Case for Pig: ETL
Processing, Pig Latin Overview, Data Types in Pig, Running Pig, Execution Modes of Pig,
Relational Operators, Eval Function,Complex Data Types.
BASAVESHWAR ENGINEERING COLLEGE, BAGALKOTE- 587 103
DEPARTMENT OF ARTIFICIAL INTELLIGENCE &
MACHINE LEARNING

B. V. V. S’s

Text Books:
1. Seema. Acharya and Subhashini. C, “Big Data and Analytics”, 1st Edition, Wiley India, 2015 (Chapters
1,2,3,4,5,6,7,9,10).
Reference books:
1. Bart. Baesens, “Analytics in a Big Data World: The Essential Guide to Data Science and its
Applications”, 1 st Edition, Wiley, 2014.
2. DT Editorial Services, “Big Data: Black Book, Comprehensive Problem Solver”, 1 st
Edition, Dreamtech Press, 2016.
3. Tom. White, “Hadoop – The Definitive Guide”, 3rd Edition, O’Reilly, 2012.
4. Alex Holmes, “Hadoop in Practice”, 2nd Edition, Dreamtech Press India Pvt. Ltd, 2014.
5. Dayong. Du, “Apache Hive Essentials”, 2 nd Edition, Packt Publishing Limited, 2018.
6. Alan. Gates, “Programming Pig”, 2nd Edition, Shroff/O’Reilly, 2016.
7. Alan. Gates, “Programming Pig: Dataflow Scripting with Hadoop”, 2 nd Edition,
Shroff/O’Reilly, 2016.
Online Resources:
1. https://www.guru99.com/machine-learning-tutorial.htm
2. https://www.tutorialspoint.com/machine_learning_with_python/index.htm
3. https://www.geeksforgeeks.org/machine-learning/
4. http://archive.ics.uci.edu/ml/index.php (Popular dataset resource for ML beginners)
5. https://www.simplilearn.com/tutorials/mongodb-tutorial
Course Outcomes: After completing the course the student will be able to:

CO1: Analyze the characteristics of digital data and it's challenges in Big data environment.
CO2: Analyze the challenges of big data analytics and its terminologies that prevent businesses from
capitalizing.
CO3: Build meaningful conversations on Big Data and analytics using Hadoop.
CO4: Identify suitable types of NoSQL databases to solve complex engineering problems.
CO5: Apply Hive and Pig tools on structured data for processing and analyzing

You might also like