Big Data Technologies Course Outline

Uploaded by Mutomba Tichaona

Course Description: This course provides final-year Computer Science students with a comprehensive understanding of the technologies and techniques used to manage and analyze massive datasets. Students will explore the challenges and opportunities presented by big data and gain practical experience working with key tools and frameworks.

Prerequisites:

• Strong programming skills in at least one language (Python, Java, or Scala preferred)
• Understanding of data structures and algorithms
• Basic knowledge of databases

Course Objectives: Upon successful completion of the course, students will be able to:

• Explain the characteristics and challenges of big data.
• Describe the key technologies and architectures used in big data processing and storage.
• Design and implement solutions to process and analyze large datasets using appropriate tools and techniques.
• Understand the ethical and societal implications of big data.

Course Outline:

1. Introduction to Big Data (Week 1)

• Definition of big data: volume, velocity, variety
• Challenges and opportunities of big data
• Big data use cases across various domains
• Overview of big data technologies and ecosystem

2. Big Data Storage (Week 2)

• Distributed file systems: HDFS, Ceph, GlusterFS
• NoSQL databases: HBase, Cassandra, MongoDB
• Data warehousing and data lakes
• Cloud storage solutions: Amazon S3, Google Cloud Storage, Azure Blob Storage

3. Big Data Processing (Week 2)

• MapReduce paradigm and Hadoop ecosystem
• Spark: core concepts, RDDs, DataFrames, Spark SQL
• Stream processing: Apache Kafka, Apache Flink
• Real-time analytics

4. Big Data Analytics and Machine Learning (Week 3)

• Data cleaning and preparation
• Exploratory Data Analysis (EDA)
• Machine learning algorithms for big data
• Model building and evaluation

5. Big Data Visualization (Week 3)

• Techniques for visualizing large datasets
• Tools for big data visualization: Tableau, Power BI, D3.js

6. Big Data Ethics and Security (Week 4)

• Privacy concerns and data anonymization
• Data security and access control
• Ethical considerations in big data applications

Assessment:

• Assignments (20%) - Practical implementation of big data technologies and analysis
• Midterm Exam (20%) - Theoretical understanding of big data concepts
• Final Exam (60%) - Design and implementation of a big data solution to a real-world problem

Recommended Textbooks:

• "Big Data: Principles and Best Practices of Scalable Realtime Data Systems" by Nathan Marz and James Warren
• "Hadoop: The Definitive Guide" by Tom White
• "Learning Spark: Lightning-Fast Big Data Analysis" by Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia

Additional Resources:

• Online tutorials and documentation for Hadoop, Spark, Kafka, and other big data technologies
• Research papers and articles on the latest advances in big data
• Open-source big data projects and datasets
